Any-Order GPT as Masked Diffusion Model: Decoupling Formulation and Architecture

Published: September 20, 2025