Episode #490 from 2:28:46

Text diffusion models and other new research directions

Yeah. Okay, let's... we talked a lot about many things, certainly about what was exciting last year. But this year, one of the things you guys mentioned that's exciting is the scaling of text diffusion models and just a different exploration of text diffusion. Can you talk about what that is and what possibilities it holds? So, different kinds of approaches than the current LMs? Yeah, so we talked a lot about the transformer architecture and the autoregressive transformer architecture specifically, like GPT. And it doesn't mean no one else is working on anything else. People are always on the lookout for the next big thing, because I think it would be almost stupid not to. Sure, right now the transformer architecture is the thing and it works best, but it's always a good idea to not put all your eggs into one basket. People are developing alternatives to the autoregressive transformer. One of them would be, for example, text diffusion models.

Why this moment matters

Yeah. Okay, let's... we talked a lot about many things, certainly about what was exciting last year. But this year, one of the things you guys mentioned that's exciting is the scaling of text diffusion models and just a different exploration of text diffusion. Can you talk about what that is and what possibilities it holds? So, different kinds of approaches than the current LMs? Yeah, so we talked a lot about the transformer architecture and the autoregressive transformer architecture specifically, like GPT. And it doesn't mean no one else is working on anything else. People are always on the lookout for the next big thing, because I think it would be almost stupid not to. Sure, right now the transformer architecture is the thing and it works best, but it's always a good idea to not put all your eggs into one basket. People are developing alternatives to the autoregressive transformer. One of them would be, for example, text diffusion models.

Starts at 2:28:46
People and topics
All moments
Text diffusion models and other new research directions chapter timestamp | State of AI in 2026: LLMs, Coding, Scaling Laws, China, Agents, GPUs, AGI | EpisodeIndex