Episode #490 from 2:50:21
Robotics
I think we should say that there's a lot of exciting stuff going on in the AI space. My mind has recently been really focused on robotics, so today we almost entirely didn't talk about robotics. There's a lot of stuff on image generation and video generation. I think it's fair to say that the most exciting research work in terms of intensity and fervor is in the LLM space, which is why I think it's justified for us to focus on the LLMs we're discussing. But it'd be nice to bring in certain things that might be useful. For example, world models—there's growing excitement about that. Do you think there will be any use in this coming year for world models in the LLM space? Also with LLMs, what's an interesting thing here is I think if we unlock more LLM capabilities, it also automatically unlocks all the other fields because it makes progress faster. Because, you know, a lot of researchers and engineers use LLMs for coding. So even if they work on robotics, if you optimize these LLMs that help with coding, it pays off. But then yes, world models are interesting. It's basically where you have the model run a simulation of the world—like a little toy version of the real thing—which can unlock capabilities like data the LLM is not aware of. It can simulate things. I think LLMs happen to work well by pre-training and doing next-token prediction, but we could do this in a more sophisticated way.
Why this moment matters
I think we should say that there's a lot of exciting stuff going on in the AI space. My mind has recently been really focused on robotics, so today we almost entirely didn't talk about robotics. There's a lot of stuff on image generation and video generation. I think it's fair to say that the most exciting research work in terms of intensity and fervor is in the LLM space, which is why I think it's justified for us to focus on the LLMs we're discussing. But it'd be nice to bring in certain things that might be useful. For example, world models—there's growing excitement about that. Do you think there will be any use in this coming year for world models in the LLM space? Also with LLMs, what's an interesting thing here is I think if we unlock more LLM capabilities, it also automatically unlocks all the other fields because it makes progress faster. Because, you know, a lot of researchers and engineers use LLMs for coding. So even if they work on robotics, if you optimize these LLMs that help with coding, it pays off. But then yes, world models are interesting. It's basically where you have the model run a simulation of the world—like a little toy version of the real thing—which can unlock capabilities like data the LLM is not aware of. It can simulate things. I think LLMs happen to work well by pre-training and doing next-token prediction, but we could do this in a more sophisticated way.