Episode #431 from 57:57

AI deception

But see, I'm very concerned about system being used to control the masses. But in that case, the developers know about the kind of control that's happening. You're more concerned about the next stage where even the developers don't know about the deception. Correct. I don't think developers know everything about what they are creating. They have lots of great knowledge, we're making progress on explaining parts of a network. We can understand, "Okay, this note get excited, then this input is presented, this cluster of notes." But we're nowhere near close to understanding the full picture, and I think it's impossible. You need to be able to survey an explanation. The size of those models prevents a single human from absorbing all this information, even if provided by the system. So either we're getting model as an explanation for what's happening and that's not comprehensible to us or we're getting compressed explanation, [inaudible 00:59:01] compression, where here, "Top 10 reasons you got fired." It's something, but it's not a full picture.

Why this moment matters

But see, I'm very concerned about system being used to control the masses. But in that case, the developers know about the kind of control that's happening. You're more concerned about the next stage where even the developers don't know about the deception. Correct. I don't think developers know everything about what they are creating. They have lots of great knowledge, we're making progress on explaining parts of a network. We can understand, "Okay, this note get excited, then this input is presented, this cluster of notes." But we're nowhere near close to understanding the full picture, and I think it's impossible. You need to be able to survey an explanation. The size of those models prevents a single human from absorbing all this information, even if provided by the system. So either we're getting model as an explanation for what's happening and that's not comprehensible to us or we're getting compressed explanation, [inaudible 00:59:01] compression, where here, "Top 10 reasons you got fired." It's something, but it's not a full picture.

Starts at 57:57
People and topics
All moments
AI deception chapter timestamp | Roman Yampolskiy: Dangers of Superintelligent AI | EpisodeIndex