Episode #431 from 1:04:30

Verification

But a lot of the conversation I'm having with you now is also kind of wondering almost at a technical level, how can AI escape control? What would that system look like? Because it, to me, is terrifying and fascinating. And also fascinating to me is maybe the optimistic notion it's possible to engineer systems that defend against that. One of the things you write a lot about in your book is verifiers. So, not humans. Humans are also verifiers. But software systems that look at AI systems, and help you understand, "This thing is getting real weird." Help you analyze those systems. So maybe this is a good time to talk about verification. What is this beautiful notion of verification? My claim is, again, that there are very strong limits in what we can and cannot verify. A lot of times when you post something on social media, people go, "Oh, I need citation to a peer reviewed article." But what is a peer reviewed article? You found two people in a world of hundreds of thousands of scientists who said, "Ah, whatever, publish it. I don't care." That's the verifier of that process. When people say, "Oh, it's formally verified software or mathematical proof," we accept something close to 100% chance of it being free of all problems. But you actually look at research, software is full of bugs, old mathematical theorems, which have been proven for hundreds of years have been discovered to contain bugs, on top of which we generate new proofs and now we have to redo all that.

Why this moment matters

But a lot of the conversation I'm having with you now is also kind of wondering almost at a technical level, how can AI escape control? What would that system look like? Because it, to me, is terrifying and fascinating. And also fascinating to me is maybe the optimistic notion it's possible to engineer systems that defend against that. One of the things you write a lot about in your book is verifiers. So, not humans. Humans are also verifiers. But software systems that look at AI systems, and help you understand, "This thing is getting real weird." Help you analyze those systems. So maybe this is a good time to talk about verification. What is this beautiful notion of verification? My claim is, again, that there are very strong limits in what we can and cannot verify. A lot of times when you post something on social media, people go, "Oh, I need citation to a peer reviewed article." But what is a peer reviewed article? You found two people in a world of hundreds of thousands of scientists who said, "Ah, whatever, publish it. I don't care." That's the verifier of that process. When people say, "Oh, it's formally verified software or mathematical proof," we accept something close to 100% chance of it being free of all problems. But you actually look at research, software is full of bugs, old mathematical theorems, which have been proven for hundreds of years have been discovered to contain bugs, on top of which we generate new proofs and now we have to redo all that.

Starts at 1:04:30
People and topics
All moments
Verification chapter timestamp | Roman Yampolskiy: Dangers of Superintelligent AI | EpisodeIndex