Episode #385 from 40:48
ChatGPT
If we could just talk about that before we jump back to some other interesting topics in Wikipedia. Let's talk about GPT-4 and large language models. So they are in part trained on Wikipedia content. What are the pros and cons of these language models? What are your thoughts?
Yeah, so I mean, there's a lot of stuff going on. Obviously the technology has moved very quickly in the last six months and looks poised to do so for some time to come. So first things first, part of our philosophy is the open licensing, the free licensing, the idea that this is what we're here for. We are a volunteer community and we write this encyclopedia. We give it to the world to do what you like with, you can modify it, redistribute it, redistribute modified versions, commercially, non-commercially. This is the licensing. So in that sense, of course it's completely fine. Now, we do worry a bit about attribution because it is a Creative Commons Attribution Share-Alike License. So attribution is important, not just because of our licensing model and things like that, but it's just proper attribution is just good intellectual practice.