Wikipedia is licensed under Creative Commons and is thus available for commercial usage, meaning that users are relatively free to share and edit the text so long as it is attributed correctly.
No, you've got the license terms quite wrong. It's CC-BY-SA, not CC-BY, and that makes a world of difference. Let's correct for that and see where we are.
But how does this work with AI training? It seems that AI could freely use the information to educate itself, but in folding that knowledge into the commercial product that AI will increasingly become, it is not clear how companies can properly attribute Wikipedia’s contributions when the information is integrated into the Digital God’s mind without clear delineation? If it wanted to, Wikipedia could absolutely change their license and prohibit AI usage, but this seems counter to their open-source mission. Some people worry, furthermore, that such free training would “allow[] companies like OpenAI? to exploit the open web to create closed commercial datasets for their models.” This concern is less about copyright than it is the integrity of such knowledge, driven by the fear that AI interpretation upon AI interpretation will warp the information displayed on sites like Wikipedia. |