GitHub Copilot investigation
2022-10-18 18:33:08.613566+02 by Dan Lyke 2 comments
This is fantastic: GitHub Copilot investigation:
If Microsoft and OpenAI chose to use these repos subject to their respective open-source licenses, Microsoft and OpenAI would’ve needed to publish a lot of attributions, because this is a minimal requirement of pretty much every open-source license. Yet no attributions are apparent.
Therefore, Microsoft and OpenAI must be relying on a fair-use argument. In fact we know this is so, because former GitHub CEO Nat Friedman claimed during the Copilot technical preview that “training [machine-learning] systems on public data is fair use”.
But a hell of a lot of that code is GPL- or LGPL-licensed, and one can apparently recreate it with the right prompts...