
GitHub’s AI Copilot Might Get You Sued If You Use It
Some are even abandoning GitHub because of it
GitHub just announced its latest, shiny product: an artificial intelligence (AI) called Copilot. It’s a machine learning-powered software that can write code by itself, generating quite impressive programming functions. Yet, it has people pulling out of GitHub and worrying about lawsuits.
The AI works similarly to other OpenAI-powered code-generating tools. The user writes a comment describing what they want the AI to write, and the AI makes it happen. What makes Copilot unique is that it also takes initiatives on its own, suggesting autocompletions on the fly.
It sounds really cool, doesn’t it? If you know me, you know that I’m often excited about artificial intelligence; I even published a book wherein I described technologies similar to Copilot. But there are plenty of issues surrounding machine learning, and GitHub is experiencing these dilemmas already on day one. Usually, the source of the drama for any machine learning application lies in its data, and the outcry surrounding Copilot follows that rule. More specifically, in the case of Copilot, the problem lies in how GitHub went about gathering the data to build the algorithm.
“Unfortunately, the user has no way of knowing if the algorithm made a particular piece of code up by itself or stole it from a code repository protected by a license.”
Like any machine learning algorithm, Copilot has learned how to do its thing (write code) by being fed data of things (code) that work. According to GitHub itself, this AI has been trained on billions of lines of code from GitHub repositories. And so, when Copilot writes code for a user, it draws from those billions of lines.
Unfortunately, the user has no way of knowing if the algorithm made a particular piece of code up by itself or stole it from a code repository protected by a license.
Oh, and when I say stole, I mean it.
One software engineer posted a picture on Twitter, which shows a piece of code generated by Copilot when it was asked to write an “About me”-page. Comically, the code is ripped straight from the page of a real person.
