There are two ways that copyright is related to AI.
As with many aspects of the rapidly advancing field of AI, the impact of copyright law on how AI models are created and how their output can be used is still unsettled.
Confused about copyright? LNDL's Copyright Information Center can help. Schedule a copyright consultation or attend a copyright workshop.
The development of generative AI models, particularly large language models (LLM) like ChatGPT or image creation and recognition AI tools such as DALL-E, Midjourney, and Stable Diffusion are reliant on using copyrighted works as training data. Most AI creators have ingested this data from online collections containing copyrighted works without permission from or compensation to copyright holders. Whether or not the use of this data constitutes a violation of copyright law is currently undetermined. AI creators are claiming their use is considered a fair use under copyright law, while copyright holders are claiming it is an infringement of their rights and have sued the AI creators. It may take years for these court cases to resolve and provide a more definitive answer.
Arguments that use of copyrighted materials in training AI is a fair use:
Arguments that use of copyright materials in training AI is not a fair use:
According to the United States Copyright Office in order for a work to be eligible for copyright, it must have been created by a human. Creating prompts to input into generative AI models does not constitute enough human involvement to qualify the output for copyright, as the person inputting the prompt has very little control over what is actually created. However, a human may use or modify the AI created works in a way that qualifies the final creation for copyright.
Examples
Beyond the rights granted by the United States Copyright Office, the terms of service of the AI tools may dictate what rights someone using their model has over the output. Users should read the terms of service of the particular tool they are using to determine what they are able to do with the output.
Most AI developers are not sharing what data they have used to train their models, so it is difficult to know for sure if your works are being used to train any AI models. However, there are three likely scenarios in which your creations may wind up in AI training data. You may or may not be able to control the use at this point.