This new report appears to reaffirm the standards for the copyrightability of AI-generated works that the Copyright Office set out in its guidance two years ago. Because the Office has concluded that no new legislation is necessary, we can probably assume that the criteria for copyrightability of AI-generated works in the United States are now largely settled.
Interestingly, there was a lawsuit in China last year concerning the copyrightability of images generated with Stable Diffusion, and the court recognized copyright in images created simply by entering prompts.
According to the U.S. Copyright Office’s position, no matter how complex the prompt is, the human cannot exercise creative control over the final output; the traditional creative and expressive elements are determined by the AI rather than by the human. Such works are therefore not considered the product of human creativity and are not copyrightable. In the Chinese lawsuit, by contrast, the court held that the human’s selection, arrangement, and individual judgment were embodied in the AI-generated work through the complex prompt input and other actions. The court therefore deemed the AI-generated work a human creation and thus copyrightable. (Note that Japan still has no case law on this matter. However, based on the Agency for Cultural Affairs’ guidance, it appears that the requirement might be satisfied if the prompts contain sufficient human creativity, or if there is creativity and ingenuity in repeated attempts or in combining multiple AI-generated outputs. This could be seen as an approach somewhere between the U.S. and Chinese positions.)
When discussing Open Source AI, the place where this divergence between jurisdictions on the copyrightability of AI-generated works likely matters most is the treatment of synthetic data, don’t you think?
In the United States, I believe there is very little scope for copyright to arise in synthetic data. In China, however, the same process used to create synthetic data could conceivably result in copyright being recognized.