Perspectives on copyright and plagiarism, in the context of Generative Artificial Intelligence

Authors

  • Artur Marques Instituto Politécnico de Santarém, Escola Superior de Gestão e Tecnologia
  • Filipe Madeira Instituto Politécnico de Santarém, Escola Superior de Gestão e Tecnologia; Pólo de Literacia Digital e Inclusão Social do Centro de Investigação em Artes e Comunicação (PLDIS-CIAC) https://orcid.org/0000-0002-2227-7006

DOI:

https://doi.org/10.25746/ruiips.v11.i3.32523

Keywords:

Copyright, Plagiarism, Machine Learning, GANs, Sentiment analysis

Abstract

Although Artificial Intelligence for Content Generation, or Generative AI (GAI), is decades old, it is in the 2020s that it reaches such critical mass, that it is not easy to identify activity sectors immune to its applications and implications.

 

This article presents two perspectives, somewhat contrarian to popular opinion - as measured by social network sentiment analysis - on two problems that gained visibility with the massification of GAI and Machine Learning (ML) systems:

  • The problem of copyright attribution to works based on these systems;
  • ML-based applications for plagiarism control.

 

The current perception seems to be that:

  • GAI-based creations cannot be protected by Copyright, because such protection is only available to humans, as seen in cases such as the photographer David Slater and the self-portrait of a macaque using his equipment, in which the United States Copyright Office recalled that "works by ... non-human entities are not eligible for copyright protection".
  • Plagiarism detection has reached such levels of availability and acceptance, namely in the form of AI-based tools for Learning Management Systems (LMSs), that they are now trusted by many educators.

 

Our perspective is misaligned from these opinions, and can be summed up as follows:

 

  • In fact, there is Copyright protection for any human-attributable work with a minimum of creative effort, so GAI-based tools that assist the process, are legitimate. The mere effort to identify, select and apply them should meet the requirements, with precedent in cases such as “Feist Publications, Inc. vs. Rural Telephone Service Co., Inc. ”, in which protection was granted to content directories, such as phone numbers, where it was considered that the “... selection and arrangement … constitute minimal creative effort ”.
  • Regarding the detection of “plagiarism” by AI tools, in fact the expression commonly refers to the detection of similarities, not to the radical meaning of the term, which is the non-recognition of the work of others. Regarding assessing how much of a work is copied, in particular if some text is the output of an AI system, in the limit this detection is impossible and the available solutions are not reliable.

Almost paradoxically, it is the same GANS (Generative Adversarial Networks) that eventually support the confidence in solutions for this purpose, that also justify the perspective that there is no solution, on the long run. A GAN, is the opposition between a generator and an evaluator or discriminator system. The evaluator has “vast” knowledge of certain works, such as texts or real photographs; the generator begins by creating random outputs, which the discriminator will easily detect as a fake. As the process runs, both parts get better, but from a certain iteration, the generator is expected to produce content indistinguishable of reality. Thus, the very same approach that offers a solution to detection, is a solution to its defeat.

Published

2023-12-31

How to Cite

Marques, A., & Madeira, F. (2023). Perspectives on copyright and plagiarism, in the context of Generative Artificial Intelligence. Revista Da UI_IPSantarém, 11(3), 103–105. https://doi.org/10.25746/ruiips.v11.i3.32523

Most read articles by the same author(s)

1 2 > >>