OpenAI: Copyright Case Over AI Training Thrown Out

6 days ago

In many areas, we are watching history in the making. Technology is changing things at a rapid pace. This means the future is going to be largely determined by what is taking place today.

Few dispute the impact that generative AI is already making. The question remains as to what the eventual impact will be. Areas such as jobs, safety, and the economy are going to be affected. At this time, we have no idea exactly how that will unfold.

OpenAI and its CEO, Sam Altman, are at the center of this. The company made a lot of headlines through the lawsuits filed against it. Most of these came from news organizations asserting copyright infringement.

Naturally, the rulings in these cases are going to set the precedent for the next couple of decades. Many speculate that Big Tech is violating existing laws.

We now have the first indication of where things could be heading.

OpenAI Copyright Case Thrown Out

Before diving into this discussion, I will state that I am not a lawyer, so my opinion here is not based upon legal training. At the same time, laws within the United States vary among the states and, of course, there is a difference between countries.

Nevertheless, a case in New York federal court was tossed by the judge who determined the plaintiffs failed to show harm.

Judge Colleen McMahon dismissed the case after finding that the plaintiffs failed to show concrete harm from OpenAI's use of their content as training data. Unlike other lawsuits targeting AI companies, this case focused on the removal of copyright management information rather than direct copyright violations—though Judge McMahon noted the underlying issue remained the same.

Courts dealing with future cases will have to decide if the same applies to outright copyright violations.

Dismissing the case over a failure to show material harm is likely to set a precedent. The claim we are seeing in these suits is that OpenAI is creating a competing product.

The judge's decision supported the fair use defense of OpenAI and other AI companies, noting that ChatGPT creates synthesized responses from its training rather than copying content directly. She emphasized that the likelihood of ChatGPT reproducing exact copies of articles is minimal, and pointed out that factual information in articles isn't copyrighted anyway.

This is where things could get hairy for those filing based upon copyright violations.

Inspiration

The basic mechanism of these models is not input/output. These cases are based upon the idea that these companies are reproducing the plaintiffs' content. That simply is not how the technology operates.

Actually, if that were true, we would not have hallucination problems with these models. When prompted, they would spit out an identical replica of what was input.

Instead, the output from these models is a novel creation. The information generated is like this article: something completely new.

What this means is that training a model on the information is akin to the model being inspired by it.

For example, the works of Stephen King are covered under copyright laws. I cannot take one of his books, photocopy it, place a new cover on it, and sell it as my own. That is obviously illegal.

I can, however, read all of Stephen King's books in detail. If desired, I could study his writing style. Character formation, the establishment of plots, and building of suspense could be aspects that I focus upon.

Perhaps I am successful to the point where I can write Stephen King horror almost as well as he can. In that instance, I am the closest thing to him there is.

There is one problem: I am not Stephen King. Even though I trained myself on his material, nothing I put out will be from Stephen King. He is my inspiration and that is it. Even if my writing style resembles his, to the point where many have difficulty telling the two apart, nothing I do will be his.

In my unprofessional legal view, this would seem to be the roadblock many of these cases are going to encounter.

Even if the AI is trained on articles from the New York Times, it does not spit out an identical copy. There might be a similar writing style to some of the authors, but that is it. We cannot say that output from ChatGPT is a New York Times production.

Of course, there is also the issue where it was trained on a lot of other material. How do you separate the New York Times material from the Washington Post if the model was trained on both?

Information Yearns To Be Free

This is a concept that goes back to the early days of the Internet.

Many of the early cypherpunks (and those who follow that mindset) believe information should be free. The Internet is the world's largest copy machine. Governments have done their best to apply (rewrite) copyright laws to the digital realm.

The results are mixed at best.

It is evident the business model for information changed over the last 40 years. We can see the main way of monetizing information was through advertising. It became about clicks.

While many dispute this model, it is a way to keep information free.

The Internet certainly brought down the cost of information. While there is a lot of nonsense online, we can get most of what we need for zero charge. Social media, albeit a cesspool in many instances, has a lot of information that used to be available only from those who specialized in the delivery of that content (i.e. newspapers and television stations).

Generative AI is taking this to an entirely new level.

As always, the legal system is slow to catch up. At the pace things are moving, governments have no chance of keeping up. These lawsuits will take years to fully resolve.

By then, the world will be a completely different place.

For now, OpenAI won the first round. There are still dozens of rounds in this fight, so it is far from over.

8 comments

@day1001 50

6 days ago

Things are getting more exciting in the US, especially in the technology industries. Excitedly waiting here for the unfolding events.

@zulfrontado 67

6 days ago

The use of AI causes a lot of controversy because of its misuse or ignorance about it. I personally think that AIs are tools that, just like the Internet, are quite useful for performing tasks. Of course, you have to use them wisely and honestly.

@fredaig 66

6 days ago

Things are getting stranger every day. Sometimes I just wonder why the world was really made, with the level of news I am seeing each and every day.

@sinistry 62

5 days ago

I agree with your Stephen King analogy. Back in the 90s a recording artist named Beck hit the scene with a very unique sound and style, with the song Loser. Shortly thereafter, a band named the Butthole Surfers (smh) released Pepper, which sounded identical to Beck’s sound, but wasn’t a copyright violation. In poor taste, maybe, but not illegal. I think AI is a similar situation. It can be inspired at a faster pace than humans, but it’s still fundamentally the same process.

@taskmaster4450le 81

5 days ago

Exactly.

We could do the same thing with painting. Many study the works of a particular artist in great detail and end up mimicking that. That does not mean the artist is suddenly the next Monet.

@sinistry 62

4 days ago

My biggest gripe about AI is people who whine about AI taking their jobs. It's not a new issue, as you've pointed out time and time again, but everyone acts like it is when it impacts their industry. I find it to be in poor taste to whine about the tech that's making (not you personally) your job obsolete while using tech that put someone else out of work, and no one in modern society can honestly claim they aren't guilty of it.

The problem isn't that tech is taking jobs. The problem is that the resources generated by technological advancement aren't distributed in a manner that upholds the promises of a better life for humanity, but are hoarded by a few powerful entities while everyone else gets left behind or swept away.