Meta trained AI model on pirated books despite knowing legal troubles

AI models are becoming more sophisticated due to the quality and cache of data they are trained on. However, training models on data, especially the protected one, may have its consequences. Google, Microsoft-backed OpenAI and Facebook parent Meta, at some point in the last year, have been criticised for ‘stealing’ data. Meta, for one, seems to have run into a lot of legal troubles for using copyrighted data to train Llama.

Citing a new filing in a case related to copyright infringement initially brought earlier this year, a report by news agency Reuters says that the company lawyers warned it about the legal perils of using thousands of pirated books to train its AI models, but Meta did it anyway.

Read more

You may also like

More in IT

Comments are closed.