OpenAI used data from Google’s YouTube to train its AI models: Report
Sam Altman-run OpenAI, which is now backed by Microsoft, reportedly trained its artificial intelligence (AI) models on Google-owned YouTube by scrapping its data.
According to a report in The Information, OpenAI “has secretly used data from the site (YouTube) to train some of its artificial intelligence models”.
YouTube is the single biggest and richest source of imagery, audio and text transcripts on the web.
While Google researchers have been using YouTube to develop its next large-language model called Gemini, “the value of YouTube hasn’t been lost on OpenAI, either”.