OpenAI transcribed Google’s YouTube videos to train AI models: Report

OpenAI reportedly transcribed more than one million hours of YouTube videos to collect training data for its advanced GPT-4 model, disregarding the Google-owned platform’s copyright rules. According to a report by The New York Times, Microsoft-backed OpenAI used Whisper, a speech recognition tool it developed in-house, to transcribe audio from YouTube videos into conversational text, which was then used to train the AI model that powers ChatGPT.

According to the report, the makers of ChatGPT discussed internally how using YouTube data for training might violate the platform’s policies.
