Apple confirms that Apple Intelligence has not been trained with YouTube videos
Only public and licensed content

- July 18, 2024
- Updated: July 1, 2025 at 11:10 PM

With the presentation of Apple Intelligence during the last WWDC 2024, Apple claimed that Apple Intelligence models are trained with licensed data, including data selected to improve specific features, as well as publicly available data collected by your web crawler. This claim was questioned earlier this week, and now Apple is providing a defense.
OpenELM is not part of Apple Intelligence, it’s just an open-source product for research
A few days ago, an investigation revealed that certain companies allegedly used YouTube subtitles to train their artificial-intelligence models. Not directly, but through a dataset compiled by a non-profit organization called EleutherAI. With or without knowledge of the fact, the report concluded that, therefore, Anthropic, Nvidia, SalesForce, Apple, and others had used the content of over 170,000 videos by popular creators such as MKBHD and Mr. Beast, generating considerable controversy regarding the ethics and legality of their methods. However, Apple has made an important clarification regarding the use of this data.
Apple confirmed to 9to5Mac that its OpenELM model, although trained with this data, is not used to power any of the features of its artificial-intelligence suite, known as Apple Intelligence. According to the company, OpenELM was created solely for research purposes, to contribute to the scientific community and the development of open-source language models.
The OpenELM model was published as open-source and is still widely available, including on Apple’s Machine Learning research website. This allows researchers from around the world to access and use this model in their own research projects.
Alongside this statement, Apple confirmed that they have no plans to develop new versions of the OpenELM model. As far as we know, this model has already fulfilled its purpose and will become less relevant as the rest of Apple Intelligence products evolve without it.
The whole event is undoubtedly a reminder of the complexity and ethical challenges that companies face in the era of big data and AI. The reliability and performance of an artificial intelligence depend to a large extent on the dataset used for its training. A careful and measured selection, as we see Apple doing, is definitely the way to go to achieve product efficiency and quality.
Architect | Founder of hanaringo.com | Apple Technologies Trainer | Writer at Softonic and iDoo_tech, formerly at Applesfera
Latest from David Bernal Raspall
- Generative AI vs Generative Design: What Each Technology Can (and Can’t) Do in Architecture
- The Software Behind the Success of Zootopia 2: How Autodesk Maya and Flow Production Tracking Brought It to Life
- The Web You Remember Is Waiting Here: Share Your Memory and Win a Trip to Switzerland
- Wonder 3D in Autodesk Flow Studio levels up the game: check out the AI that models anything in seconds
You may also like
NewsWhat comes after 'Super Mario Galaxy'? Five possible projects about the future of Nintendo in film
Read more
- News
Everyone thinks that 'Euphoria' is an original series, but in reality, it is just a remake
Read more
NewsThe new Fable has specialists making the cutscenes: Blizzard Entertainment
Read more
NewsThe mayor of New York, Zohran Mamdani, has explained US policy in a way that everyone can understand: with Mario Kart
Read more
NewsEven a dachshund can play LOL, as long as it's with a magical cat
Read more
NewsThe creators of Stranger Things are leaving Netflix to head to its most direct competitor
Read more