OpenAI’s AI Data Trainers to Use the Financial Times Archives

ai data trainer, openai, financial times, ai, data, archived content

OpenAI’s new deal with the Financial Times allows its AI data trainers to use archived content for the AI models.

  • OpenAI will integrate FT content into ChatGPT’s responses to user queries, offering summarized content directly within the conversational interface.
  • ChatGPT will also provide links back to FT articles for transparency and attribution.

OpenAI has signed a deal with the Financial Times (FT) so that its developers can use archived FT articles to train artificial intelligence (AI) models.

Under the terms of the agreement, FT will provide OpenAI access to its extensive archive of content. OpenAI’s AI data trainers will then use this invaluable repository for the company’s AI models, especially ChatGPT. This product is already quite remarkable in simulating human conversation and generating text very close to human writing. However, now, it will be enhanced with insights courtesy of FT.

In return, the AI developers will integrate FT content into ChatGPT’s responses to user queries. This feature enables ChatGPT’s vast user base to access summarized content from the Financial Times directly within the conversational interface. The responses will also provide users with links back to FT articles, ensuring transparency and attribution to the original source material.

John Ridding, CEO of FT Group, finds the collaboration only logical, “It’s right, of course, that AI platforms pay publishers for the use of their material. OpenAI understands the importance of transparency, attribution, and compensation — all essential for us.”

This deal is the 5th of its kind that OpenAI has penned with major publications. Over the past year, OpenAI has had similar agreements with the US-based Associated Press, Germany’s Axel Springer, France’s Le Monde, and Spain’s Prisa Media. However, financial terms were never disclosed.

In the grand scheme of things, licensing deals are the way to go. They provide a clear legal framework for using copyrighted material, eliminating ambiguity and reducing the risk of copyright infringement lawsuits. The copyright holder also maintains some control over their work, and the AI developer has assurances about their ability to use the data. In addition, these agreements ensure that the AI data trainers are using high-quality material that might not be otherwise freely available for the AI models.

However, they are not the solution, but rather a band-aid to the main problem here: the use of copyrighted material in AI training. Without appropriate laws that would protect a copyright holder’s rights, it falls on the AI companies to preserve these rights. If we expect to have accurate AI models that can cite their credible sources properly and dispel misinformation, we need copyrighted material. But we need to preserve the rights of those who spilled everything into the pieces of writing and art.


Inside Telecom provides you with an extensive list of content covering all aspects of the tech industry. Keep an eye on our Intelligent Tech sections to stay informed and up-to-date with our daily articles.