The New York Times prohibits using its content to train AI models

By Reggie Thompson August 16, 2023 3 mins read 254 Views

The New York Times has taken preemptive measures to stop its content from being used to train artificial intelligence models. As reported by Adweek, the NYT updated its Terms of Service on August 3rd to prohibit its content — inclusive of text, photographs, images, audio/video clips, “look and feel,” metadata, or compilations — from being used in the development of “any software program, including, but not limited to, training a machine learning or artificial intelligence (AI) system.”

The updated terms now also specify that automated tools like website crawlers designed to use, access, or collect such content cannot be used without written permission from the publication. The NYT says that refusing to comply with these new restrictions could result in unspecified fines or penalties. Despite introducing the new rules to its policy, the publication doesn’t appear to have made any changes to its robots.txt — the file that informs search engine crawlers which URLs can be accessed.

The move could be in response to a recent update to Google’s privacy policy that discloses the search giant may collect public data from the web to train its various AI services, such as Bard or Cloud AI. Many large language models powering popular AI services like OpenAI’s ChatGPT are trained on vast datasets that could contain copyrighted or otherwise protected materials scraped from the web without the original creator’s permission.

That said, the NYT also signed a $100 million deal with Google back in February that allows the search giant to feature Times content across some of its platforms over the next three years. The publication said that both companies will work together on tools for content distribution, subscriptions, marketing, ads, and “experimentation,” so it’s possible that the changes to the NYT terms of service are directed at other companies like OpenAI or Microsoft. Semafor reported on Sunday that the Times had dropped out of a media coalition attempting to jointly negotiate with tech companies over AI training data — which means if it does strike deals with companies, it could be more likely on a case-by-case basis.

OpenAI recently announced that website operators can now block its GPTBot web crawler from scraping their websites. Microsoft also added some new restrictions to its own T&Cs that ban people from using its AI products to “create, train, or improve (directly or indirectly) any other AI service,” alongside banning users from scraping or otherwise extracting data from its AI tools.

Earlier this month, several news organizations including The Associated Press and the European Publishers’ Council signed an open letter calling for global lawmakers to usher in rules that would require transparency into training datasets and consent of rights holders before using data for training.

Update 10:15AM ET: Added report on the Times dropping out of a media coalition for negotiating over AI data use.

------------
Read More
By: Jess Weatherbed
Title: The New York Times prohibits using its content to train AI models
Sourced From: www.theverge.com/2023/8/14/23831109/the-new-york-times-ai-web-scraping-rules-terms-of-service
Published Date: Mon, 14 Aug 2023 11:26:27 +0000

Did you miss our previous article...
https://trendinginbusiness.business/technology/watchos-10-beta-6-now-available-to-developers-with-namedrop-references

New York Times AI models

The Download: America’s gun crisis, and how AI video models work

July 29, 2026 1 Views

A Cobbled-Together 1940s Cottage Inspired This Breezy Coastal Home in the U.K.

July 29, 2026 0 Views

Beefeater to close all 106 restaurants on 10 September

July 29, 2026 0 Views

iOS 26 is coming today: These are the top 7 features you need to try first

July 29, 2026 2 Views

In Mexico, Three Generations of Family Set Down Roots With a Wooded Compound

July 29, 2026 2 Views

Bezos names Amazon chip arm as company’s next pillar

July 29, 2026 2 Views

The Download: America’s gun crisis, and how AI video models work

A Cobbled-Together 1940s Cottage Inspired This Breezy Coastal Home in the U.K.

Beefeater to close all 106 restaurants on 10 September

iOS 26 is coming today: These are the top 7 features you need to try first

In Mexico, Three Generations of Family Set Down Roots With a Wooded Compound

The New York Times prohibits using its content to train AI models

Latest Posts

The Download: America’s gun crisis, and how AI video models work

A Cobbled-Together 1940s Cottage Inspired This Breezy Coastal Home in the U.K.

Beefeater to close all 106 restaurants on 10 September

iOS 26 is coming today: These are the top 7 features you need to try first

In Mexico, Three Generations of Family Set Down Roots With a Wooded Compound

Bezos names Amazon chip arm as company’s next pillar

Categories

Trending Posts

Late payments will limit £2bn lending expansion, credit firm says

Open letter to FHFA, HUD, CFPB and Congress

Latest Posts

Top 10 Business Intelligence Trends

Social License to Operate

Top Business Trends That Will Shape The World

Most Shared

4 Multifamily Real Estate Trends For 2022, With Scott Hawksworth

4 Ways to Promote Art Businesses Using Social Media Technology

8 Steps To An Effective Social Media Strategy

Popular Tags

Newsletter

The New York Times prohibits using its content to train AI models

Share This

Latest Posts

The Download: America’s gun crisis, and how AI video models work

A Cobbled-Together 1940s Cottage Inspired This Breezy Coastal Home in the U.K.

Beefeater to close all 106 restaurants on 10 September

iOS 26 is coming today: These are the top 7 features you need to try first

In Mexico, Three Generations of Family Set Down Roots With a Wooded Compound

Bezos names Amazon chip arm as company’s next pillar

Categories

Trending Posts