The New York Times blocks OpenAI’s web crawler

L4sBot · 2 years ago

Bappity · 2 years ago

as if a text file is going to stop them

@SuckMyWang@lemmy.world · 2 years ago

deleted by creator

SerotoninSwells · 2 years ago

NYT also uses a third party bot identification and mitigation service.

@Treczoks@lemmy.world · 2 years ago

The question is: Does that crawler adhere to robot.txt policies?

@poke@sh.itjust.works · 2 years ago

They made a flag specifically for their crawler, so they can say that they do but in the most annoying way possible.

AutoTL;DR · edit-2 2 years ago

removed by mod

@kucuva@lemmy.ml · 2 years ago

what is the ai being trained for anyways, how to be a NYT journalist?