NYT looks like it’s updated it’s robots.txt file to disallow the Open AI bot from scraping it’s data. Pretty interested to see if they just update their user agent string or if they’ll respect it

  • plz1
    link
    fedilink
    English
    362 years ago

    Updating user agent doesn’t natter unless NYT is actively blocking that, too. Updating robots.txt is purely a “gentleman’s agreement” that OpenAI will respect it. OpenAI would be dumb to ignore it, hat all said, because it’d trigger the lawyer shenanigans to ensue.

      • @WarmSoda@lemm.ee
        link
        fedilink
        English
        42 years ago

        I wonder how much of a boost sites get from Reddit and lemmy, etc. Even with posts that have the text copy/pasted I imagine it has to give them traffic.