• @daq@lemmy.sdf.org
    English
    14 days ago

    Huh? I can reach my site via curl, which has neither. How did you come up with this random set of requirements?

    • @grysbok@lemmy.sdf.org
      English
      04 days ago

      Odd. I just tried

      curl https://www.scrapingcourse.com/cloudflare-challenge

      and got

      Enable JavaScript and cookies to continue

      I’m clearly not on the same setup as you are, but my off-the-cuff guess is that your curl command was issued from a system that Cloudflare already recognized (IP whitelist, cookies, I dunno).

      Anyways, I’m reading through this blog post on using cURL with Cloudflare-protected sites and I’m finding it interesting.
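
For anyone who wants to tell the two cases in this thread apart programmatically (a normal page versus the challenge page that curl got above), here is a rough sketch. The marker string is copied from the curl output quoted above. The function name `looks_like_challenge`, the 403/503 status check, and the reliance on a `cf-mitigated: challenge` response header are assumptions made for illustration, so treat it as a heuristic and not as Cloudflare's official detection method.

```python
from typing import Mapping

# Marker text taken from the curl output quoted in the thread.
CHALLENGE_MARKER = "Enable JavaScript and cookies to continue"


def looks_like_challenge(status: int, headers: Mapping[str, str], body: str) -> bool:
    """Heuristic (hypothetical helper): does this response look like a challenge page?"""
    # HTTP header names are case-insensitive, so normalise before the lookup.
    lowered = {name.lower(): value.lower() for name, value in headers.items()}

    # Assumption: challenge responses carry a `cf-mitigated: challenge` header.
    if lowered.get("cf-mitigated") == "challenge":
        return True

    # Fallback assumption: an error status plus the marker text seen above.
    return status in (403, 503) and CHALLENGE_MARKER in body
```

Feeding it the status, headers, and body from any HTTP client should be enough to see whether a given request was challenged or served normally, which is the difference between the two curl results discussed here.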

      • @daq@lemmy.sdf.org
        English
        13 days ago

        Of course their challenge requires those things. How else could they implement it? Most users will never be presented with a challenge, though, and it is trivial to disable if you don’t want to ever challenge anyone. I was just saying CF blocks ML crawlers.