• rtxn@lemmy.world
    link
    fedilink
    English
    arrow-up
    91
    arrow-down
    1
    ·
    edit-2
    9 months ago

    I would be shocked if any big corpo actually gave a shit about it, AI or no AI.

    if exists("/robots.txt"):
        no it fucking doesn't
    
    • bionicjoey@lemmy.ca
      link
      fedilink
      English
      arrow-up
      48
      ·
      9 months ago

      Robots.txt is in theory meant to be there so that web crawlers don’t waste their time traversing a website in an inefficient way. It’s there to help, not hinder them. There is a social contract being broken here and in the long term it will have a negative impact on the web.

    • DingoBilly@lemmy.world
      link
      fedilink
      English
      arrow-up
      2
      ·
      9 months ago

      Yeah I always found it surprising that everyone just agreed to follow a text file on a website on how to act. It’s one of the worst thought out/significant issues with browsing still out there from the beginning pretty much.