• DaGeek247@fedia.io
    link
    fedilink
    arrow-up
    2
    ·
    12 hours ago

    Just think of your point that they are using residential IP addresses. How do they get these addresses?

    You can ping all of the ipv4 addresses in under an hour. If all you’re looking for is publicly available words written by people, you only have to poke port 80 and then suddenly you have practically every possible small self-hosted website out there.

    • dudeami0@lemmy.dudeami.win
      link
      fedilink
      English
      arrow-up
      1
      ·
      edit-2
      10 hours ago

      When I say residential IP addresses, I mostly mean proxies using residential IPs, which allow scrappers to mask themselves as organic traffic.

      Edit: Your point stands on there are a lot of services without these protections in place, but a lot of services are protective against scrapping.