How to block AI Crawler Bots using robots.txt file

Cynicus Rex@lemmy.ml · 1 year ago

How to block AI Crawler Bots using robots.txt file

Daemon Silverstein@thelemmy.club · edit-2 7 months ago

deleted by creator

IphtashuFitz@lemmy.world · 1 year ago

Oh there are definitely ways to circumvent many bot protections if you really want to work at it. Like a lot of web protection tools/systems, it’s largely about frustrating the attacker to the point that they give up and move on.

Having said that, I know Akamai can detect at least some instances where browsers are controlled as you suggested. My employer (which is an Akamai customer and why I know a bit about all this) uses tools from a company called Saucelabs for some automated testing. My understanding is that our QA teams can create tests that launch Chrome (or other browsers) and script their behavior to log into our website, navigate around, test different functionality, etc. I know that Akamai can recognize this traffic as potentially malicious because we have to configure the Akamai WAF to explicitly allow this traffic to our sites. I believe Akamai classifies this traffic as a “headless” Chrome impersonator bot.

How to block AI Crawler Bots using robots.txt file

How to block AI Crawler Bots using robots.txt file

Just a moment...