Question 1

What does this tool check exactly?

Accepted Answer

For each of the 11 AI crawlers it performs two checks: whether the site's robots.txt file allows or blocks that specific bot, and whether a live request with that bot's User-Agent actually gets through or is blocked by a firewall, WAF, or CAPTCHA challenge.

Question 2

Why might a bot be blocked by firewall but allowed in robots.txt?

Accepted Answer

Many websites use CDN-level firewalls (like Cloudflare, Akamai, or Sucuri) that block requests based on User-Agent strings, IP reputation, or behavioral analysis. These blocks happen at the server level before robots.txt is even relevant. A site owner may not realize their WAF is blocking AI crawlers.

Question 3

Why should I allow AI crawlers on my website?

Accepted Answer

If AI models like ChatGPT, Claude, or Perplexity cannot access your content, they cannot cite or reference your website in their answers. As more users shift from traditional search to AI-powered search, blocking AI crawlers means losing visibility in a growing channel.

Question 4

Which AI crawlers does the tool test?

Accepted Answer

The tool tests 11 AI crawlers: GPTBot and ChatGPT-User (OpenAI), ClaudeBot and Claude-Web (Anthropic), PerplexityBot (Perplexity), Google-Extended (Google), Applebot-Extended (Apple), CCBot (Common Crawl), Bytespider (ByteDance), cohere-ai (Cohere), and Amazonbot (Amazon).

Question 5

What is the difference between Robots only and Firewall verdicts?

Accepted Answer

Robots only means the site's robots.txt blocks the bot, but the server still serves the page if the bot ignores robots.txt (robots.txt is a directive, not enforcement). Firewall means robots.txt allows the bot, but the server actively blocks it via WAF or status codes. Both means blocked at both levels.

Question 6

Is robots.txt actually enforced by AI crawlers?

Accepted Answer

Major AI companies like OpenAI, Anthropic, and Google have committed to respecting robots.txt directives. However, robots.txt is a voluntary protocol. The live fetch check in this tool reveals whether the server also enforces access at the firewall level, which provides actual technical blocking.

Can AI bots actually reach your site?

What the verdicts mean

If AI can't crawl you, AI can't cite you

Frequently Asked Questions

Let's talk