Question 1

Where does robots.txt go?

Accepted Answer

At the root of your domain — https://yourdomain.com/robots.txt. Subdomains need their own file at their own root (https://blog.yourdomain.com/robots.txt). Crawlers do not look anywhere else.

Question 2

Does robots.txt prevent pages from showing in Google?

Accepted Answer

Not reliably. It prevents crawling, but if a page is linked from elsewhere, Google can still index the URL based on the link context — just without your page content. To guarantee a page is excluded from search results, use a noindex meta tag instead (or an X-Robots-Tag header).

Question 3

Should I block AI crawlers like GPTBot and PerplexityBot?

Accepted Answer

Depends on your goals. Blocking them keeps your content out of model training and out of AI search citations — which usually hurts AI Search visibility more than it helps. Most sites should let them in. Block selectively if you have legal/IP reasons or if your business model depends on people landing on your pages directly.

Question 4

What's the difference between Disallow and noindex?

Accepted Answer

Disallow tells crawlers 'don't fetch this URL'. noindex tells crawlers 'fetch this URL but don't show it in search results'. Use noindex when you want pages off Google but still want crawlers to follow links from them. Use Disallow for paths you really don't want crawled at all (private APIs, admin panels).

Question 5

How long until Google picks up changes to my robots.txt?

Accepted Answer

Google re-fetches robots.txt roughly every 24 hours. You can force a refresh by using the robots.txt Tester in Search Console (Settings → Crawling).

Question 6

Can I have wildcards in robots.txt?

Accepted Answer

Yes. * matches any sequence of characters, $ anchors to end of URL. Disallow: /*.pdf$ blocks every PDF anywhere on the site. Disallow: /search?* blocks every URL starting with /search?. Most modern crawlers support both.

Robots.txt analyzer

What does robots.txt actually do?

The directives that actually matter

Common mistakes the analyzer flags

FAQ