Question 1

What is a robots.txt file?

Accepted Answer

A robots.txt file is a plain text file placed at the root of your website (example.com/robots.txt) that tells search engine crawlers which URLs they may or may not crawl. It is used mainly to manage crawl traffic and keep low-value pages from wasting crawl budget — it is not a security mechanism.

Question 2

Where do I upload my robots.txt file?

Accepted Answer

Upload it to the root directory of your domain so it is reachable at https://yourdomain.com/robots.txt. It must be named exactly robots.txt in lowercase. Subdirectory locations like /blog/robots.txt are ignored by crawlers.

Question 3

Does robots.txt block a page from appearing in Google?

Accepted Answer

Not reliably. A page disallowed in robots.txt can still be indexed if other sites link to it. To keep a page out of search results, use a noindex meta tag on a crawlable page, or password-protect it. Robots.txt controls crawling, not indexing.

Question 4

How do I block AI crawlers like GPTBot in robots.txt?

Accepted Answer

Add a user-agent group for each AI crawler with a Disallow: / rule. Common AI training crawlers include GPTBot (OpenAI), ClaudeBot (Anthropic), Google-Extended (Google AI training), CCBot (Common Crawl), and PerplexityBot. Our generator adds all of these with one checkbox.

Question 5

Should I include my sitemap in robots.txt?

Accepted Answer

Yes. Adding a Sitemap: line with the full URL of your XML sitemap helps all search engines discover your pages faster, including engines you haven't manually submitted to. You can list multiple sitemap lines if you have more than one.

Question 6

Can a wrong robots.txt hurt my SEO?

Accepted Answer

Yes — a single mistaken line like Disallow: / under User-agent: * blocks your entire site from being crawled, which can remove it from search results over time. Always double-check your rules and test the file in Google Search Console after uploading.

Free Robots.txt Generator

How to use your robots.txt file

Which AI crawlers can you block?

Frequently asked questions

Related free tools

Meta Tag Generator

Password Generator

User-agent	Operator	Used for
GPTBot	OpenAI	Training data collection
ClaudeBot	Anthropic	Training data collection
Google-Extended	Google	Gemini AI training (does not affect Google Search ranking)
CCBot	Common Crawl	Open web archive used in many AI datasets
PerplexityBot	Perplexity	AI answer engine indexing