We use cookies to improve your experience and analyze our traffic. By clicking "Accept", you consent to our use of cookies.

Back to Tools

Free robots.txt Generator

Control how search engines and AI crawlers access your site. Generate a perfect robots.txt file instantly.

GLOBAL SETTINGS (All Crawlers)

SITEMAP URL

SPECIFIC RULES

GENERATED ROBOTS.TXT

User-agent: *
Allow: /

About this tool

The most common robots.txt mistake is treating it like a security feature. It is not one. Disallow: /admin/ doesn't hide your admin panel from anyone — it just politely asks well-behaved crawlers to skip it, and helpfully tells everyone else exactly where to look.

What robots.txt is good for: managing crawl budget on large sites, keeping search engines out of staging, and (in 2026 specifically) deciding which AI bots get to read your content. The new question every site owner has to answer is whether to allow GPTBot, Claude-Web, PerplexityBot, and Google-Extended. The trade-off is real: blocking them prevents AI training on your content but also prevents AI citation. For most small business sites, the math favors allowing them — citation drives qualified referral traffic; the alternative is invisibility in the surfaces eating the most search demand.

Our generator builds a sensible default: allow general crawlers, allow the major AI bots (with explicit listings so future-you knows what's allowed), block obvious crawl waste (admin paths, search results, faceted URLs), and point at your sitemap. You can customize before you copy. Don't paste blindly.

Frequently asked questions

What does robots.txt do?+
Tells web crawlers which paths on your site they can or can't request. It's an access directive — not a security mechanism. Sensitive content needs auth, not robots.txt.
Should I block AI crawlers?+
Depends on your goal. Allowing GPTBot, Claude-Web, and PerplexityBot increases the chance your content is cited in AI Overviews. Blocking them prevents AI training but also reduces visibility.
Will Disallow remove pages from Google?+
No. Disallow blocks crawling but the URL can still appear in search if it's linked from elsewhere. Use a noindex meta tag for true exclusion.
Where does robots.txt live?+
Always at the root: https://yoursite.com/robots.txt. Crawlers do not check subdirectory locations.

Related resources