AI Robots.txt Generator - Block GPTBot & AI Crawlers
Block AI training bots (GPTBot, ClaudeBot, Google-Extended) from your website. Protect content from LLM training while allowing search engines.
Quick Presets
Recommended: "Block Training, Allow Browsing" balances protection with visibility in AI answers.
AI Crawlers
Additional Options
robots.txt
Implementation Guide
- Copy the generated robots.txt content above
- Create/edit the file at your website root:
https://yoursite.com/robots.txt - Merge with existing rules if you already have a robots.txt file
- Test by visiting your robots.txt URL directly in a browser
- Monitor your server logs for crawler activity (User-Agent headers)
Crawler Categories
Training Used to collect data for AI model training
Browsing Real-time web access for AI assistants
Dataset Builds public datasets used by multiple AI companies
Aggressive Known for high crawl rates or unclear policies