AI Content Signals
AI Content Signals allows you to easily implement the Content Signals Policy in your WordPress site’s robots.txt file. This gives you more control over how AI crawlers and large language models (LLMs) can use your content.
What are Content Signals?
Content Signals is an extension to the robots.txt standard created by Cloudflare that lets you specify three types of permissions for AI crawlers:
- search – Allow or deny search indexing and traditional search results
- ai-input – Allow or deny using your content for real-time AI responses (RAG, AI Overviews)
- ai-train – Allow or deny using your content for training AI models
Key Features
- Easy-to-use settings page in WordPress admin
- Set global defaults for all crawlers
- Configure specific settings for individual AI bots (GPTBot, ClaudeBot, PerplexityBot, etc.)
- Add custom bot User-Agents
- Supports both physical and virtual robots.txt files
- Option to create physical robots.txt with basic WordPress rules
- Preview generated Content Signals before applying
- Optional legal text with EU Directive reference
- Works with existing robots.txt from SEO plugins
- Automatic sitemap detection and inclusion
Supported Bots
The plugin includes predefined settings for major AI crawlers:
- OpenAI GPTBot and ChatGPT-User
- Anthropic ClaudeBot and Claude-Web
- Perplexity Bot
- Google Extended (Bard/Gemini)
- Common Crawl Bot
- Meta/Facebook Bot
- And many more…
Important Notice
Content Signals is a declarative standard – it expresses your preferences but does not technically enforce them. AI companies are not legally required to respect these signals, though the plugin includes legal text referencing EU copyright directives.
This plugin works best when combined with other protection measures like traditional robots.txt rules and server-level bot management.
