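AI crawlers are the bots that fetch web pages on behalf of AI systems, either to build training corpora or to retrieve live content at answer time. The major named user-agent tokens include GPTBot and OAI-SearchBot (OpenAI), ClaudeBot (Anthropic), and PerplexityBot (Perplexity). Google-Extended (Google's AI products) and Applebot-Extended (Apple Intelligence) are usually listed alongside them, although they are robots.txt control tokens rather than separate crawlers: the pages are still fetched by Googlebot and Applebot, and the tokens govern whether that content may be used for AI purposes.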
A site controls their access through robots.txt user-agent rules and can offer them a curated reading list via llms.txt, a proposed convention rather than a formal standard. For a brand that wants to be represented accurately in AI answers, the decision is usually to explicitly allow the reputable crawlers and give them clean, structured content to read, which is the opposite of the reflex to block everything.
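
The robots.txt rules described above can be sketched and checked locally before they are published. The following is a minimal illustration in Python, not a recommended production policy: it generates one rule group per named token and verifies the result with the standard library's `urllib.robotparser`. The `/admin/` path, the `example.com` URLs, and the blocked agent name `ExampleScraperBot` are hypothetical placeholders, and the helper names `build_robots_txt` and `is_allowed` are invented for this sketch.

```python
from urllib.robotparser import RobotFileParser

# User-agent tokens named in the text above.
AI_AGENTS = [
    "GPTBot",
    "OAI-SearchBot",
    "ClaudeBot",
    "PerplexityBot",
    "Google-Extended",
    "Applebot-Extended",
]


def build_robots_txt(allowed, blocked=(), private_paths=("/admin/",)):
    """Build robots.txt text: allowed agents may read everything except
    private_paths; blocked agents are disallowed site-wide."""
    lines = []
    for agent in allowed:
        lines.append(f"User-agent: {agent}")
        # Disallow comes before Allow because Python's parser applies
        # the first rule that matches the path.
        lines.extend(f"Disallow: {path}" for path in private_paths)
        lines.append("Allow: /")
        lines.append("")  # blank line ends the group
    for agent in blocked:
        lines.append(f"User-agent: {agent}")
        lines.append("Disallow: /")
        lines.append("")
    return "\n".join(lines)


def is_allowed(robots_txt, agent, url):
    """Return True if `agent` may fetch `url` under the given rules."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, url)


# "ExampleScraperBot" is a made-up name standing in for an unwanted crawler.
robots_txt = build_robots_txt(AI_AGENTS, blocked=["ExampleScraperBot"])

if __name__ == "__main__":
    print(robots_txt)
    for agent in AI_AGENTS + ["ExampleScraperBot"]:
        print(agent, is_allowed(robots_txt, agent, "https://example.com/pricing"))
```

Two caveats apply to this sketch. Python's parser uses the first matching rule in file order, whereas RFC 9309 specifies that the longest matching path wins, so the `Disallow` lines are placed before `Allow: /` to get the same result under both interpretations. robots.txt is also advisory: it expresses the site's preference, and only crawlers that choose to honor it are affected.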
