PufferSights – AI Crawler Insights Wordpress Plugin - Rating, Reviews, Demo & Download
Plugin Description
PufferSights monitors 100+ known AI crawler and AI agent user agents, hashes IP addresses, groups traffic by bot, provider, crawl purpose, content type, and response status, tracks human referrals from AI surfaces, and can publish a dynamic llms.txt content map for public site content.
The dashboard summarizes:
- HTTP traffic by bot.
- Crawl purpose.
- Content type.
- Response status.
- AI referrals and crawl-to-refer ratio.
- Top crawled content.
- Tracked agent count.
- Dynamic llms.txt content map.
- robots.txt audit and policy snippets.
The crawler registry is based on current public operator documentation and industry references for OpenAI, Anthropic, Perplexity, Google, Apple, Common Crawl, Meta, ByteDance, Microsoft, Amazon, and related AI crawler operators.
The plugin does not contact any external service. All analytics data is stored in your own WordPress database.
robots.txt publishing is off by default. The plugin can generate and optionally publish policies for:
- Monitor only.
- Block training crawlers.
- Allow AI search/user-action bots while blocking training crawlers.
- Block all known AI bots.
llms.txt publishing is on by default and can be disabled in the PufferSights settings. The generated /llms.txt file lists selected published public pages and posts in Markdown so AI assistants can find the site’s main public content more easily.
Important Notes
User-agent detection is not bot verification. User agents can be spoofed. Raw IP addresses are not stored; the plugin stores a salted hash for rough uniqueness.
robots.txt is voluntary. Use a WAF, CDN, or server-level controls when technical enforcement is required.
Google-Extended and Applebot-Extended are robots.txt control tokens rather than normal request user agents, so they appear in robots.txt audits and policy snippets but usually do not appear in request logs.
llms.txt is a content map, not an access-control policy. It does not replace robots.txt and does not force AI systems to use or cite your content.
Privacy
PufferSights stores local analytics for public, logged-out requests only. It does not track wp-admin pages, logged-in users, AJAX requests, or WP-Cron requests.
The plugin stores:
- Request time and date.
- Event type, such as AI crawler request or AI referral.
- HTTP method.
- Request path without query string.
- HTTP response status.
- MIME/content group.
- Matched crawler or AI referral provider.
- User-agent string and user-agent hash.
- Salted one-way hash of the request IP address.
- Referrer origin only, such as
https://chatgpt.com, without referrer path or query string.
The plugin does not store raw IP addresses, cookies, browser local storage, or complete referrer URLs. It does not send analytics, telemetry, crawler records, or site data to third-party services.
If llms.txt publishing is enabled, the plugin serves a Markdown overview of selected published public posts and pages at /llms.txt. Drafts, private posts, and password-protected posts are not included.
Administrators can disable tracking, clear captured events, and configure retention from the PufferSights admin page. The default retention period is 90 days. On uninstall, the plugin removes its custom analytics table, saved options, and scheduled cleanup hook.
The plugin also adds suggested disclosure text to WordPress’ Privacy Policy Guide.
Screenshots
No screenshots provided

