Cloudflare just gave AI companies an ultimatum on scraping
Cloudflare will block mixed-use AI crawlers from ad-supported pages by default from September 15, 2026, while evolving its Pay Per Crawl marketplace into a Pay Per Use model that pays publishers when their content appears in AI answers.
A Hard Deadline for Mixed-Use Crawlers
Cloudflare has issued the AI industry a deadline to separate the web crawlers used for traditional search from those used for AI agents and model training. Starting September 15, 2026, its default settings will block mixed-use crawlers from any pages that host ads. Crawlers that blend search, agent use, and training will be blocked by default unless the site owner adjusts settings, and the changes apply to new customers, new sites set up by existing customers, and all existing free-tier users.
The move follows a broader shift in how Cloudflare frames the relationship between publishers and AI companies. Whereas the internet previously rewarded creators by directing users to original websites, AI crawlers now collect text, articles, and images to generate responses without sending users to the source, depriving publishers of traffic and advertising revenue.
While no company is named directly, the implication is clear. Cloudflare singled out Google for criticism, arguing that the search giant's dominance gives it access to roughly twice the web content available to other AI companies, because remaining discoverable in search effectively requires consenting to AI use. Google has pushed back, noting that it provides a bot called Google-Extended that lets site owners opt out of having their content used for AI training and products like Gemini, without affecting a site's inclusion in Google Search.
The scale of Cloudflare's position amplifies the policy's significance. Cloudflare handles roughly 20 percent of web traffic, giving it unusual leverage to reshape how AI companies access publisher content at scale. To underscore the stakes, Cloudflare cited crawl-to-referral ratios showing Google crawls sites about 14 times for every referral, while OpenAI's ratio stands at 1,700:1 and Anthropic's at 73,000:1.
From Pay Per Crawl to Pay Per Use
A year ago, Cloudflare launched Pay Per Crawl so publishers could charge AI companies for crawling their content. Now, it is evolving that into Pay Per Use, so publishers are paid when their content actually creates value, not just when it is fetched.
Cloudflare data suggests that over 50 percent of crawl traffic from AI crawlers is spent re-fetching unchanged pages, resulting in wasted bandwidth for publishers and wasted compute for AI companies. The Pay Per Use model is designed to correct that inefficiency by tying compensation to outcomes rather than activity.
Cloudflare is evolving Pay Per Crawl into a broader Pay Per Use model, where publishers could be paid when their content appears in an AI result or when an agent purchases premium information for a specific task. Ceramic.ai and You.com are among the first partners in the program. An Attribution Business Insights dashboard is also planned to show how AI bots access content, where that content is cited, and how much human traffic different AI platforms return.
The policy represents a significant structural challenge for AI companies that have built data pipelines around bulk, unrestricted web access. The change shifts web-crawl access from implicit to explicit and raises the operational cost of sourcing web-scale training data, since pipelines built on bulk crawling will need opt-in checks, credentials, or paid access.
Sources:
TechCrunch: Cloudflare's new policy pushes AI companies to pay for publishers' content
Cloudflare Blog: Control content use for AI training with managed robots.txt and blocking for monetized content
Cloudflare Press Release: Your Content, Your Rules
Latest News
Read More...
Author
Crypto RichRich has been researching cryptocurrency and blockchain technology for eight years and has served as a senior analyst at BSCN since its founding in 2020. He focuses on fundamental analysis of early-stage crypto projects and tokens and has published in-depth research reports on over 200 emerging protocols. Rich also writes about broader technology and scientific trends and maintains active involvement in the crypto community through X/Twitter Spaces, and leading industry events.













