WEB INFRASTRUCTURE
A Cloudflare elindítja a Content Signals szolgáltatást, hogy kontrollt adjon az alkotóknak az AI adatgyűjtés felett
Publishers, content creators, and platforms across the internet are calling this moment a 'web infrastructure revolt.' Cloudflare just launched a new addition to robots.txt files that lets website owners express preferences for how their content gets used after it's accessed. Cloudflare is already deploying this for 3.8M domains that use their managed robots.txt feature, automatically signaling that they don't want their content used for AI training. While not a technical block, they create a clear, standardized way for website owners to set their own rules.
- 'search' signal: Controls if content can be used to build a search index.
- 'ai-input' signal: Controls if content can be input into AI models for real-time answers.
- 'ai-train' signal: Controls if content can be used to train or fine-tune AI models.
- Combined with enforcement tools like Cloudflare's WAF and Bot Management to give creators control.
Miért fontos?
For 25 years, the deal was simple: you could scrape content, but you'd send referral traffic and give attribution. That deal might as well be dead. Now, we're fighting to figure out what comes next to ensure creators are properly attributed and compensated.