Prevent AI Training - Metastem/Wikiless GitHub Wiki
Known tags and settings suggested to opt out of having your content used for AI training.
- robots.txt: A collection of tags to add to your own robots.txt. Automate generation with darkvisitors.com.
-
meta-tags.html: Tags to add to your own
<head>
. - headers.txt: HTTP headers for responses. Installation is outside the scope of this document.
- ai.txt: An alternative to robots.txt by Spawning, the company behind haveibeentrained.com.
- ip-ranges.txt: Known IP ranges for AI crawlers. Links to the canonical source included.
- tdmrep.json: Web protocol for expressing the reservation of rights relative to text & data mining (TDM).
-
OpenAI: Email your organization ID to [email protected] to opt out.
- Mobile apps (iOS & Android): Go to settings > personalization and uncheck the Memory option. Turn off "Improve the model for everyone" under Data Controls. Uncheck "Include your audio/video recordings" under VOICE MODE.
- StabilityAI: Opt out at haveibeentrained.com.
- AWS: Follow steps described in this article to stop AI data usage.
- Substack: Go to Settings > Publication details and switch it on.
- WordPress and Tumblr: Follow the provided links.
- The Stack: Find your repo(s) on Am I in The Stack? and click Opt-Out at the bottom to open a request.
- How to Block ChatGPT From Using Your Website Content
- All Deviations Are Opted Out of AI Datasets
- OpenAI Terms of Use
- Stability AI plans to let artists opt out of Stable Diffusion 3 image training
- Stop AI Data Mining in its Tracks with AI.txt
- Sites scramble to block ChatGPT web crawler after instructions emerge
- An update on web publisher controls
- Dark Visitors: A List of Known AI Agents on the Internet
- TDM Reservation Protocol (TDMRep)