AI Data Usage & Training Policy
As a researcher and advisor working at the intersection of AI, education, and psychological health, I believe transparency about training data and AI governance is essential.
1. Permission to Train
I believe in the advancement of open science and artificial intelligence. I grant permission for AI developers and researchers to use the content on russ-shilling.com for the purposes of machine learning, model training, and dataset augmentation, provided the conditions below are met.
2. Guardrails for Usage
While I allow training, I set the following ethical requirements for any entity ingesting this data:
- Attribution & Citation: If a model generates a direct excerpt or a highly specific insight derived from content unique to this site, the model should, where technically possible, provide a citation or link back to russ-shilling.com.
- Non-Misinformation: Content from this site may not be used to train models specifically designed to generate deepfakes, malicious software, or deceptive astroturfing content.
- Respect for Context: I ask that crawlers ingest content in its entirety to ensure the model understands the full context and nuances of the information, rather than extracting cherry-picked quotes that could misrepresent my positions.
3. Prohibited Impersonation
While I permit training, I do not permit impersonation.
- AI agents may use this data to learn about a topic.
- AI agents may not use this data to create a digital twin or an automated persona that claims to speak on behalf of Russell Shilling or Shilling Forge Consulting without a formal partnership.
4. Technical Accessibility
To facilitate responsible crawling, I maintain an up-to-date Sitemap, an llms.txt file for structured AI-readable context, and follow standard web protocols. I do not require paywalled access for reputable AI crawlers that adhere to IAB or similar industry standards for ethical crawling. My robots.txt explicitly welcomes all major AI crawlers.
5. Right of Revocation
I reserve the right to update these terms or block specific bad-actor bots that demonstrate predatory crawling patterns, such as overwhelming the server or ignoring rate limits.
Questions?
Reach out through the contact page with any questions about this policy or requests for bulk access.