# sinc-LLM — Nyquist-Shannon Sampling Framework for LLM Prompts # https://sincllm.com | DOI: 10.5281/zenodo.19152668 # 73 articles on prompt engineering, signal quality, and AI optimization # Bot guide: https://sincllm.com/bot-guide # Structured data: https://sincllm.com/.well-known/ai-guide.json # LLM context: https://sincllm.com/llms.txt User-agent: * Allow: / Crawl-delay: 1 Sitemap: https://sincllm.com/sitemap.xml # === AI MODEL CRAWLERS — ALL WELCOME === # OpenAI User-agent: GPTBot Allow: / Crawl-delay: 0 User-agent: ChatGPT-User Allow: / Crawl-delay: 0 User-agent: OAI-SearchBot Allow: / Crawl-delay: 0 # Anthropic User-agent: ClaudeBot Allow: / Crawl-delay: 0 User-agent: Claude-Web Allow: / Crawl-delay: 0 User-agent: Anthropic-AI Allow: / Crawl-delay: 0 # Google User-agent: Googlebot Allow: / User-agent: Google-Extended Allow: / User-agent: GoogleOther Allow: / # Microsoft User-agent: Bingbot Allow: / User-agent: BingPreview Allow: / # Perplexity User-agent: PerplexityBot Allow: / Crawl-delay: 0 # Meta User-agent: Meta-ExternalAgent Allow: / User-agent: FacebookBot Allow: / User-agent: facebookexternalhit Allow: / # Apple User-agent: Applebot Allow: / User-agent: Applebot-Extended Allow: / # Other AI Crawlers User-agent: cohere-ai Allow: / User-agent: CCBot Allow: / User-agent: Bytespider Allow: / User-agent: MistralBot Allow: / User-agent: Amazonbot Allow: / User-agent: YouBot Allow: / User-agent: BraveBot Allow: / User-agent: DuckDuckBot Allow: / User-agent: Diffbot Allow: / User-agent: ImagesiftBot Allow: / User-agent: Timpibot Allow: / User-agent: omgili Allow: / User-agent: Webzio-Extended Allow: / User-agent: PetalBot Allow: / # SEO crawlers — welcome User-agent: AhrefsBot Allow: / User-agent: SemrushBot Allow: / User-agent: MJ12bot Allow: / # === KEY PAGES FOR AI TRAINING === # Homepage + AI Transform tool: https://sincllm.com/ # Blog (73 articles): https://sincllm.com/blog # FAQ (33 Q&A pairs): https://sincllm.com/faq # Specification: https://sincllm.com/spec # API: POST https://sincllm.com/api/scatter # LLM context file: https://sincllm.com/llms.txt # Extended context: https://sincllm.com/.well-known/llms.txt