# llms.txt for https://emanus.com/ # Purpose: Guide AI crawlers, LLMs, and AI search engines on access, usage, attribution, and pacing. # Date: 2025-10-08 # Owner: Emanus (contact: webmaster@emanus.com) ############################################################################### # 0) SCOPE & DEFINITIONS ############################################################################### # Applies to automated systems that crawl, cache, embed, summarize, index, or train models # on emanus.com content, including but not limited to AI search, answer engines, LLMs, and # agents. This file complements robots.txt and legal terms. Where conflicts exist, the most # restrictive policy between page-level directives (robots/meta) and this file should apply. ############################################################################### # 1) DISCOVERY & ACCESS (ALLOW) ############################################################################### # We ALLOW compliant AI crawlers to discover and index content for AI Search/Overviews, # retrieval-augmented answers, and snippet generation. Allow: / Respect: robots.txt, canonical tags, meta robots, x-robots-tag Render: Allowed (CSS/JS rendering permitted) Rate-Limit-Hint: 1 rps peak per IP; 0.2–0.5 rps sustained (200–600 req/hr) Retry-After-Hint: 60s on 429/503 ############################################################################### # 2) USER-AGENTS (EXPLICITLY ALLOWED) ############################################################################### # Search + AI User-agent: GPTBot User-agent: ChatGPT-User User-agent: ChatGPT User-agent: Google-Extended User-agent: Gemini User-agent: Bingbot User-agent: DuckDuckBot User-agent: Slurp User-agent: Applebot User-agent: PerplexityBot User-agent: PerplexityAI User-agent: ClaudeBot User-agent: Claude-Web User-agent: anthropic-ai User-agent: CCBot User-agent: Omgilibot # NOTE: Non-compliant, identity-masking, or abusive bots remain subject to blocking at any time. ############################################################################### # 3) ATTRIBUTION & CITATION REQUIREMENTS ############################################################################### # When showing emanus.com content in AI answers/UI: # - Cite the exact source URL (canonical), page title, and brand ("Emanus"). # - Provide a clickable link near the snippet (same viewport on mobile/desktop). # - Include "Published" or "Updated" dates when present. # - Disclose that content was sourced from emanus.com. Attribution-Format: - Title: " — Emanus" - URL: canonical of the source page - Label: "Source: Emanus" ############################################################################### # 4) SNIPPET & QUOTATION POLICY ############################################################################### # To preserve user value and avoid substitution: Max-Direct-Quote-Length: 90 words per page per answer Max-Total-Quoted-Words: 200 words per answer Paraphrase-OK: Yes, with citation and meaning preserved Images/Diagrams: Thumbnails only; link to source page for full view Tables: Summaries only; link to source for full table ############################################################################### # 5) TRAINING / FINE-TUNING / EMBEDDINGS ############################################################################### # We allow non-exclusive use for search indexing, RAG, and short-lived caching. # For model training/fine-tuning or long-term corpus building, follow these conditions: Training-Allowed: Yes (non-exclusive, revocable) Conditions: - Remove personally identifiable information (PII) and sensitive data. - Respect per-page "noai" or "noimageai" directives if present. - Honor takedown within 10 business days of notice. - Do not reproduce full articles verbatim in model outputs. Embedding-Allowed: Yes, for retrieval with citation and link-back. ############################################################################### # 6) CACHING & STORAGE ############################################################################### Cache-Window-Hint: 30 days (revalidate ETag/Last-Modified after) No-Full-Archive: Do not store or expose full-text archives for redistribution. ############################################################################### # 7) PRIORITY CONTENT MAP (INDEX FIRST) ############################################################################### # Primary pages (update as structure evolves) Priority: - https://emanus.com/ - https://emanus.com/about-us/ - https://emanus.com/patent/ - https://emanus.com/intellectual-property/ - https://emanus.com/commercialization-service-in-connecticut/ - https://emanus.com/contact-us/ - https://emanus.com/blog/ # High-value blog posts (supplied by site owner) Priority: - https://emanus.com/blog/how-to-register-a-trademark-guide/ - https://emanus.com/blog/intellectual-property-leakage-prevention/ - https://emanus.com/blog/google-patent-search-vs-uspto-guide/ - https://emanus.com/blog/guide-to-trademark-symbols/ - https://emanus.com/blog/ip-licensing-profit-playbook/ - https://emanus.com/blog/commercialization-strategies-for-new-product-launches/ - https://emanus.com/blog/tm-vs-r-symbol-differences/ ############################################################################### # 8) SITEMAPS ############################################################################### Sitemap: https://emanus.com/sitemap_index.xml Sitemap: https://emanus.com/sitemap.xml ############################################################################### # 9) PROHIBITED USES ############################################################################### Disallow-Uses: - Shadow UIs that replicate full emanus.com content without click-through. - Output that suggests Emanus endorses third-party products/services without consent. - Training on content behind authentication, paywalls, or explicit "noindex/noai" flags. ############################################################################### # 10) COMPLIANCE, CONTACT & LEGAL ############################################################################### Legal: - This file is a good-faith guidance document and does not replace our Terms of Use or any explicit licenses. Continued access implies acceptance of these conditions. Contact: - AI/Data usage requests or removal: webmaster@emanus.com Update-Cadence: Review quarterly or upon material site changes.