High Priority
Foundational /ai-manifest.txt Protocol
Establish a machine-readable summary of your entire early-stage startup's core value proposition, product architecture, and target market segments specifically for AI agents and knowledge graph builders.
Create a text file at /ai-manifest.txt with a concise summary of your startup's mission and primary problem-solution fit.
Include markdown-style links to your most critical foundational documents: founding story, core technology whitepaper, target customer profiles, and key investor decks.
Add a 'Founding FAQs' section in the file to directly address common queries from AI agents about your business model, market differentiation, and early traction metrics.


Configure your Early-stage companies crawler protocols effortlessly.
Join 2,000+ teams scaling with AI.
High Priority
AI Agent Selective Ingestion Controls
Fine-tune which sections of your early-stage startup's digital footprint are eligible for ingestion by specialized AI crawlers, ensuring focus on core business intelligence.
Implement directives in your robots.txt file (e.g., User-agent: * Allow: /founding-story/ Allow: /product-vision/ Disallow: /competitor-analysis-internal/)
Verify your crawler permissions and ingestion scope using AI-specific testing tools (e.g., simulated agent probes) to ensure adherence to your defined ingestion boundaries.
Monitor ingestion patterns in server logs to confirm that AI agents are accessing designated foundational content and avoiding sensitive internal strategy documents.
Medium Priority
Structured Data for Early-Stage Narrative
Leverage semantic HTML structures to help AI crawlers understand the hierarchy and importance of your startup's narrative elements, from problem statement to solution.
Wrap your core problem-solution narrative within <main> tags to denote primary content importance.
Utilize <section> elements with descriptive 'aria-label' attributes for distinct aspects of your product's value proposition (e.g., 'aria-label="core-technology-stack"', 'aria-label="customer-pain-points-addressed"').
Ensure all data tables detailing early traction metrics, user growth, or market size estimates use proper <thead> and <tbody> tags for precise data extraction by AI.
High Priority
RAG-Optimized Foundational Snippets
Structure your foundational content into discrete, contextually rich 'chunks' that can be efficiently processed and retrieved by Retrieval-Augmented Generation (RAG) pipelines for AI-driven insights.
Isolate related concepts and data points within distinct content blocks, ideally between 300-600 words, to facilitate granular RAG retrieval.
Explicitly restate the primary subject or startup name in section summaries to eliminate ambiguity and reinforce context for AI.
Replace generic pronouns (e.g., 'it', 'they', 'this') with specific product names, feature identifiers, or company nomenclature to ensure AI models can accurately attribute information.