
Open-source RAG API written in Rust. First token response time under 200 ms, 94.5% accuracy on over 3,000 questions. Hybrid search with reranking and streaming responses. Upload PDFs, ask questions, and get answers quickly. Self-host the entire system in minutes. Connect your knowledge to your AI voice, AI agents, and any product where retrieval speed is a bottleneck.
Code/Dev