LLM

EpsteinRAG – RAG Search Engine

Full-stack RAG search engine enabling natural language querying across 44,886+ declassified government documents with AI-generated answers and source citations.

FastAPINext.jsPostgreSQLMeilisearchGeminiPython

GitHub

AI Search Results — RAG analysis with source citations

System Architecture

Overview

A full-stack RAG search engine for natural language querying across 44,886+ declassified documents.

Problem Statement

Declassified documents are difficult to search using traditional keyword methods.

Solution & Approach

Hybrid search: keyword (Meilisearch) + semantic vector search (Gemini Embedding 001, 3072-dim) with pgvector.

Key Features

Hybrid search: keyword + semantic vector (3072-dim embeddings)
~16ms cached responses via SHA-256 query caching
Real-time streaming via Server-Sent Events (SSE)
Reduced latency from 8-10s to 2-3s for first token
Citations linked to original DOJ documents

Results & Impact

~16ms cached responses with streaming. All answers include numbered citations.

Lessons Learned

Hybrid search combining keyword and semantic provides better results than either alone.