ABCDullahh.
LLM

EpsteinRAG – RAG Search Engine

Full-stack RAG search engine enabling natural language querying across 44,886+ declassified government documents with AI-generated answers and source citations.

FastAPINext.jsPostgreSQLMeilisearchGeminiPython
GitHub
AI Search Results — RAG analysis with source citations

System Architecture

[SYS] ─── epsteinrag-hybrid.arch

Overview

A full-stack RAG search engine for natural language querying across 44,886+ declassified documents.

Problem Statement

Declassified documents are difficult to search using traditional keyword methods.

Solution & Approach

Hybrid search: keyword (Meilisearch) + semantic vector search (Gemini Embedding 001, 3072-dim) with pgvector.

Key Features

  • Hybrid search: keyword + semantic vector (3072-dim embeddings)
  • ~16ms cached responses via SHA-256 query caching
  • Real-time streaming via Server-Sent Events (SSE)
  • Reduced latency from 8-10s to 2-3s for first token
  • Citations linked to original DOJ documents

Results & Impact

~16ms cached responses with streaming. All answers include numbered citations.

Lessons Learned

Hybrid search combining keyword and semantic provides better results than either alone.

PreviousQuranVerif – AI Quran SearchNextASIH Salon – Digital Atelier

Let's build
something amazing

Jakarta, Indonesia

© 2026 Faizal Lutfi