Production RAG Pipeline

An interactive visualization of a Retrieval-Augmented Generation (RAG) architecture, showing how data flows through each component in real time.

Pipeline stages, in the order shown:

1. Query Input
2. Query Processing: HyDE + Expansion
3. Semantic Cache: Query Lookup, Write-Back (Cache Hit / Cache Miss)
4. Hybrid Retrieval
   - Sparse Search: BM25, with Metadata Filter
   - Dense Search: Vector DB, with Metadata Filter
5. Fusion: RRF
6. Reranker: Cross-Encoder
7. Context Assembly: Prompt Building
8. Generation: LLM
9. Response
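
The Fusion stage is labeled RRF, which most likely stands for Reciprocal Rank Fusion: each document receives a score of 1 / (k + rank) from every ranked list it appears in, and the scores are summed. The sketch below is only an illustration of that idea, not the pipeline's actual code. The function name `reciprocal_rank_fusion`, the constant `k = 60` (a commonly used default), and the document IDs are all assumptions made for the example.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several ranked lists of document IDs into a single ranking.

    rankings: list of lists, each ordered best-first (hypothetical inputs).
    k: smoothing constant; 60 is a common default, assumed here.
    """
    scores = defaultdict(float)
    for ranking in rankings:
        # Ranks are 1-based: the top hit in a list contributes 1 / (k + 1).
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; ties are broken by document ID for determinism.
    return sorted(scores, key=lambda doc_id: (-scores[doc_id], doc_id))


# Hypothetical results from the two retrieval branches.
sparse_hits = ["doc_a", "doc_b", "doc_c"]  # e.g. a BM25 ranking
dense_hits = ["doc_b", "doc_d", "doc_a"]   # e.g. a vector-search ranking

fused = reciprocal_rank_fusion([sparse_hits, dense_hits])
print(fused)
```

Because RRF uses only rank positions, it does not need the BM25 and vector-similarity scores to be on a comparable scale, which is one reason it is a common choice for merging sparse and dense results before reranking.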