GitHunt
AR

Ariyan-Pro/RAG-Latency-Optimization

CPU-optimized RAG pipeline reducing latency 2.7× (247ms → 92ms). Implements caching, filtering, quantization for production. Complete with FastAPI, Docker, benchmarks, investor materials. The engineering showcase that sells itself.

No README found.

Languages

Python99.6%Dockerfile0.4%

Contributors

Created January 23, 2026
Updated January 24, 2026
Ariyan-Pro/RAG-Latency-Optimization | GitHunt