I designed and deployed a production-ready RAG system that enables accurate, low-latency querying of private PDF documents by grounding LLM responses in verified document context.

Live Demo: Hugging Face Space
📄 Tech Stack: Python · LangChain (LCEL) · Pinecone · Groq API · Llama 3 · MiniLM Embeddings · Gradio