RAG with LlamaParse from LlamaIndex and LangChain
Uses Qdrant and Mixtral via Groq API🔥
👨🏾💻 GitHub ⭐️| 🐦 Twitter | 📹 YouTube | 👔LinkedIn | ☕️Ko-fi

LlamaIndex is a great data framework and recently they introduced LlamaParse where we can parse PDF in a simplest way possible. Wouldn’t it be cool, if we can create a clean and better quality document out of PDF to feed into the LLM ? If your answer is yes, you landed in the right post 😎
Tools used: LlamaParse, LangChain, Fastembed, Qdrant, Groq, Chainlit
It would be good for you to understand things if you first go through my previous blog post.
- Super Easy way to Parse PDFs ✅🔥
- Groq: Insanely Fast Inference 🚀 | World’s First Language Processing Unit (LPU)
If you are a follower you know I don’t try to replicate things in different places, if you are new, now you know. I have a Github repo which has code of what I am going to list here.
What is the flow in the code ? Here it is.
- In
ingest.pyfile, I have explained how we can use LlamaParse to parse documents ( get clean documents ) and how to make that document available for LangChain. After that, I have explained how to create embeddings which are stored in the qdrant cloud. - In
app.pyfile, I have explained how to create a simple Chainlit app to have conversation with your document. Use models from Groq for super fast Inference.
Now, you know how to create a parser, do some extra steps to use that with Langchain and use that parsed document to create embeddings and store it in Qdrant. And finally use models from Groq for fast Inference with Chainlit.





