From the very beginning, the CoSpaceGPT team knew they needed robust, highly accurate RAG pipeline capable of retrieving context across many different file types, including:
- PDFs with heavy visuals
- Diagrams and images
- PowerPoints and Word documents
- Large multi-file project workspaces
They first attempted to build this internally, but the limitations became clear as usage grew. As Sanat explained:
"We wanted to have our own RAG system because our use case is very much about accuracy of retrieval from files. We actually tried building our own one first. It worked for some time, but it quickly broke when there were a lot of files involved… we were just doing similarity search over embedded chunks."
With increasing file volumes, mixed media (PDFs, diagrams, images), and multi-file queries, their in-house approach couldn't scale or maintain accuracy. They also required:
- Stable, isolated context boundaries per chat thread
- Long-term knowledge bases for ongoing projects
- A way to handle continuous file uploads from users inside those chats
That's when they began evaluating external providers and their requirements led them to Needle.
"We then sought to find a provider which could do this for us. And yeah, we found Needle — I think it was easy to integrate and it gave us pretty good accuracy."
Sanat noted that the integration required minimal engineering effort, and this ease allowed the team to focus on product improvements rather than infrastructure complexity.