Original Reddit post

Hey everyone, ​I recently finished building a Model Context Protocol (MCP) index containing roughly 3 million arXiv papers. My goal was to make it easier to connect local and cloud LLMs directly to a massive corpus of ML and STEM research to help reduce hallucinated citations and improve research workflows. ​The index is live, but before I open it up broadly, I want to make sure the retrieval quality actually holds up against highly niche, complex queries (especially for obscure math, hyper-specific domains, or newer architectures). ​I’m looking for a small group of folks (around 20) to try it out, try to break the retrieval system, and give me brutal feedback on the relevance of the fetched papers. ​If you want to stress-test it with your own LLM setup and see how it performs with your daily research queries, let me know in the comments or shoot me a DM and I’ll send you the connection details! submitted by /u/Divyansh3021

Originally posted by u/Divyansh3021 on r/ClaudeCode