Original Reddit post

Hypothetically, suppose someone built a system that could reason across an extremely large context (100M+ tokens) with near-perfect accuracy, scoring around 98% on MRCR v2 across all needle tests. What would you do with it? Assume the LLM is only one component of the larger system, not the entire system. How would you prove its capabilities in a way that would be hard to doubt?

Originally posted by u/NonStopSix on r/ArtificialInteligence