eifachposteMB to AI (Reddit RSS)English · 6 hours ago

The Singularity Gate: a benchmark for paradigm-shifting scientific discoveries published strictly after model cutoff

0

1

The Singularity Gate: a benchmark for paradigm-shifting scientific discoveries published strictly after model cutoff

eifachposteMB to AI (Reddit RSS)English · 6 hours ago

0

Original Reddit post

Just released a benchmark called The Singularity Gate. Tests whether frontier AI can predict paradigm-breaking scientific discoveries published after their training cutoff. Top score: 17.75% (partial credit, Opus 4.7). Fully-correct outcome rate: 0% across all respondents. The framing is necessity, not sufficiency. A model that can predict paradigm-breaking discoveries from priors it already has isn’t necessarily Einstein-level. A model that can’t is definitely not. Failing the gate rules out the capability. Passing doesn’t certify it. https://preview.redd.it/osbj2l19ac3h1.png?width=900&format=png&auto=webp&s=2247efb28b2c76babeebd0ce20340725f48140e4 https://preview.redd.it/mxr0r44bac3h1.png?width=488&format=png&auto=webp&s=eac7ca727f703fbd140981b5a33935d78b758ed6 Paper: https://doi.org/10.5281/zenodo.20358378 Site: https://singularitygate.org/ Happy to discuss methodology, related work, or the framing in the comments. submitted by /u/lordpermaximum

Originally posted by u/lordpermaximum on r/ArtificialInteligence

You must log in or # to comment.

Chat