Hi, dev here. You can visit the site here: https://benchmarklist.com/ . Would love any feedback or evals we missed :)! We think AI evals and benchmarks are not tracked well today and hard to understand across many real world skills - we want to fix this! Thanks! submitted by /u/davidthesong
Originally posted by u/davidthesong on r/ArtificialInteligence
You must log in or # to comment.
