Original Reddit post

Got curious about how to actually put a number on the gaps between labs, so I made an attempt. Pulls from LMArena, LLM Stats and Artificial Analysis. Far from perfect as it builds on benchmarks which themselves are far from perfect, but it’s a start. Especially LLM Stats is hard as the data is self-published so you can not compare the labs head to head. Thoughts are appreciated. submitted by /u/CuSO4

Originally posted by u/CuSO4 on r/ArtificialInteligence