Data scientist. Love data. Couldn’t find a single place to compare LLMs across multiple dimensions simultaneously. Centralized benchmark sites have become untrustworthy: gamed metrics, cherry-picked evals, paid placements. You know the drill.

So I built https://llm-matrix-arena.vercel.app/

What it does:
- Browse LLM scores across 2 to N dimensions at once
- You vote, and your votes actually shape the rankings
- Seeded with only 20 votes per model, based on aggregated scores from public internet sources; the rest is up to the community

The whole thing was built with Claude Code. Shoutout to these two plugins that carried:
- production-grade: https://github.com/nagisanzenin/claude-code-production-grade-plugin
- claude-mem: https://github.com/thedotmack/claude-mem

Go vote. Make the data real.
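
For anyone wondering how a 20-vote seed and community votes might fit together, here is a minimal sketch of one common approach: treat the seed as 20 pseudo-votes at the aggregated public score, then let real votes dilute it. This is not necessarily how the site computes its rankings. The `DimensionScore` class, its field names, and the 0-10 scale are all assumptions made for illustration; only the figure of 20 seed votes comes from the post.

```python
from dataclasses import dataclass

SEED_VOTES = 20  # from the post: each model starts with 20 seeded votes


@dataclass
class DimensionScore:
    """Running score for one model on one dimension (hypothetical structure)."""

    seed_score: float        # aggregated public score, assumed to be on a 0-10 scale
    vote_total: float = 0.0  # sum of community vote values
    vote_count: int = 0      # number of community votes received

    def add_vote(self, value: float) -> None:
        """Record one community vote."""
        self.vote_total += value
        self.vote_count += 1

    @property
    def score(self) -> float:
        # The seed counts as SEED_VOTES votes at seed_score,
        # so every real vote shifts the average a little further.
        weighted_sum = SEED_VOTES * self.seed_score + self.vote_total
        return weighted_sum / (SEED_VOTES + self.vote_count)


coding = DimensionScore(seed_score=8.0)
for _ in range(20):
    coding.add_vote(6.0)

print(coding.score)  # 7.0: after 20 real votes, seed and community weigh equally
```

With a small seed like this, the starting score only matters early on: once a model has a few hundred votes, the seeded value contributes very little to the average.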
Originally posted by u/No_Skill_8393 on r/ClaudeCode
