I built a site to browse and vote on LLMs across N dimensions, entirely with Claude Code

www.reddit.com

eifachposteMB to AI (Reddit RSS) · English · 4 hours ago

Data scientist. Love data. Couldn’t find a single place to compare LLMs across multiple dimensions simultaneously. Centralized benchmark sites have become untrustworthy: gamed metrics, cherry-picked evals, paid placements. You know the drill. So I built https://llm-matrix-arena.vercel.app/

What it does:

Browse LLM scores across 2 to N dimensions at once
You vote, and your votes actually shape the rankings
Seeded with only 20 votes per model, based on aggregated scores from public internet sources; the rest is up to the community

The whole thing was built with Claude Code. Shoutout to these two plugins that carried:
production-grade: https://github.com/nagisanzenin/claude-code-production-grade-plugin
claude-mem: https://github.com/thedotmack/claude-mem

Go vote. Make the data real.
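
For anyone wondering how 20 seeded votes and community votes could combine into one ranking score, here is a rough sketch. The post doesn't describe the site's actual formula, so this is only an assumption: the seed is treated as 20 votes at the aggregated public score, and the result is a plain weighted mean. The `DimensionScore` class, its field names, and the 0-10 scale are all hypothetical.

```python
from dataclasses import dataclass

SEED_VOTES = 20  # from the post: each model starts with 20 seeded votes


@dataclass
class DimensionScore:
    """Running score for one model on one dimension (hypothetical structure)."""
    seed_score: float        # aggregated score from public sources (assumed 0-10 scale)
    vote_total: float = 0.0  # sum of community vote values
    vote_count: int = 0      # number of community votes

    def add_vote(self, value: float) -> None:
        """Record one community vote."""
        self.vote_total += value
        self.vote_count += 1

    @property
    def score(self) -> float:
        # Assumption: the seed counts as SEED_VOTES votes at seed_score,
        # so the score is a weighted mean of seed and community votes.
        total = self.seed_score * SEED_VOTES + self.vote_total
        return total / (SEED_VOTES + self.vote_count)


s = DimensionScore(seed_score=7.0)
for _ in range(20):
    s.add_vote(9.0)
print(s.score)  # 8.0: twenty community votes now weigh as much as the seed
```

Under this assumption, a small seed like 20 means the community overtakes the starting data quickly, which fits the "rest is up to the community" idea.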

Originally posted by u/No_Skill_8393 on r/ClaudeCode
