Personal Lemmy
  • Communities
  • Create Post
  • Create Community
  • heart
    Support Lemmy
  • search
    Search
  • Login
  • Sign Up
eifachposteMB to AI (Reddit RSS)English · 2 days ago

Independently maintained SWE-bench historical data to keep AI providers honest · Issue #445 · SWE-bench/experiments

github.com

external-link
message-square
0
link
fedilink
1
external-link

Independently maintained SWE-bench historical data to keep AI providers honest · Issue #445 · SWE-bench/experiments

github.com

eifachposteMB to AI (Reddit RSS)English · 2 days ago
message-square
0
link
fedilink
Publish a dashboard with historical results per model, updated daily · Issue #445 · SWE-bench/experiments
github.com
external-link
Following up on https://www.reddit.com/r/codex/comments/1tglnmq/comment/omh6tir/ I am requesting a live dashboard that would allow users to view each model's performance over time, ideally updated ...

Original Reddit post

Originally posted by u/cowwoc on r/ClaudeCode

alert-triangle
You must log in or # to comment.

AI (Reddit RSS)

ai_reddit

Subscribe from Remote Instance

Create a post
You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: !ai_reddit@lemmy.durstig.online

AI (Reddit RSS Feed)

Visibility: Public
globe

This community can be federated to other instances and be posted/commented in by their users.

  • 19 users / day
  • 112 users / week
  • 260 users / month
  • 701 users / 6 months
  • 1 local subscriber
  • 29 subscribers
  • 14.5K Posts
  • 229 Comments
  • Modlog
  • mods:
  • eifachposte
  • BE: 0.19.15
  • Modlog
  • Instances
  • Docs
  • Code
  • join-lemmy.org