Plain-English browser test in a single command: two LLMs (one drives, one judges) so the model can't grade its own homework. Open-source, pre-1.0.
eifachposteMB to AI (Reddit RSS) · English · 6 hours ago · v.redd.it