Original Reddit post

Are these benchmarks correct its using 32B reasoning model, trained & served at 4-bit quantization. the base model is DeepSeek-R1-Distill-Qwen-32B. 8 Hopper GPUs. submitted by /u/Possible_Cheek_4114

Originally posted by u/Possible_Cheek_4114 on r/ArtificialInteligence

  • Eheran@lemmy.world
    link
    fedilink
    English
    arrow-up
    1
    ·
    58 minutes ago

    Why are you comparing to GPT4o? That is from 2 years ago. Same seems to apply to the others.