Original Reddit post

Are these benchmarks correct its using 32B reasoning model, trained & served at 4-bit quantization. the base model is DeepSeek-R1-Distill-Qwen-32B. 8 Hopper GPUs. submitted by /u/Possible_Cheek_4114

Originally posted by u/Possible_Cheek_4114 on r/ArtificialInteligence