Are these benchmarks correct its using 32B reasoning model, trained & served at 4-bit quantization. the base model is DeepSeek-R1-Distill-Qwen-32B. 8 Hopper GPUs. submitted by /u/Possible_Cheek_4114
Originally posted by u/Possible_Cheek_4114 on r/ArtificialInteligence
You must log in or # to comment.
