News

First, we evaluated the full model using the QwQ demo on Hugging Face. Then, we tested a 4-bit quantized version on a 24 GB GPU (Nvidia RTX 3090 or AMD Radeon RX 7900 XTX) to assess the impact of ...