On March 6, Alibaba released and open-sourced its new reasoning model, QwQ-32B, which has 32 billion parameters. Despite being far smaller than DeepSeek-R1, which has 671 billion parameters (37 billion of them active), QwQ-32B matches its performance on various benchmarks. QwQ-32B excelled in math and coding tests, outperforming OpenAI’s o1-mini and the distilled versions of DeepSeek-R1. It also scored higher than DeepSeek-R1 in some evaluations, such as LiveBench and IFEval. The model leverages reinforcement learning and integrates agent capabilities for critical thinking and adaptive reasoning. Notably, QwQ-32B requires much less computational power, making it deployable on consumer-grade hardware. The release aligns with Alibaba’s AI strategy, which includes significant investments in cloud and AI infrastructure. Following the release, Alibaba’s US-listed shares rose 8.61% to $141.03, and its Hong Kong shares rose more than 7%. [Jiemian, in Chinese]