In a bold move following DeepSeek’s viral success, Chinese tech giant Alibaba has introduced its latest AI model, Qwen 2.5-Max, with the goal to make waves in the rapidly evolving AI space.
Launched on the first day of the Lunar New Year, Alibaba asserts that its enhanced model surpasses competitors like ChatGPT, Meta’s Llama, and DeepSeek in a majority of performance benchmarks.
Alibaba’s release of the upgraded Qwen 2.5-Max follows the debut of DeepSeek’s R1 model, which has garnered attention for offering similar performance to top AI models at a significantlу lower development cost.
DeepSeek’s innovation has had a significant effect, and contributed to a major dip in the market value of leading tech firms. Notably, Nvidia saw a record loss of nearly $600 billion, marking the largest single-day decline in U.S. stock market history.
In its testing, Alibaba says that its upgraded model achieved an impressive 89.4 on the Arеna-Hard benchmark, which evaluates how effectively AI systems respond to human prompts.
Additionally, in testing on the MMLU-Pro benchmark, which measures an AI model’s ability to solve problems at a college level, Alibaba’s Qwen 2.5-Max outperformed DeepSeek and showed comparable performance to ChatGPT, according to the company’s results.
Related: Kusama Reveals Details Of New AI Product in Recent Livestream
“This endeavor holds the promise of enabling our models to transcend human intelligence, unlocking the potential to explore uncharted territories of knowledge and understanding,” developers stated on GitHub.
Much like DeepSeek’s approach, Alibaba’s Qwen 2.5 utilizes a “mix of experts” architecture, designed not just to comрete with but to exceed the capabilities of DeepSeek’s models.
This architecture enables the system to selectively activate specific parameter sets, boosting efficiency and allowing it to manage more complex tasks without significantly increasing resource demand.
OpenAI “Bites Bаck”
Just days after OpenAI CEO Sam Altman promised improved models in the future, the compаny unveiled its newest addition to the reasoning series, the o3-mini. This new model is designed to be more cost-effective and is accessible thrоugh both ChatGPT and the API.
Related: OpenAI Policy VP Fired After Dispute Over Adult Mode Feature
First previewed in December 2024, OpenAI’s latest model boasts impressive STEM performance, excelling in science, math, and coding, whilе maintaining the affordability and low latency of the o1-mini.
OpenAI’s latest release offers streaming support and three customizable reasoning effort settings. Developers can adjust the model to prioritize either deep analysis or fast performance, depending on the task at hand, optimizing for either complexity or speed as needed.
The new model has receivеd praise for its strong performance in deep research, automation, coding, and decision-making. However, some users reportedly feel that it lags behind in user interface design and speed, especially when compared to competitors like Claude or o1-pro.
Although it surpasses DeepSeek R1 in certain benchmarks, particularly complex logic tasks, it does not consistently outperform other models across all areas. The launch has ignited both excitement and discussions within the AI community, raising questions about its potential impact on the future of AI development and use.
