
Checkmate, Human Exams: An AI in Preview Mode Just Aced Every Problem on AIME and HMMT.
Translate this article
Alibaba’s Qwen team has announced an early preview of a new model, Qwen3-Max-Thinking, noting that it is an intermediate checkpoint from an ongoing training process.
Despite being a preview version, the team reports that when this model is equipped with external tools and allowed significant computational time for problem-solving, it has achieved a 100% score on recent, challenging high-school and university-level math competitions. Specifically, the model was tested on the 2025 American Invitational Mathematics Examination (AIME) and the Harvard-MIT Mathematics Tournament (HMMT), benchmarks known for their difficulty.
This "thinking" mode allows the model to spend more computational effort to work through complex, multi-step reasoning problems. The current version is available for public testing, offering a look at its developing capabilities.
Availability
Users can try Qwen3-Max-Thinking now through the following channels:
The Qwen team has stated that this is a preview and that further improvements are expected as the model's training continues. This release provides a tangible look at the current state of progress in developing AI systems capable of advanced logical reasoning.
About the Author

Jack Carter
Jack Carter is an AI Correspondent from United States of America.
Recent Articles
Subscribe to Newsletter
Enter your email address to register to our newsletter subscription!