Huawei CloudMatrix AI has reached a significant milestone in performance, surpassing Nvidia’s H800 GPUs in running DeepSeek’s R1 artificial intelligence model. A technical paper released by Huawei researchers highlights the achievements of the CloudMatrix384 architecture, which integrates Ascend 910C NPUs and Kunpeng CPUs in a supernode connected by a Unified Bus.
The performance metrics showcased by Huawei demonstrate impressive results, with the system achieving high throughput and efficiency ratings compared to competing systems. The system’s design, which enables direct communication and dynamic resource pooling, addresses challenges in AI infrastructure for large language model operations.
Despite the lack of third-party validation, Huawei’s claims suggest a leap forward in AI hardware innovation. The technical innovations behind the system, such as peer-to-peer serving architecture and hardware optimizations, contribute to its efficiency and performance.
In the geopolitical context of US-China tech tensions, Huawei’s advancements in AI hardware signal the company’s commitment to technological competitiveness. The research paper aims to build confidence in Chinese-developed NPUs as a viable alternative to Nvidia’s GPUs.
While independent verification is necessary to validate Huawei’s claims, the industry can glean insights from the technical approaches described in the paper. The intense competition in AI hardware underscores the importance of computational efficiency and innovation in driving industry progress.
[Photo by Shutterstock]
[See also: From cloud to collaboration: Huawei maps out AI future in APAC]
Want to learn more about cybersecurity and the cloud from industry leaders? Check out Cyber Security & Cloud Expo taking place in Amsterdam, California, and London.
Explore other upcoming enterprise technology events and webinars powered by TechForge [here].



