Nvidia's Jensen Huang cautions that DeepSeek operating on Huawei processors would lead to a 'terrible result' for the United States.
In summary, Nvidia CEO Jensen Huang cautioned on the Dwarkesh Podcast that if DeepSeek optimizes its AI models for Huawei’s Ascend chips rather than American hardware, it would represent “a horrible outcome” for the United States. This warning comes as the Chinese AI lab prepares to release its V4 foundation model on Huawei’s Ascend 950PR processor. The shift from Nvidia’s CUDA to Huawei’s CANN framework poses a potential disruption to the software-hardware reliance that has supported American AI dominance, even as US legislators are advocating to place DeepSeek on the entity list for export control.
During the podcast, Huang expressed concern that if future AI models are optimized differently from the American tech stack, and as “AI spreads to the rest of the world” using Chinese standards and technology, China could surpass the US. This remark is significant, given that Nvidia has greatly benefited from the current scenario where nearly every cutting-edge AI model is trained on Nvidia GPUs with the CUDA software framework.
Regarding DeepSeek’s developments, the lab is set to launch V4, a multimodal foundation model, expected later this month. Earlier reports indicated that V4 would operate on Huawei’s latest Ascend 950PR processor. Another report suggested that the model was trained using Nvidia’s Blackwell chips, potentially violating US export controls; however, these claims are not necessarily contradictory since a model can be trained on one type of hardware and utilized for inference on another.
The key aspect of the Huawei integration is the software migration involved. DeepSeek has invested months in rewriting its core code to function with Huawei’s CANN framework, transitioning away from the CUDA ecosystem that Nvidia has cultivated for two decades. CUDA’s dominance has functioned as an additional layer of American influence over AI, beyond just the hardware. Export restrictions can limit the Nvidia hardware that reaches China, but as long as Chinese labs are developing their software for CUDA, they remain reliant on the Nvidia ecosystem even when using different processors. DeepSeek’s shift to CANN eliminates that reliance.
DeepSeek’s V3 model, which was launched in late 2024, was trained on 2,048 Nvidia H800 GPUs, designed specifically for the Chinese market but banned from sale to China in 2023. The company has already demonstrated its capability to create frontier-competitive models with fewer resources than its American counterparts. Its R1 reasoning model showed performance that matched or exceeded that of models requiring significantly larger training costs. V4 aims to extend this methodology by proving the company can succeed entirely without American hardware.
In terms of raw performance, Huawei’s chips currently lag behind Nvidia’s offerings. The Ascend 910C, the previous version to the 950PR, delivers around 60% of the inference performance of Nvidia’s H100, which itself is two generations older than Nvidia's leading chip. Today, American chips exhibit power levels approximately five times greater than those from China, with projections indicating this will expand to a 17-fold difference by 2027. Huawei aims for 750,000 AI chip shipments by 2026, yet its total output accounts for only 3 to 5% of Nvidia’s overall computing power.
However, Huang's concerns do not stem from the existing performance gap. He stated on the podcast that even with inferior chips, China could still catch up in AI development due to its “abundant energy” and “large pool of AI researchers.” This suggests that raw hardware performance is merely one factor, and aspects such as software optimization, researcher talent, and energy access could mitigate silicon disadvantages. If V4 delivers strong performance on Ascend chips, it could validate an alternative AI development pathway that doesn’t rely on Nvidia at any stage of the supply chain.
The scenario also highlights a contradiction within American chip export policy. Nvidia resumed production of the H200, a more powerful chip, for sale in China, as confirmed by Huang in March. Nevertheless, China has been blocking imports of the H200 to protect Huawei’s domestic chip market, and Nvidia’s CFO reported no revenue from H200 sales in China. The restrictions intended to curb China’s AI capabilities appear to be expediting the growth of domestic Chinese alternatives.
DeepSeek’s experience with its R2 model showcases both the potential and limitations of the Huawei route. R2 faced multiple delays due to training failures on Huawei hardware. Although Chinese authorities encouraged DeepSeek to conduct training on indigenous chips, the company faced stability issues, leading them to revert to Nvidia GPUs for training while using Huawei chips solely for inference. This distinction is important, given that training is the most compute-intensive aspect of AI development, and the inability for Huawei chips to support it reliably suggests a genuine hardware discrepancy. However, inference, where models generate user-facing value, appears achievable with Huawei’s chips.
Meanwhile, US lawmakers are intensifying efforts to impose stricter regulations. Recently, lawmakers and experts accused China of acquiring "what they can" and stealing "what they cannot" in the AI sector,
Other articles
Nvidia's Jensen Huang cautions that DeepSeek operating on Huawei processors would lead to a 'terrible result' for the United States.
Nvidia's CEO Jensen Huang states that DeepSeek's optimization of AI models for Huawei's Ascend chips, rather than American hardware, would be "a terrible consequence for the United States."
