Nvidia's Jensen Huang cautions that DeepSeek operating on Huawei chips would result in a 'terrible situation' for the United States.

      In brief: Nvidia's CEO Jensen Huang expressed on the Dwarkesh Podcast that if DeepSeek optimizes its AI models for Huawei’s Ascend chips instead of American technology, it would lead to “a horrible outcome” for the United States. This comes as DeepSeek is set to launch its V4 foundation model on the Huawei Ascend 950PR processor. The shift from Nvidia's CUDA to Huawei's CANN framework threatens to disrupt the dependency that has supported American AI leadership, even as U.S. lawmakers push to add DeepSeek to the entity list for export control.

      During the podcast on Wednesday, Huang stated that if DeepSeek adjusts its new AI models to operate on Huawei chips instead of U.S. hardware, it would pose a “horrible outcome” for the United States. This warning highlights the growing alliance between China’s leading AI laboratory and its top chip manufacturer as a direct threat to the technological advantage that has bolstered U.S. AI for the past decade.

      “If future AI models are optimized in a significantly different manner than the U.S. technology stack,” Huang noted, as “AI spreads globally” with Chinese standards, it could result in China becoming “superior to” the United States. This statement is significant coming from the CEO of a company that has thrived under the existing arrangement, where nearly every cutting-edge AI model globally is developed on Nvidia GPUs using Nvidia’s CUDA software framework.

      What DeepSeek is developing

      DeepSeek is set to introduce its V4 multimodal foundation model later this month. Earlier in April, The Information reported that V4 would utilize Huawei’s latest Ascend 950PR processor, while a separate Reuters article claimed that the model was trained on Nvidia’s Blackwell chips, potentially breaching U.S. export regulations. These two assertions are not inherently contradictory, as a model can be trained on one hardware and implemented on another for inference.

      The significance of Huawei integration lies in the accompanying software transition. DeepSeek has spent months adapting its core code to function with Huawei’s CANN framework, moving away from the CUDA ecosystem that Nvidia has built over two decades as the backbone of AI development. CUDA's predominance has provided a layer of American control over AI that extends beyond the hardware itself. While export restrictions can limit the Nvidia hardware available to China, as long as Chinese labs develop their software for CUDA, they remained reliant on the Nvidia ecosystem, even when using alternative chips. DeepSeek's switch to CANN disrupts that reliance.

      DeepSeek's V3 model, unveiled in late 2024, was trained on 2,048 Nvidia H800 GPUs, a chip specifically designed for the Chinese market that was prohibited from sale to China in 2023. The company has demonstrated its ability to produce competitive frontier models with fewer resources than its U.S. counterparts. Its R1 reasoning model matched or exceeded the performance of models that required significantly higher training costs. V4 aims to extend this strategy by proving the company's capacity to succeed without U.S. hardware at all.

      The hardware disparity and its potential implications

      In terms of raw performance, Huawei’s chips lag behind Nvidia’s leading models. The Ascend 910C, the predecessor to the 950PR, achieves roughly 60% of the inference performance of Nvidia’s H100, which itself is two generations behind Nvidia’s latest products. Currently, American chips are about five times stronger than their Chinese counterparts, and projections suggest that this gap could expand to 17 times by 2027. Huawei aims for 750,000 AI chip shipments in 2026, but that would only represent 3 to 5% of Nvidia’s total computing power.

      However, Huang's concern isn't just about the existing performance gap. He stated on the podcast that even if China has less powerful chips, it could still catch up with U.S. AI advancements due to its “abundant energy” and “large pool of AI researchers.” This suggests that raw hardware performance is merely one factor, and that software optimization, talent among researchers, and energy availability could offset disadvantages in silicon. If V4 performs well on Ascend chips, it could validate an independent pathway for AI development that does not depend on Nvidia at any point in the supply chain.

      The export control dilemma

      The scenario highlights a conflict within American chip export policy. Nvidia has resumed production of the H200, a more powerful chip intended for the Chinese market, as Huang confirmed in March. However, China has been blocking imports of the H200 to protect Huawei’s domestic chip sector, and Nvidia's CFO has stated that the company has made no revenue from H200 sales to China. The restrictions aimed at limiting China’s AI capabilities are instead propelling the growth of a Chinese alternative.

      DeepSeek's experience with its R2 model showcases both the potential and limitations of the Huawei strategy. The training of R2 faced multiple delays due to failures on Huawei hardware. Despite encouragement from Chinese authorities for DeepSeek to train using

Other articles

Anthropic's Amodei has a meeting with Wiles and Bessent at the White House regarding access to Mythos and the standoff with the Pentagon. Anthropic CEO Amodei engaged in "productive" discussions with Wiles and Bessent at the White House regarding access to Mythos, indicating a potential easing of the conflict surrounding the Pentagon's blacklisting.

Palantir, Thales, and a startup are in competition to develop the FAA's predictive AI for air traffic. The FAA's SMART system can forecast air traffic conflicts two hours in advance. After systemic issues were revealed at LaGuardia, Palantir, Thales, and Air Space Intelligence are vying for the contract.

OpenAI experiences a triple exit in one day as the product chief, head of Sora, and enterprise CTO all depart. Kevin Weil, Bill Peebles, and Srinivas Narayanan are departing from OpenAI as the organization terminates Sora and disbands OpenAI for Science, marking the continuation of a two-year trend of leadership departures.

Europe's €80 billion investment of public funds in venture capital and scale-ups encounters fundamental growth challenges. The EIF's €15B ETCI 2 fund aims for €80B in scaleup financing, yet Europe's 80% scaleup shortfall and unprofitable unicorns indicate that merely providing capital will not suffice to bridge the gap with the US.

Palantir, Thales, and a startup are vying to develop the FAA's predictive AI for air traffic management. The FAA's SMART system is designed to forecast air traffic conflicts two hours in advance. Following the identification of systemic issues at LaGuardia, Palantir, Thales, and Air Space Intelligence are vying for the contract.

Cursor is in discussions to secure $2 billion at a valuation of $50 billion after achieving a $2 billion annual recurring revenue (ARR) within three years. Cursor is aiming to secure $2 billion at a $50 billion valuation from a16z, Thrive, and Nvidia after achieving $2 billion in annual recurring revenue (ARR) within three years, marking the quickest scaling in B2B history, with 70% of the Fortune 1,000 as its clients.

Nvidia's Jensen Huang cautions that DeepSeek operating on Huawei chips would result in a 'terrible situation' for the United States.

Nvidia CEO Jensen Huang stated that DeepSeek optimizing AI models for Huawei's Ascend chips rather than American hardware would represent "a terrible outcome for the United States."