Nvidia acquired Groq for $20 billion and recruited its leading engineers. Now, Groq is looking to raise $650 million for its remaining resources.
Groq is securing $650 million from its current investors to support its inference cloud, according to Axios. This fundraising follows Nvidia's $20 billion not-acqui-hire six months earlier, which compensated Groq’s investors in cash, recruited several senior engineers, and licensed Groq’s hardware technology.
The investors who received payouts in December are now being invited to reinvest. Disruptive and Infinitium have pledged to back the fundraising effort if other existing investors opt not to participate. Thus, the financing is essentially assured.
Interim leadership of the company is provided by CEO Adam Winter and CFO Matt Eng, following the departure of multiple senior staff members to Nvidia due to the December deal. Currently, Groq is focused on its inference cloud business, which enables developers and businesses to run inference-intensive applications on its proprietary Language Processing Unit hardware.
Inference refers to the calculations performed after an AI prompt is given, and it has grown into a significant market, overshadowing model training. Every interaction with AI, such as queries to ChatGPT or responses from Claude, requires inference computing. Economically, purpose-built chips that offer tokens at lower costs and faster speeds than general-purpose GPUs are advantageous.
Groq’s LPU architecture was specifically engineered for this type of workload. The company has delivered its chips to various model providers and cloud clients, consistently outperforming Nvidia’s GPU-based inference in terms of speed measured in tokens per second at comparable price points.
The December deal crafted by Nvidia was atypical. It didn't entail a full acquisition; instead, Nvidia compensated Groq’s investors in cash at what would have been its largest purchase price, licensed Groq’s chip technology, and recruited senior engineers without fully absorbing the company. Consequently, Groq has undergone a financial reset, experienced a loss of senior talent, and is now seeking to rebuild around a more focused yet potentially profitable inference-as-a-service model.
The inference chip sector is currently drawing significant investment. Cerebras recently went public with a valuation of $95 billion based on its inference-optimized technology. Fractile secured $220 million in London for inference chips that integrate compute and memory on the same die. Google is producing millions of Ironwood TPUs designed for inference tasks.
This week, DeepSeek announced a permanent 75% price reduction for its V4 Pro offering, which adversely affects the revenue-per-token structure that inference cloud providers rely on. Groq’s business model depends on its hardware providing tokens at a low enough cost to effectively compete with both GPU-based inference and the pricing of the model providers' APIs. The price drop from DeepSeek intensifies that competitive challenge.
The $650 million investment signifies a belief that dedicated inference hardware will maintain its edge over GPUs, despite Nvidia’s ongoing efforts to enhance its own inference capabilities. Nvidia's Blackwell and forthcoming Vera Rubin architectures aim to narrow the performance gap that has allowed companies like Groq to thrive.
The central question posed by this $650 million is whether Groq can restore its engineering expertise, expand its inference cloud, and sustain a cost advantage amid Nvidia’s hardware advancements and the aggressive price reductions from model providers. The investors who profited from the $20 billion deal are now being asked to place new bets on a smaller, streamlined version of Groq. Two of those investors have committed to backing the fundraising round, reflecting either strong confidence or a sense of obligation.
Other articles
Nvidia acquired Groq for $20 billion and recruited its leading engineers. Now, Groq is looking to raise $650 million for its remaining resources.
Groq's current investors were bought out in what's known as a not-acqui-hire. They have now been requested to reinvest $650 million into the remaining inference cloud operations.
