AWS GPU prices rise by 20% due to memory shortages.

AWS GPU prices rise by 20% due to memory shortages.

      Renting an AI chip is increasingly resembling booking a hotel in a fully booked city. You pay to secure the room, and the rates continue to escalate. On AWS, this just happened again.

      Amazon Web Services has increased the prices for EC2 Capacity Blocks for machine learning by about 20% starting in July. This change was first reported by Business Insider, and AWS has confirmed it. This service allows companies to reserve Nvidia GPUs in advance, ensuring that lengthy training runs continue without interruption.

      This marks the second price hike in six months, with AWS having already raised the same prices by around 15% in January. When combined, the cost of securing this computing power has surged significantly since the start of the year. AWS indicated that the prices fluctuate “periodically based on supply and demand.”

      The increase is specific rather than widespread. It affects one purchasing option: the reserved blocks preferred by serious AI teams that are training or refining large models. According to AWS, other options maintain fixed prices, which the company claims will remain stable. The price increase did not affect Trainium, Amazon’s proprietary AI chip, as per The Information.

      The extent of this increase is important due to the larger implications. As the largest cloud provider globally, AWS hosts numerous AI services. When the prices for its highest-tier compute rise, the effects reverberate to the start-ups and enterprises utilizing these services. AWS characterized this decision as evidence of the escalating demand for GPUs.

      The reason the cloud could not maintain low costs

      The notable shift to observe is where the constraints have now developed. For the past two years, the limitation on AI was mainly software and expertise. Currently, it is physical resources. The bottleneck now lies with high-bandwidth memory, the chips that accompany AI processors, and there is a finite supply of it. As Business Insider highlighted, the limitation on AI has transitioned from software to hardware, and creating hardware requires years.

      This situation is direct and unforgiving. A shortage of memory results in fewer GPUs. With fewer GPUs, there are fewer data centers. “With a cap on how much memory can be produced, there’s a cap on GPU production, which in turn limits the number of data centers that can be constructed,” noted Peter Berezin, chief economist at BCA Research, on X.

      Such scarcity gives cloud providers leverage. When GPU availability is limited, customers have few options, allowing AWS, Microsoft, Google, and Oracle to pass increased costs directly onto them. The shortage raises their operational expenses, Berezin pointed out, but it also maintains demand that exceeds supply, granting them pricing authority over who can access computing resources.

      A widespread price increase

      Amazon is not the only company experiencing this trend, and that is significant. Apple recently raised prices across its Mac and iPad lineups, attributing the hike to memory costs. Xbox has done the same. Elon Musk referred to the rise in memory prices as the largest increase he has witnessed in any product. Now, these same memory costs are reflecting in cloud billing.

      On the flip side of the shortage is a financial boon. The rising AWS GPU prices have elevated the market valuations of memory manufacturers Micron and SK Hynix to record levels. Investors are betting that the high-bandwidth memory shortage will keep the market constrained, causing prices to remain elevated for years.

      For AWS customers, the implication is clear. The lowest cost for AI computing is now behind them, and pressing the reserve button has become more expensive. The lingering question is how far this trend will extend. If a 15% increase led to another 20% rise in six months, the teams developing the most ambitious models are left uncertain about the cost of their next reservation.

Other articles

Prosus profits nearly double as its investments in e-commerce and Tencent stake yield positive results. Prosus announced approximately $7.3 billion in revenue for the year ending March 2026, with headline earnings per share increasing by 91–100% due to improved performance in e-commerce and Tencent. Peec AI increases its valuation to $200 million following the GEO investment. Peec AI increases its valuation to $200 million following the GEO investment. Peec AI is said to be raising funds at a valuation of $200 million, which is double its worth from eight months prior, as it bets that GEO is emerging as the new SEO in the age of AI. A bidirectional pixel that combines both screen and camera functionalities. A bidirectional pixel that combines both screen and camera functionalities. ETH Zurich's bidirectional pixel is capable of both emitting and detecting light, suggesting the potential for screens that function as cameras, in addition to applications in holography and optical computing. China's CXMT reportedly secures a $3 billion memory contract with Tencent. China's CXMT reportedly secures a $3 billion memory contract with Tencent. CXMT has reached an agreement to provide Tencent with approximately $3 billion worth of memory chips, according to two sources who spoke to Reuters, further advancing China's goal for self-sufficiency in memory production. Amazon's swift delivery initiative has led to a $15 billion loss for India's Eternal and Swiggy. Amazon's swift delivery initiative has led to a $15 billion loss for India's Eternal and Swiggy. Eternal and Swiggy have seen their market value decline by over $15 billion as Amazon broadens its 'delivery in minutes' service throughout India. OpenAI has appointed Paul Meade, the head of Apple’s Vision Pro division. OpenAI has appointed Paul Meade, the head of Apple’s Vision Pro division. Paul Meade, the head of Apple’s Vision Pro, is departing to join OpenAI’s hardware team, marking the highest-profile departure from Apple in the competition for AI devices.

AWS GPU prices rise by 20% due to memory shortages.

AWS GPU prices have increased by approximately 20%, marking the second increase this year, as the AI memory shortage impacts the cloud and provides hyperscalers with pricing leverage.