← All signal stories
§ SignalApr 6, 2026 · Issue 16 · Story 8

HBM Scarcity Is Now a Hard Ceiling on How Fast AI Can Scale

IEEE Spectrum Senior Editor Samuel K.

8. HBM Scarcity Is Now a Hard Ceiling on How Fast AI Can Scale

IEEE Spectrum Senior Editor Samuel K. Moore has published a detailed analysis of the high bandwidth memory (HBM) shortage, identifying AI hyperscalers as the primary driver of a DRAM supply crunch that is directly constraining large language model inference speeds. The piece frames HBM not as a peripheral component issue but as a foundational bottleneck: the faster AI labs want their models to run, the more HBM they need, and supply is not keeping pace with that demand curve.

The competitive stakes here are significant. HBM production is effectively an oligopoly controlled by SK Hynix, Samsung, and Micron, with SK Hynix holding the dominant position as Nvidia's preferred supplier for its H100 and H200 GPU stacks. That means firms like Google, Microsoft, Amazon, and Meta, who are racing to expand inference capacity, are all competing for allocations from the same constrained supply chain. The losers in that allocation fight face real throughput penalties, not just cost increases. Startups building on top of these hyperscaler platforms absorb the downstream latency and pricing pressure without having any leverage to negotiate their way out of it.

The deeper structural signal is that compute scaling narratives have long centered on silicon transistor density and GPU chip counts, but memory bandwidth is increasingly the actual governing constraint on AI performance. Moore's framing aligns with a growing body of technical analysis suggesting that the next meaningful gains in model serving efficiency will come from memory architecture improvements rather than raw parameter scaling alone. If HBM supply remains tight through 2025 and 2026 as projected, it will quietly reshape which players can compete at frontier inference scale and which cannot.

Source: https://spectrum.ieee.org/high-bandwidth-memory-shortage