Nvidia's Vera Rubin goes into full production — and the buying cycle has already started without you.
Six chips shipping concurrently on a single platform. Nvidia's cadence is no longer "one or two new parts a year" — it is a yearly full-stack replacement, and the hyperscalers have voted.
Jensen Huang used his CES keynote to confirm that the Vera Rubin platform — Vera CPU, Rubin GPU, NVLink 6, ConnectX-9, BlueField-4, and Spectrum-X — is in full production, positioned as a 3.5× training and 5× inference uplift over Blackwell. The inference-cost story is the one to watch: Nvidia claims one-seventh the cost per token versus Blackwell, with GPU counts for MoE training down 75%.
Microsoft Azure and Google Cloud both committed to Rubin NVL72 in H2 2026. Azure says its Fairwater sites in Wisconsin and Atlanta were designed for Rubin's power/thermal envelope years ago. Meanwhile, hyperscaler backlogs keep ratcheting up: Amazon at $244B, Google at $240B. The 2–3 year GPU bottleneck Huang describes is the backdrop for every procurement conversation this year.