How Google Cloud CUDs Work
Google Cloud Committed Use Discounts come in two flavours: resource-based and spend-based. Resource-based CUDs lock you into a specific vCPU and memory configuration per region, offering discounts up to 55% on standard compute and a remarkable 70% on memory-optimised instances for three-year commitments. Spend-based CUDs work differently — you commit to a dollar spend floor in a specific region and service family (compute, BigQuery, Cloud SQL, etc.), earning a smaller but more flexible 25% discount on three-year terms. This distinction matters enormously for your effective savings rate. If your workload footprint is stable and predictable, resource-based CUDs win. If you're still optimising or scaling unevenly across machine types, spend-based offers safer optionality.
The key advantage that separates GCP from AWS is automatic sustained-use discounts. Google applies these automatically — up to 30% off — for any compute instance running consistently, with no commitment required. This is not an opt-in; it's automatic. Most enterprise buyers overlook this when comparing gross CUD rates to AWS Savings Plans, but the cumulative benefit is substantial. A 3-year resource-based CUD at 55% stacks with zero additional friction. You don't negotiate sustained-use discounts; you simply benefit from them.
One structural constraint: CUDs are per-project and per-region. If you run the same workload across five GCP projects or two regions, you need separate CUD commitments for each. This lack of flexibility is a real cost driver — many enterprises overshoot commitment sizes because they can't pool. You can layer these constraints into a Google Cloud CUD negotiation conversation with your account team to explore whether they'll offer flexibility on regional portability, though the answer is usually no. Explore what's possible with a Private Pricing Agreement to see if multi-project consolidation is negotiable.