Understanding BigQuery Pricing Models

BigQuery pricing has evolved into a two-tier system designed to serve both volatile, unpredictable workloads and stable, high-volume analytics operations. The choice between models is the single largest lever in your Google Cloud data platform economics, often worth hundreds of thousands of dollars annually for enterprise-scale analytics teams. Unlike licensed software, where pricing is fixed, BigQuery's consumption-based model creates significant optimization opportunities.

Google recently introduced BigQuery Editions, a major shift that combines compute capacity, feature access, and pricing into unified packages. This change replaced the legacy flat-rate and flex-slot model, introducing three edition tiers: Standard, Enterprise, and Enterprise Plus. Critically, editions are not just feature tiers—they fundamentally change slot pricing, available features, and reservation capacity limits.

On-Demand Query Pricing

The on-demand model charges $6.25 per tebibyte (TiB) of data scanned, with the first 1 TiB per month free. This pricing applies whether you run 1 query or 10,000 queries. The key word is scanned—not returned, not processed, but actually read by BigQuery. A single unfiltered SELECT * query scanning your entire 500 GB table costs the same whether you retrieve 1 row or 10 million rows.

There is a hard capacity ceiling: on-demand queries in a project draw from a shared pool of roughly 2,000 slots. Note that this is a limit on slots—units of parallel compute—not on query count. If your workload demands more parallelism than 2,000 slots can provide, queries queue rather than scale further. For most organizations, this is never a problem. But for real-time dashboards serving thousands of concurrent users, or batch ETL processes that heavily parallelize, this constraint becomes binding.

On-demand pricing offers extreme flexibility. You can spin up ad-hoc analytics, run expensive exploratory queries, and only pay when you actually scan data. There is zero commitment, zero minimum spend. For teams with unpredictable query patterns—like data scientists experimenting or ad-hoc business intelligence requests—on-demand is often the correct choice.
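As a back-of-envelope check, the on-demand mechanics above reduce to a one-line cost model. A minimal sketch, using the US list rate quoted in this article (regional rates vary):

```python
# Sketch of on-demand billing: $6.25 per TiB scanned, first 1 TiB free.
ON_DEMAND_RATE_PER_TIB = 6.25
FREE_TIB_PER_MONTH = 1.0

def on_demand_monthly_cost(tib_scanned: float) -> float:
    """Monthly on-demand query cost, net of the free tier."""
    billable = max(tib_scanned - FREE_TIB_PER_MONTH, 0.0)
    return billable * ON_DEMAND_RATE_PER_TIB

# A team scanning 40 TiB in a month pays for 39 billable TiB:
print(on_demand_monthly_cost(40))  # 243.75
```

Note that the model needs only one input—TiB scanned—which is exactly why reducing scan volume is the dominant optimization under this pricing.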

Capacity Pricing and BigQuery Editions

Capacity pricing inverts the economics entirely. Instead of paying per query, you pay for slots—units of BigQuery compute capacity, roughly analogous to virtual CPUs—by the slot-hour, optionally under longer-term commitments. BigQuery's autoscaler consumes available slots when queries run and releases them when queries finish.

Google now bundles slot pricing into three Edition tiers:

  • Standard Edition: $0.04 per slot-hour (US regions). This tier includes essential BigQuery features but limits you to 10 named reservations across your organization. Autoscaling is available for pay-as-you-go capacity.
  • Enterprise Edition: $0.06 per slot-hour. Unlocks advanced features including multi-region replication, premium support, and up to 200 named reservations. Fine-grained access controls and enhanced SLOs.
  • Enterprise Plus Edition: $0.10 per slot-hour at US list, though large deals are routinely negotiated. Maximum feature set, same reservation limits as Enterprise.

Editions matter because they are not merely cosmetic. The edition you run determines feature availability, reservation limits, and workload-management capabilities such as idle slot sharing. If your workload requires multi-region replication or advanced compliance controls, you cannot get them on Standard slots—you must upgrade the Edition.

Slot Commitment Discounts (CUDs)

At Google Cloud Next 2025, Google launched Committed Use Discounts specifically for BigQuery capacity. This was a watershed moment for enterprise FinOps. Prior to CUDs, the only commitment vehicles were Private Pricing Agreements (PPAs). Now, organizations can lock in lower slot rates through 1-year and 3-year purchase agreements.

CUD parameters:

  • 1-year or 3-year terms
  • Minimum 50-slot commitment (in 50-slot increments)
  • Pricing discount compounds based on term length and organization commitment
  • Applies only to the Edition you purchase; commitments are offered on Enterprise and Enterprise Plus capacity, while Standard remains pay-as-you-go

A 3-year commitment on 100 Enterprise Edition slots can reduce the effective slot cost by 20-30% versus 1-year rates, and significantly more versus PAYG autoscaling. This is where enterprise deals get negotiated. Alphabet's fiscal year follows the calendar year, ending December 31, so quarter-ends—and late December above all—are peak windows for procurement cycles and CUD discussions.
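To see what such a discount is worth in absolute terms, here is a minimal sketch. The 40% discount is an illustrative assumption, not a quoted rate—actual CUD pricing depends on edition, term, volume, and negotiation:

```python
HOURS_PER_MONTH = 730
ENTERPRISE_PAYGO = 0.06  # $ per slot-hour, US list

def monthly_slot_cost(slots: int, rate: float, discount: float = 0.0) -> float:
    """Monthly cost of holding `slots` continuously at a (discounted) rate."""
    return slots * HOURS_PER_MONTH * rate * (1 - discount)

paygo = monthly_slot_cost(100, ENTERPRISE_PAYGO)
committed = monthly_slot_cost(100, ENTERPRISE_PAYGO, discount=0.40)
print(round(paygo, 2), round(committed, 2))  # 4380.0 2628.0
```

At 100 slots the hypothetical commitment saves about $1,750 per month—enough to justify serious forecasting work before signing.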

Capacity pricing becomes economically dominant when your organization consistently burns at least 100 slots. Below that, the math favors on-demand. Above that, it almost always favors reservations, assuming reasonable utilization.

The Break-Even Analysis: 467 TiB Per Month

Determining whether to move from on-demand to capacity requires concrete mathematics. Here is the break-even calculation:

On-demand cost = (TiB scanned per month) × $6.25
Capacity cost = (slot count) × 730 hours/month × slot rate

Standard Edition baseline: 100 slots at $0.04/slot-hour = 100 × 730 × $0.04 = $2,920 per month.

For on-demand to cost the same: $2,920 ÷ $6.25 = 467 TiB scanned per month.

If your organization scans fewer than 467 TiB monthly, on-demand is mathematically cheaper. If you consistently exceed 467 TiB, capacity becomes more economical. The difference becomes dramatic at higher scan volumes. An organization scanning 2,000 TiB monthly on-demand pays $12,500 per month. If 400 Standard slots can absorb that workload, the same month costs 400 × 730 × $0.04 ≈ $11,680. At massive scale, the savings are compounded further by CUDs and PPAs.

But the analysis must account for actual utilization. If you commit to 100 slots and only use 30 slots' worth of compute, you are paying for 70 idle slots. BigQuery's autoscaling model and slot sharing mechanisms help, but there is no free lunch. Capacity commitments demand strong forecast confidence.
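The break-even formulas above translate directly into code; this sketch simply restates the arithmetic:

```python
ON_DEMAND_RATE = 6.25  # $ per TiB scanned
HOURS_PER_MONTH = 730

def break_even_tib(slots: int, slot_rate: float) -> float:
    """Monthly TiB scanned at which on-demand cost equals a reservation."""
    return slots * HOURS_PER_MONTH * slot_rate / ON_DEMAND_RATE

# 100 Standard Edition slots at $0.04/slot-hour:
print(round(break_even_tib(100, 0.04), 1))  # 467.2
```

Rerun it with your own slot counts and edition rates to find your organization's crossover point.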

Peak Demand vs. Baseline Usage

Enterprise analytics teams rarely run at constant load. Morning dashboards spike, nightly ETL batches run heavy, weekend analytics are light. A pattern might look like: baseline 50 slots, peak 150 slots, valley 20 slots. In this scenario, do you commit to 150 slots to handle peak? Or do you commit to 50 and allow autoscaling to burst?

Autoscaling at PAYG rates lets you spike above committed capacity. So a sensible strategy is:

  • Commit to the baseline load (e.g., 50 slots) under a CUD
  • Allow autoscaling above that at PAYG rates for peak demand
  • Achieve a blended model: committed discount on base, flexibility on peak

This hybrid approach is where most enterprise customers land. And this is where negotiation leverage emerges. At enterprise scale with committed spend, Google becomes willing to discuss custom CUD rates, blended commitments, and favorable contract terms.
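A simple model of the hybrid strategy makes the trade-off concrete. The committed rate and burst hours below are hypothetical inputs, not quoted prices:

```python
HOURS_PER_MONTH = 730

def blended_monthly_cost(baseline_slots: int, peak_slots: int,
                         peak_hours: float, committed_rate: float,
                         paygo_rate: float) -> float:
    """Committed baseline running all month, plus PAYG burst at peak.

    Assumes the burst above baseline runs only during peak_hours.
    """
    base = baseline_slots * HOURS_PER_MONTH * committed_rate
    burst = max(peak_slots - baseline_slots, 0) * peak_hours * paygo_rate
    return base + burst

# 50 committed slots at an assumed discounted $0.03/slot-hour, bursting
# to 150 slots for ~88 hours a month at a $0.04 PAYG rate:
print(round(blended_monthly_cost(50, 150, 88, 0.03, 0.04), 2))  # 1447.0
```

Comparing this against committing to the full 150 slots (150 × 730 × committed rate) shows exactly how many burst hours your workload can sustain before over-committing becomes the cheaper option.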

Slot Architecture and Reservation Management

Slots are virtual CPUs allocated for a fixed duration. BigQuery autoscaler monitors queue depth and query runtime, spinning up slots as needed. When there is no work, slots sit idle (but still incur costs if committed). Reservations are named pools of slots that you can assign to specific projects or folders within your Google Cloud organization.

Reservations and Assignments

A typical enterprise setup might include:

  • Production reservation: 100 committed slots for critical analytics and reporting
  • Development reservation: 30 committed slots for ETL testing and data engineering work
  • Ad-hoc reservation: 20 committed slots for business analytics and exploration
  • Auto-scaler: Remaining budget for peak bursts across all projects

Each reservation is then assigned to one or more projects via the slot assignment mechanism. This creates isolation and cost accounting. You can measure spend and utilization per business unit, per team, per application.

Standard Edition limits you to 10 reservations, while Enterprise allows 200. This sounds cosmetic, but for large organizations with many data teams, it is operationally critical. You cannot do fine-grained cost allocation with only 10 reservations.

Idle Slot Sharing and Utilization

BigQuery's slot-sharing mechanism is underrated. If your Production reservation has 100 slots allocated but only 60 in use at a given moment, the idle 40 can be borrowed by workloads in other reservations. This reduces waste and improves utilization rates. Note, however, that idle slot sharing is an Enterprise and Enterprise Plus capability—Standard Edition reservations do not share idle slots—and individual reservations can also be configured not to share, for performance isolation or compliance guarantees.

Optimizing utilization requires discipline. Use the INFORMATION_SCHEMA.JOBS table to query historical slot usage. Build dashboards in Cloud Monitoring that show slot consumption by project, by user, by query type. You'll find patterns: 20% of queries consume 80% of capacity, certain times of day spike, certain projects are wasteful. With this visibility, you can rebalance commitments, tune queries, or right-size reservations.
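As a starting point for that analysis, the per-job total_slot_ms column in INFORMATION_SCHEMA.JOBS rolls up into average slot consumption over a window. This sketch works on hypothetical sample records rather than a live query:

```python
# Turn per-job slot-milliseconds (as exposed by the total_slot_ms column
# of INFORMATION_SCHEMA.JOBS) into average concurrent slots used.
def average_slots(jobs: list, window_hours: float) -> float:
    """Average concurrent slots over a window, from per-job slot-ms."""
    total_slot_ms = sum(j["total_slot_ms"] for j in jobs)
    window_ms = window_hours * 3600 * 1000
    return total_slot_ms / window_ms

# Hypothetical sample records pulled from the JOBS view:
jobs = [
    {"job_id": "etl_nightly", "total_slot_ms": 9_000_000_000},
    {"job_id": "dash_refresh", "total_slot_ms": 1_800_000_000},
]
# 10.8e9 slot-ms over a 30-hour window = 100 slots on average:
print(average_slots(jobs, window_hours=30))  # 100.0
```

Comparing this average against your committed slot count is the quickest way to spot chronic over- or under-provisioning.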

Cost Optimization Levers

No matter which pricing model you choose, you can reduce costs through architectural and query-level optimization.

Query Optimization: SELECT * and Partitioning

The single largest cost killer is SELECT *. Every column scanned incurs bytes-scanned cost. If your table has 100 columns and you only need 5, you are paying for 95 columns of waste.

For example: a 10 GB table with 100 equally sized columns. At $6.25 per TiB, SELECT * scans all 10 GB and costs about $0.061 per run. Selecting 5 specific columns scans roughly 0.5 GB and costs about $0.003. That 20x difference compounds across thousands of queries daily.

Partitioning and clustering tables by date, region, or customer ID allows BigQuery's query optimizer to prune entire partitions. A partitioned table scanned only for the past 7 days will scan roughly 52x less data than an unpartitioned version scanned for the full year (7 days versus 365). For on-demand customers, this is a massive cost lever.
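Both levers—column pruning and partition pruning—fall out of the same bytes-scanned arithmetic. A sketch, assuming equally sized columns and evenly distributed daily data:

```python
ON_DEMAND_RATE = 6.25  # $ per TiB scanned

def scan_cost(gib_scanned: float) -> float:
    """On-demand cost of scanning the given number of GiB."""
    return gib_scanned / 1024 * ON_DEMAND_RATE

full_table   = scan_cost(500)            # unfiltered SELECT * over 500 GiB
five_columns = scan_cost(500 * 5 / 100)  # 5 of 100 equally sized columns
one_week     = scan_cost(500 * 7 / 365)  # date-partitioned, last 7 days
print(round(full_table, 4), round(five_columns, 4), round(one_week, 4))
```

Combining the two—5 columns over 7 days of partitions—multiplies the savings, which is why well-modeled tables are the cheapest tables.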

Authorized Views and Materialized Views

An authorized view lets consumers query the view's results without being granted access to the underlying tables. This is useful for enforcing column-level security without duplicating data. Materialized views cache query results and are refreshed automatically and incrementally, avoiding repeated full-table scans from multiple downstream queries.

If your analytics team runs the same customer segmentation query 50 times daily, use a materialized view. The first query scans the base table fully. Subsequent queries run against the cached result set, scanning only incremental changes. This can reduce scan volume by 90%+ for stable, repetitive analysis.
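The scan-volume savings are easy to estimate. The table size and per-refresh delta below are hypothetical numbers chosen for illustration:

```python
# Hypothetical workload: 50 identical runs per day over a 20 GiB base
# table, vs. a materialized view where each run after the first scans
# only ~0.5 GiB of incremental change.
runs, base_gib, delta_gib = 50, 20.0, 0.5

without_mv = runs * base_gib                 # full scan on every run
with_mv = base_gib + (runs - 1) * delta_gib  # one full scan + deltas
savings = 1 - with_mv / without_mv
print(without_mv, with_mv, round(savings, 2))
```

Under these assumptions the materialized view eliminates over 95% of daily scan volume, which is the kind of result that makes the feature pay for itself on the first day.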

Per-Query Byte Limits with SET Statements

Runaway queries happen. A misconfigured query or a logic error can scan your entire data warehouse. Use BigQuery's SET statement to impose per-query scan limits:

SET @@script.bytes_processed_limit = 10737418240;  -- 10 GiB limit

If a query in the script would exceed 10 GiB of scanning, BigQuery cancels it. This prevents a single bad query from consuming months' worth of budget.
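The same guardrail can be enforced application-side before a job is ever submitted, using the byte estimate a dry run returns. A sketch of the pattern—the estimate is passed in as a plain number here rather than fetched from a live dry-run job:

```python
# Guardrail pattern: check an estimated scan size against a budget
# before submitting the query. `estimated_bytes` would come from a
# BigQuery dry run in practice.
LIMIT_BYTES = 10 * 1024**3  # 10 GiB

class ScanBudgetExceeded(Exception):
    """Raised when a query's estimated scan exceeds the budget."""

def check_scan_budget(estimated_bytes: int, limit: int = LIMIT_BYTES) -> int:
    if estimated_bytes > limit:
        raise ScanBudgetExceeded(
            f"query would scan {estimated_bytes} bytes (limit {limit})")
    return estimated_bytes
```

Wiring this check into your query-submission path gives every caller the same protection, regardless of whether they remembered the SET statement.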

Storage Costs and Time-Travel

BigQuery separates compute (query) costs from storage costs. Storage is priced at $0.02 per GB for active tables, $0.01 per GB for long-term storage (table not modified for 90+ days). While not as flashy as compute pricing, storage economics matter at scale.

BigQuery also includes time-travel and fail-safe features. Time-travel lets you query snapshots of a table at any point within the past 7 days (the window is configurable down to 2 days). Fail-safe preserves deleted data for a further 7 days after the time-travel window expires. Both features are automatic, but both consume storage. If your data is highly volatile (frequent deletes, updates, modifications), fail-safe and time-travel storage can silently inflate your bill.
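Storage billing is simple enough to model directly. The volumes below are hypothetical; the rates are the list prices quoted above:

```python
ACTIVE_RATE = 0.02     # $ per GB-month, active storage
LONG_TERM_RATE = 0.01  # $ per GB-month after 90 days unmodified

def storage_monthly_cost(active_gb: float, long_term_gb: float) -> float:
    """Monthly storage bill for a given active/long-term split."""
    return active_gb * ACTIVE_RATE + long_term_gb * LONG_TERM_RATE

# 10 TB total, 70% of it untouched for 90+ days:
print(round(storage_monthly_cost(3_000, 7_000), 2))  # 130.0
```

The long-term tier kicks in automatically, so the optimization here is mostly about not touching tables you don't need to touch—every modification resets the 90-day clock.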

Enterprise Negotiation: CUDs and PPAs

At enterprise scale—typical threshold is $500K+ annual Google Cloud spend, or specific strategic initiatives—negotiation becomes available through either CUDs or PPAs.

Committed Use Discounts (CUDs) for BigQuery

BigQuery CUDs launched in 2025. They follow the standard Google CUD mechanics: 1-year or 3-year commitments, volume discounts that increase with term length, and typically a 15-40% discount against PAYG rates depending on term and volume. CUDs are self-service: you can purchase them through the Google Cloud console.

But negotiate the CUD rates themselves. For large commitments (e.g., 500+ slots across multiple editions), Google will sometimes improve the discount. This is where your GCP negotiation leverage framework matters.

Private Pricing Agreements (PPAs)

A Google Cloud PPA negotiation offers much deeper customization. In a PPA, you can negotiate:

  • Custom slot rates for Standard, Enterprise, and Enterprise Plus editions
  • BigQuery storage pricing
  • BigQuery ML and analytics extension pricing
  • Cross-product commitments (GCS + BigQuery + Compute) at blended rates
  • Custom SLAs and support tiers
  • Favorable true-up provisions if you exceed commitments

A Google Cloud CUD negotiation that ties BigQuery capacity into your broader platform commitment often yields 25-45% total savings versus standard list pricing.

Timing the Negotiation: Google Fiscal Year

Alphabet's fiscal year follows the calendar year, ending December 31, and Google Cloud sales quotas compress as each quarter closes—most acutely at year-end. This creates a seasonal dynamic: initiate a PPA or large CUD discussion early in a quarter and you have months to negotiate; engage as a quarter closes and Google is motivated to close quickly. Either way, knowing the calendar gives you leverage.

Additionally, renewal windows are gold. If you have a PPA expiring at year-end, initiating renewal discussions a quarter ahead signals that you're a serious buyer evaluating alternatives. Google will often improve terms to retain your business.

Advanced Topic: BigQuery and Generative AI

BigQuery is increasingly positioned as the data backbone for enterprise generative AI initiatives built on Google Gemini. Gemini in BigQuery (natural-language-to-SQL generation, automated insights) runs queries on your data platform. These queries consume slots or on-demand TiBs like any other query. There is no separate AI pricing; the cost is buried in your query costs.

This matters because: (1) you may unknowingly trigger expensive generative queries if you enable Gemini features, and (2) BigQuery commitments become even more valuable because they effectively subsidize AI feature usage. If your organization is moving toward generative analytics, the ROI on capacity commitments increases.

Bringing It Together: When to Use Each Model

Choose on-demand if:

  • Your analytics workload is unpredictable (ad-hoc business requests, data exploration)
  • You scan fewer than 300 TiB monthly
  • You want zero commitment and operational simplicity
  • Your query patterns are bursty and difficult to forecast

Choose capacity reservations if:

  • You consistently scan more than 467 TiB monthly
  • You run stable, predictable ETL and reporting workloads
  • You need cost predictability and budget certainty
  • You can forecast 12-36 month demand with reasonable confidence

Choose a hybrid if:

  • You have baseline load (committed) + peak load (autoscale)
  • You want commitment discounts without over-committing
  • You are negotiating a PPA and can structure blended rates

For any organization with more than $1M annual GCP spend that uses BigQuery, an engagement with Google Cloud data platform advisory specialists is typically ROI-positive. Reevaluating your slot allocation and CUD strategy quarterly can yield 15-25% savings.

Need help optimizing your BigQuery economics?

Our team can benchmark your current spend and model out CUD and PPA scenarios tailored to your workload.

Related Reading: Complete Your BigQuery Knowledge

BigQuery pricing intersects with broader GCP cost management. Understanding how BigQuery slots fit into your cloud cost allocation tagging strategy ensures you can attribute costs to business units and projects. Additionally, read our Google Cloud FinOps and CUD optimization playbook for a step-by-step framework to right-size all your GCP commitments.

BigQuery economics also intersect with Google Workspace licensing negotiation at organizations where Workspace is the collaboration layer and BigQuery is the analytics core. Bundling both products into a single PPA often yields better commercial terms than negotiating each independently.

If your organization is evaluating Google Cloud versus other data platforms (Snowflake, Databricks, Amazon Redshift), stay tuned for our detailed cost comparison guides. And for the latest insights on cloud pricing trends, subscribe to our Redress Compliance newsletter.

BigQuery's economics are complex, but they are auditable and optimizable. The tools are there. The question is whether you have clarity on your own usage patterns. If you don't, that's the first step. Audit your INFORMATION_SCHEMA.JOBS. Understand where the scans are. Then model the commitment case. Then negotiate. That's the path to BigQuery savings at scale.