Committed-Use Pricing: Examples & Companies

What is it

Committed-Use Pricing is a pricing model where the customer commits to a minimum spend over a period (typically annual) in exchange for a discounted rate.

It is the enterprise counterpart to pure-usage pricing. Where pure usage lets a buyer start at zero and pay only for what they consume, committed-use asks the buyer to put a floor under their spend — a dollar minimum, a reserved machine, or a prepaid volume — and rewards that floor with a lower per-unit rate. The trade is symmetric: the vendor converts a volatile usage stream into predictable, bookable revenue, and the buyer converts an unpredictable bill into a known rate.

The discount is almost always the headline. CoreWeave lists 8-GPU HGX H100 nodes at $49.24/hr on-demand but quotes “up to 60% discounts over our On-Demand prices for committed usage” — and those reserved contracts, not the public card, carry most of its $5.13B in 2025 revenue. Vast.ai advertises up to 50% off its marketplace rate for reserved, pre-paid capacity; Nebius cuts up to 35% for multi-month and large-cluster commitments. The mechanism is identical across very different products: spend certainty buys rate.

Committed-use rarely stands alone. It is almost always the top rung of a ladder that starts with self-serve usage — a free trial or pay-as-you-go entry, then volume tiers, then an annual commit reserved for the enterprise buyer. That structure is so consistent among infrastructure vendors that it has become a documented cross-corpus pattern: the infrastructure-layer commitment-pricing trend tracks how GPU-compute and data vendors converged on offering identical “annual commit” enterprise tiers alongside on-demand rates.

Same H100 node, two ways to buy · CoreWeave

How it works

Committed-use pricing has three structural variants, distinguished by what the buyer actually commits to:

Variant	What the buyer commits to	Typical discount	Example
Dollar floor	A minimum monthly or annual spend	20–70%	LanceDB Enterprise ($60,000/yr AWS Marketplace), Nomic Platform ($1,000/mo floor)
Reserved capacity	A specific GPU or machine for a term	Up to 60%	CoreWeave reserved, Vast.ai (1/3/6-month), Lambda 1-Click Clusters (2wk–1yr)
Annual usage commit	A booked annual usage volume	Negotiated	Baseten, Fireworks AI, Anyscale

The unit math is what makes the model work for both sides. A commit is a floor, not a cap: the buyer pays at least the committed amount, draws down usage against it, and — at most vendors — pays standard on-demand rates on anything above the commitment. LanceDB’s AWS Marketplace listing is unusually explicit: a $60,000/year committed contract with $0.01 per LCU billed on overage above the committed quantity. Fireworks AI, Baseten, and Groq similarly bill over-commitment usage at list price, so the discount applies only inside the committed band.

Worked example. Suppose a team runs steady inference on CoreWeave H100 nodes, one 8-GPU HGX H100 node continuously for a year:

On-demand: $49.24/hr × 730 hr/mo × 12 = roughly $431,000/year, fully variable and cancellable any hour.
Reserved commit at up to 60% off: roughly $172,000/year at the deepest quoted discount — but the team is now contractually on the hook for that node whether or not it runs a single job.
If the team needs to burst above one node, the extra capacity bills at the on-demand rate, not the reserved rate — the commit is a floor, not a ceiling.

The break-even logic is the buyer’s whole decision: a commit only pays off if forecasted usage reliably clears the floor. That is why committed-use clusters in vendors whose customers have predictable, always-on workloads — GPU inference, proxy bandwidth, vector search — and is far rarer in spiky, experimental usage. Use the pricing calculator hub to model commit-versus-on-demand break-even, and see the prepaid-credits models guide for how prepaid commitments draw down against usage.

Companies using this

The companies below are every in-corpus entry whose pricing model includes a commitment component, verified against their public pricing pages. The cluster is heavily weighted toward infrastructure — GPU compute, web-data platforms, and vector databases — where always-on workloads make a spend floor a rational trade. A long tail of vertical and robotics companies appears because their enterprise contracts are inherently committed, even when no rate card is public.

Patterns observed

Commitment is the enterprise rung of a usage ladder, not a standalone model. None of these 57 companies sell committed-use as their only option. CoreWeave, Lambda, Nebius, RunPod, Together AI, Fireworks AI, Baseten, and Groq all lead with self-serve on-demand usage and reserve the “annual commit” or reserved-capacity tier for the enterprise buyer. The commit is the destination of the PLG-to-sales upgrade path, not its starting point — the natural top of the ladder that begins with pure-usage pricing.

GPU compute is the center of gravity. The largest concentration is GPU-hour vendors, where the underlying capital cost makes utilization guarantees valuable to both sides. Anyscale layers committed contracts on its $1-per-ACU rate card (from $0.0135/hr CPU up to $10.68/hr for an H200); RunPod and Together AI both drop committed cluster rates below their published on-demand token and GPU cards. Lambda is a telling variant — its 1-Click Clusters publish hard per-GPU-hour numbers ($5.54–$6.16/hr for committed H100 clusters) while on-demand H100 SXM sits at $3.99/hr, inverting the usual logic because scarce frontier capacity is a sellers’ market.

Discounts scale with commitment depth, and the deepest ones are gated. The published-ladder version is legible: Nebius prints both on-demand ($3.85 H100) and preemptible ($2.15 H100) rates against its commit line, and Snowflake Cortex sets credit price by edition (Standard $2, Enterprise $3, Business Critical $4 per credit) so the platform tier itself is the commitment. But most vendors quote the deepest tier privately — Baseten, Fireworks, and Replicate all say “annual commit, contact sales” and reveal the reserved rate only in a quote. The published-ladder approach is the exception, not the rule.

The commitment is increasingly disguised as a tier minimum or a seat-plus-commit hybrid. turbopuffer sets a monthly minimum spend per tier that functions as a soft commitment floor: pick a tier, and you’ve effectively committed to that minimum even on pure usage. Lorikeet does the same with annual credit pools billed monthly. Nomic fuses commitment into a seat model entirely — its Platform Business plan is $40/user/month on an annual commit, with a 25-seat minimum and a $1,000/month platform commitment, so the floor is baked into both the seat count and the platform fee. The floor is the commitment, dressed as packaging.

Counterexamples & variants

Twelve Labs: the gate-only variant. Twelve Labs offers committed-use contracts, but only as a fully gated Enterprise tier — there is no published prepaid bundle or committed-use discount visible on the page. The only commitment path is “contact sales,” which means a developer on the pay-as-you-go Developer plan has no on-page incentive to commit and no way to model the saving. This is the failure mode of commitment pricing: when the discount is invisible, the floor stops being a value exchange and becomes pure friction.

LanceDB and m3ter: the platform-fee floor. Not every commitment is a discount mechanism. LanceDB’s public /pricing page is a contact form, not a price table — its committed AWS Marketplace contract is a fixed annual quantity with a per-LCU overage rate, not a published discount off a list rate. m3ter’s “commitment” is a custom core platform fee bundling allowances for usage data ingested and bills calculated — a committed platform spend to access the product at all. Both are reminders that a commit can be a minimum-to-enter rather than a reward for scale.

Reserved-capacity commits carry lock-in risk the dollar-floor variant doesn’t. Vast.ai is explicit that reserved credits are locked to a single machine — pre-pay for a host and your capacity is tied to it, not portable across the marketplace. DeepInfra’s 3-year DeepCluster delivers among the corpus’s lowest GPU rates precisely because the buyer absorbs three years of obsolescence risk on a fast-moving hardware curve. Lambda shows the sharpest edge of this: its on-demand H100 SXM rate rose from $2.99 to $3.99 through 2025–2026 as capacity tightened, so a buyer who committed early locked in scarcity protection while a buyer who waited paid more. The deeper the discount and the longer the term, the more the buyer is betting on a hardware and demand curve that can move against them.

The mid-tier gap is where committed-use silently fails. The most common structural weakness is not a bad commit but a missing one. Fireworks AI’s own teardown flags that self-serve customers grow into the $10K–$50K/month band with no published mid-tier discount before the Enterprise commit — so a growing account sees on-demand rates all the way up until a sales-gated jump. That gap is exactly where accounts shop around, and it is a variant failure mode distinct from the gate-only problem: the ladder exists, but it skips a rung.

What this means for buyers vs vendors

For buyers

Commit only against the floor you are confident you will clear in your worst month, not your average month. The worked example above turns negative the moment forecasted usage dips below the committed capacity, because you still pay the floor whether the node runs or not. Favor dollar-floor commits (an annual quantity, a platform fee, a tier minimum) over reserved-capacity commits (machine-locked credits, a multi-year cluster) when your workload mix is uncertain — a dollar floor is fungible across products; a reserved machine is not.

Watch the lock-in window closely on GPUs. A multi-year reservation delivers the deepest rate but spans several hardware generations; the H100 you commit to today may be the previous generation before your term ends. Always extract the overage rate before signing: a pre-committed pool sold without a published overage rate, as Lorikeet does, leaves you unable to model the downside — insist on a stated rate for everything above the floor.

Use the visible rate cards as leverage. Where a vendor prints its on-demand, preemptible, and committed numbers, you can anchor a negotiation for the sales-quoted tiers against those published figures. Model the break-even before you talk to sales — the pricing calculator hub and the prepaid-credits models guide both help translate committed spend into an effective per-unit rate.

For vendors

Committed-use converts volatile usage into bookable revenue, which is why every infrastructure vendor in this corpus offers it — public-market investors reward a contracted backlog far more than a self-serve rate card. The commit is not a discount you give away; it is the mechanism that turns spiky demand into a predictable, financeable revenue stream.

Publish the ladder rather than gating it. The visible-discount approach outconverts the gate-only approach, because the buyer can see the saving and self-qualify: a growing account facing a printed commit line models the upgrade itself, while an invisible Enterprise gate forces every conversion through a sales call and loses the ones who won’t book a demo. Fill the missing middle so accounts don’t shop around exactly when they are most valuable, and state the overage rate up front. The usage-invoicing & billing-cycles guide covers how to reconcile commits, drawdowns, and overage cleanly on the invoice.

Company	Product	Pricing model	Billing units	Free tier	Verified
6sense	ABM and B2B revenue-intelligence platform — predictive account scoring, buyer intent data, and AI sales/marketing workflows	hybrid commitment	credits contacts seats	No	2026-07-14
Aleph Alpha	PhariaAI sovereign-AI platform, specialized models & professional services	commitment subscription	seats tokens credits	No	2026-06-11
Anyscale	Managed Ray platform for distributed AI training, inference, and batch processing (RayTurbo, Anyscale Compute Units)	pure-usage commitment hybrid	gpu-hours cpu-hours credits	Yes	2026-05-29
Apptronik	Apollo general-purpose humanoid robot (RaaS + outright sale)	pure-usage commitment	robot-hours units	No	2026-06-14
Artisan	Ava — an autonomous AI BDR/SDR that finds leads, enriches data, and runs outbound campaigns	hybrid commitment	contacts credits mailboxes	No	2026-07-14
Baseten	ML inference infrastructure — dedicated GPU deployments, Model APIs, and Truss framework	pure-usage hybrid commitment	gpu-hours tokens requests	Yes	2026-05-29
BentoML	BentoCloud — managed model-serving & inference platform	pure-usage freemium commitment	gpu-hours cpu-hours	Yes	2026-06-15
Bright Data	Web data platform — proxy networks, scraping APIs, a managed scraping browser, SERP and unlocker APIs, ready-made datasets, and eCommerce insights	pure-usage hybrid commitment	bandwidth-gb requests records	Yes	2026-07-14
Browse AI	No-code web scraping and website-monitoring platform that turns any site into a structured dataset or API	freemium hybrid commitment	credits seats	Yes	2026-06-04
Cerebras	Wafer-scale AI inference cloud and WSE hardware systems	pure-usage subscription commitment	tokens api-calls gpu-hours	Yes	2026-05-30
Clay	AI-powered GTM data-enrichment and outbound platform billed on Actions plus Data Credits	hybrid freemium commitment	credits actions	Yes	2026-07-06
CoreWeave	GPU cloud & AI compute infrastructure	pure-usage commitment	gpu-hours cpu-hours storage-gb	No	2026-06-15
Covariant	Covariant Brain — AI for autonomous warehouse robotic picking	commitment	units	No	2026-07-14
Cresta	AI coaching and intelligence for contact centers	seat-based subscription commitment	seats conversations	No	2026-06-11
Databricks (Mosaic AI)	Mosaic AI — enterprise GenAI & ML on the Data Intelligence Platform	pure-usage commitment	units tokens gpu-hours	Yes	2026-06-15
DeepInfra	Serverless inference cloud — per-token LLM/embedding APIs, per-image and per-minute media models, per-hour on-demand GPU containers, and reserved DeepCluster GPU clusters	pure-usage commitment	tokens gpu-hours requests	No	2026-07-14
Docket	AI Marketing Agent that converts B2B website visitors into qualified pipeline	subscription commitment	active-users	No	2026-06-21
Essential AI	Enterprise foundation models & data-workflow automation	commitment	units	No	2026-06-11
Exscientia (now part of Recursion)	AI-driven drug discovery & design platform	outcome-based commitment	milestones outcomes	No	2026-06-16
Figure	General-purpose humanoid robots (Figure 03) & Helix AI	commitment	units	No	2026-06-14
Finout	Finout — enterprise cloud + AI cost observability (FinOps) platform	subscription commitment	datapoints	No	2026-06-10
Fireworks AI	Generative AI inference platform — serverless per-token, on-demand GPU, fine-tuning, batch API	pure-usage hybrid commitment	tokens gpu-hours requests	Yes	2026-05-30
Gladia	Speech-to-text & audio intelligence API	pure-usage freemium commitment	media-minutes requests	Yes	2026-06-09
Groq	GroqCloud — LPU-based ultra-low-latency inference API for Llama, GPT-OSS, Qwen, Whisper transcription, and Orpheus text-to-speech	pure-usage hybrid commitment	tokens requests api-calls	Yes	2026-07-14
Hebbia	Matrix — agentic AI for institutional knowledge work and document analysis	seat-based subscription commitment	seats	No	2026-06-15
Hyperbolic	GPU cloud marketplace & serverless AI inference	pure-usage commitment	gpu-hours tokens images	Yes	2026-06-15
Ironclad AI	AI-powered contract lifecycle management (CLM)	subscription seat-based commitment	seats workflow-executions documents	No	2026-06-16
Lambda	GPU cloud & AI compute infrastructure	pure-usage commitment	gpu-hours	No	2026-06-09
LanceDB	AI-native multimodal lakehouse	freemium pure-usage commitment	storage-gb vectors-indexed gpu-hours	Yes	2026-06-09
Lorikeet	AI customer-support agent that resolves chat, email, SMS, and voice tickets	outcome-based commitment	resolutions credits	No	2026-06-07
m3ter	Usage-based billing and metering infrastructure for B2B SaaS	hybrid commitment	transactions events	No	2026-06-03
Maven AGI	Enterprise AI agent platform for customer support	outcome-based pure-usage commitment	resolutions conversations interactions	No	2026-06-11
Milvus	Vector database (OSS) + Zilliz Cloud (managed)	pure-usage freemium commitment	gpu-hours storage-gb vectors-indexed	Yes	2026-06-09
MultiOn	Autonomous web-browsing AI agent API (wound down)	pure-usage commitment	requests	No	2026-06-10
Nebius	AI cloud & GPU compute infrastructure	pure-usage commitment	gpu-hours cpu-hours storage-gb	No	2026-06-15
Nomic	Nomic Platform (AEC agentic workflows) + Atlas data-exploration app + Nomic Embed embedding/Developer API	hybrid seat-based commitment	seats tokens credits	Yes	2026-06-04
Physical Intelligence	Robotics foundation models (Vision-Language-Action policies for robots)	commitment	units	No	2026-06-14
PolyAI	Enterprise voice AI assistants for contact centers	hybrid commitment	media-minutes	No	2026-06-09
Poolside	AI coding foundation model	seat-plus-usage subscription commitment	seats tokens	No	2026-06-16
Recursion	AI-enabled drug discovery platform (Recursion OS) — pharma partnerships, internal pipeline & NVIDIA-powered compute	outcome-based commitment	milestones outcomes	No	2026-06-10
Replicate	Cloud platform for running, fine-tuning, and deploying AI models via REST API	pure-usage hybrid commitment	gpu-hours tokens requests	Yes	2026-05-30
RunPod	GPU cloud marketplace — Secure Cloud and Community Cloud Pods, Serverless endpoints, and persistent storage	pure-usage hybrid commitment	gpu-hours storage-gb	No	2026-07-14
SambaNova	SambaNova Cloud inference API & RDU AI systems	pure-usage subscription commitment	tokens	Yes	2026-06-15
Sanctuary AI	Phoenix general-purpose humanoid robot & Carbon AI control system	commitment	units	No	2026-06-14
Scale AI	Data engine, GenAI platform & contributor marketplace	pure-usage commitment	tasks records data-licensing	No	2026-06-15
Sequence	Sequence — quote-to-revenue platform (CPQ, billing, usage metering, AR & revenue recognition) for B2B finance teams	subscription commitment	invoices events seats	No	2026-06-10
Shield AI	Hivemind autonomy software, V-BAT & X-BAT autonomous aircraft	commitment	units	No	2026-06-14
Snorkel AI	Programmatic AI data development platform & expert data	subscription commitment	data-licensing records units	No	2026-06-15
Snowflake Cortex	AI functions and model APIs on Snowflake	pure-usage commitment	credits tokens pages-rendered	Yes	2026-07-06
Sourcegraph Cody	Enterprise code intelligence platform with AI Deep Search and pooled AI credits	hybrid commitment	seats credits	No	2026-06-09
SugarCRM	CRM platform (Sugar Sell, Serve, Market, Enterprise) with predictive + generative AI, now branded SugarAI	seat-based commitment	seats	No	2026-07-06
Tempus	Precision-medicine platform — genomic diagnostics, multimodal clinical data licensing & oncology AI apps (NASDAQ: TEM)	hybrid commitment	tests data-licensing	No	2026-06-10
Together AI	AI Acceleration Cloud — serverless inference, dedicated endpoints, GPU clusters, Code Sandbox, fine-tuning	pure-usage hybrid commitment	tokens gpu-hours cpu-hours	Yes	2026-07-14
turbopuffer	Serverless vector and full-text search database on object storage	pure-usage commitment	storage-gb vectors-indexed gb-hours	No	2026-07-14
Twelve Labs	Video understanding foundation models (Marengo for search/embeddings, Pegasus for analysis) delivered as a usage-metered API	pure-usage freemium commitment	media-minutes tokens requests	Yes	2026-06-02
Vast.ai	GPU rental marketplace — on-demand, interruptible (spot), and reserved cloud GPUs plus autoscaling serverless inference	pure-usage commitment	gpu-hours storage-gb bandwidth-gb	No	2026-07-14
Vectara	Enterprise RAG-as-a-Service and agent platform for trusted, grounded, auditable AI	commitment subscription	credits requests storage-gb	No	2026-06-02
Weaviate	AI-native vector database (open-source core + Weaviate Cloud managed serverless, dedicated/Enterprise Cloud, BYOC)	pure-usage hybrid commitment	vectors-indexed tokens api-calls	Yes	2026-07-06
Yellow.ai	Conversational CX automation platform	freemium outcome-based hybrid	resolutions conversations interactions	Yes	2026-06-11
Zenskar	Zenskar — AI-native order-to-cash platform (billing, metering, invoicing, revenue recognition)	subscription commitment	invoices events transactions	No	2026-06-10
ZoomInfo	GTM / sales-intelligence platform (contact + company data, intent, and the ZoomInfo Copilot AI GTM assistant)	seat-plus-usage hybrid commitment	seats credits contacts	No	2026-07-06

Explore this theme in the knowledge graph

FAQ

What is committed-use pricing?

Committed-use pricing is a model where the customer agrees to a minimum spend or volume over a fixed period — usually a year — in exchange for a discounted per-unit rate. The vendor gets revenue predictability; the buyer gets a lower price.

How much can you save with a committed-use discount?

In this corpus, committed discounts range from roughly 20% to nearly 70%. CoreWeave offers up to 60% off on-demand for reserved capacity, Vast.ai up to 50% off for reserved GPUs, Nebius up to 35% for multi-month commitments, and Together AI drops H100 clusters below its on-demand token and GPU rates.

What's the difference between committed-use and reserved-instance pricing?

Reserved instances are a subset of committed-use: you pre-pay or commit to a specific GPU or machine for a term (Vast.ai's 1/3/6-month reservations, Lambda's 2-week-to-1-year 1-Click Clusters, DeepInfra's 3-year DeepCluster). General committed-use can also be a dollar floor with no specific resource attached, like Nomic's $1,000/month platform commitment or LanceDB's $60,000/year Enterprise contract.

What happens if you exceed your commitment?

Most vendors bill overage above the commitment at standard on-demand rates. LanceDB's AWS Marketplace listing bills $0.01 per LCU above the committed annual quantity; Fireworks, Baseten, and Groq all state that over-commitment usage reverts to list pricing — so the commit is a floor, not a cap.

Which AI companies use committed-use pricing?

57 in-corpus companies offer it, heavily concentrated in GPU compute and data infrastructure: CoreWeave, Lambda, Nebius, RunPod, Together AI, Fireworks AI, Baseten, Cerebras, Groq, Replicate, DeepInfra, Anyscale, Vast.ai, plus vector databases (Weaviate, Milvus, LanceDB, turbopuffer) and vertical platforms like Nomic and Snowflake Cortex.

Related pricing models

Related guides & calculators

Usage Invoicing and Billing Cycles Explained

Guide

Back to companies