Per-GB Storage Pricing: Examples & Companies

What is it

Per-GB storage pricing is a billing unit where customers are charged per gigabyte of data stored on the platform per month.

Storage is the persistent-state dimension of cloud pricing. Where compute meters time and APIs meter requests, the storage meter captures data at rest — the bytes that survive between sessions and keep occupying disk whether or not anything is actively running against them. The unit is almost always expressed as a rate per gigabyte over a billing window, most commonly a GB-month, though marketplace vendors like Vast.ai bill it per GB-hour for finer-grained, continuous accounting.

The model shows up in two distinct places in the corpus. The first is cloud and GPU infrastructure, where storage is a secondary meter layered on top of per-second or per-hour compute: RunPod, Modal, E2B, CoreWeave, Nebius, and Lightning AI all bill persistent disk separately from the machines that use it. The second is data and retrieval platforms — vector databases and serverless search — where the stored index is the product. Pinecone, Weaviate, Qdrant, Chroma, Upstash, and turbopuffer all meter stored data per GB because the embedding corpus a customer holds is the asset they pay to keep online.

The defining characteristic of per-GB storage pricing is that the rate is low but compounds. A few cents per gigabyte feels negligible at signup, but storage never sleeps: unlike a paused container or an idle endpoint, retained data keeps accruing charges for as long as it exists. That asymmetry — cheap per unit, relentless over time — is what makes storage the line item buyers most often underestimate. And the range is wider than any other billing unit in the corpus: object-storage tiers like Nebius ($0.0147/GiB-month) sit more than three orders of magnitude below enterprise SaaS add-ons like Kustomer ($50/GB-month), so the same nominal unit can mean radically different economics depending on the product it wraps.

Same gigabyte, four storage classes — a ~3,000× price span

How it works

The core formula is simple: a vendor measures how much data a customer is keeping, multiplies it by a per-GB rate, and prorates it over the billing window. The variation lives in three dimensions — the time basis, the storage class, and the free allowance.

Dimension	What it controls	Example from the corpus
Time basis	Whether storage is metered per GB-month, GB-hour, or daily-average	RunPod, Modal, and Pinecone use GB-month; Vast.ai bills GB-hour continuously; Upstash meters Redis on a daily average; Qdrant meters disk hourly
Storage class	Different rates for different durability/performance tiers	CoreWeave splits AI Object Storage Hot/Warm/Cold/Archive ($0.06 / $0.03 / $0.015 / $0.0125 per GB-mo) from Distributed File ($0.070/GB-mo); RunPod splits Container Disk, Volume Disk, and Network Storage ($0.05–$0.20)
Free allowance	A baseline volume included before metering starts	Modal includes 1 TiB/month free; Upstash gives the first 1 GB of Redis free; E2B includes 1 GiB on Hobby and 5 GiB on Pro
Overage floor / minimum	A rate that kicks in only above an included quota	Weights & Biases includes 5 GB free then $0.03/GB overage; Comet includes 500 GB on Pro then $3 per 100 GB/month

The headline rates cluster into recognizable bands. Object-storage-backed tiers are the cheapest: Nebius Object Storage Standard is $0.0147/GiB-month and Zilliz Cloud cut managed storage to $0.04/GB-month in late 2025 (an 87% drop from $0.30). GPU block and network disk runs a notch higher — RunPod prices Network Storage at $0.05–$0.14/GB-month, Modal persistent volumes at $0.09/GiB-month, and CoreWeave object storage from $0.06/GB-month Hot. Database and vector storage sits higher still: Upstash charges a flat $0.25/GB, Pinecone is ~$0.33/GB-month on Standard, and Chroma is $0.33/GiB-month. At the top, enterprise SaaS folds storage into premium add-ons — Kustomer bills additional data storage at $50/GB/month.

Unit math: Monthly storage bill = (stored_GB − free_allowance_GB) × rate_per_GB_month. On Upstash Redis, keeping 50 GB costs (50 − 1) × $0.25 = $12.25/mo on top of per-command charges. The same 50 GB on Nebius object storage is roughly 50 × $0.0147 = $0.74/mo — the storage class matters far more than the volume.

The subtlety is when the meter runs. GB-month vendors snapshot or average your volume across the month. Vast.ai’s per-second model is stricter: storage accrues for every second an instance exists, including while it is stopped, so the only way to stop paying is to delete the instance. CoreWeave has the same property — object and file storage “accrues whether or not compute runs.” Upstash softens this with a daily-average basis, so a brief spike in stored data doesn’t set a high-water mark for the whole period. For buyers, the time basis matters as much as the headline rate — a “cheaper” GB-hour rate that bills paused resources can cost more than a higher GB-month rate that doesn’t.

Companies using this

Twenty-nine companies in the corpus meter storage per GB, spanning GPU and sandbox infrastructure (RunPod, Modal, Vast.ai, E2B, CoreWeave, Nebius, Lightning AI), vector and retrieval databases (Pinecone, Weaviate, Qdrant, Chroma, Milvus/Zilliz, LanceDB, Upstash, turbopuffer, Voyage AI), LLM observability and eval platforms (Helicone, Braintrust, Comet, Weights & Biases, LangChain, Vellum), and application platforms that fold storage into credits, compute units, or SaaS add-ons (Apify, Dify, Genspark, Vectara, GitLab, Krisp, Kustomer). The table below lists all of them with their pricing model and other billing units.

Patterns observed

Across the twenty-nine companies, per-GB storage almost never stands alone — it is the second, third, or fourth meter in a stack, and how it’s positioned reveals the vendor’s product shape.

Storage is a secondary meter at compute vendors. For RunPod, Modal, E2B, CoreWeave, Nebius, Lightning AI, and Vast.ai, the headline product is per-second or per-hour GPU/CPU time; storage is the line item that captures the state customers want to keep between runs. E2B’s pricing history is telling — its per-vCPU compute rates and $150/mo Pro fee stayed fixed for the entire tracked range, but RAM and storage were added as separate meters ($0.0000045/GiB/s) once customers needed persistent sandboxes. Modal went the opposite direction on generosity, launching persistent volumes with 1 TiB/month free precisely to absorb model-weight storage without pushing customers back to S3.
Storage is a primary, co-equal meter at retrieval platforms. At vector databases the stored index is the product, so storage sits alongside reads/writes/queries as a first-class usage axis. Pinecone bills read units, write units, and storage as three independently-scaling layers; Chroma meters four dimensions ($2.50/GiB written, $0.33/GiB-month stored, $0.0075/TiB queried, $0.09/GiB egress); and Weaviate charges per 1M vector dimensions stored plus $0.10–$0.12/GiB storage. turbopuffer goes furthest by making cheap object storage its entire architectural bet against in-memory rivals.
Vector-storage rates are deflating fast. The corpus captures a clear downward trend in the price of keeping an embedding index online. Milvus/Zilliz cut managed storage 87% from $0.30 to $0.04/GB-month in October 2025, and Weaviate’s per-1M-dimension rate fell roughly 95% from $0.095 (Jan 2024) to $0.00465 (2026) as object-storage architectures replaced RAM-resident indexes. Storage-per-GB is becoming a commodity line at the retrieval layer even as it stays sticky elsewhere.
Free allowances are small and deliberate at infra, generous at research. Upstash (1 GB Redis free), E2B (1 GiB Hobby, 5 GiB Pro), and Pinecone (2 GB Starter) include just enough to remove signup friction. The exception is research land-grabs: Weights & Biases hands academic users 200 GB free, and Qdrant offers a permanent free 4 GB-disk cluster with no credit card.
Storage tiers proliferate by durability. CoreWeave’s four-tier object storage (Hot/Warm/Cold/Archive) and RunPod’s three-way split (Container Disk, Volume Disk, Network Storage) show vendors charging different rates for ephemeral vs. persistent vs. archival durability — the same way hyperscalers separate instance store from block volumes and cold archive.

Counterexamples & variants

The clearest variant is the gated rate. turbopuffer meters storage per GB-month and writes per GB written, but the base per-GB rate appears only inside its interactive cost calculator — the static pricing page does not expose it. The corpus records this as “unknown” rather than guessing, which is itself instructive: a usage-based vendor can lean entirely on storage economics for its competitive story (object storage vs. RAM) while keeping the actual rate behind a calculator, so buyers must model their own corpus to learn the number. LanceDB is a softer version of the same pattern — storage is a real billed dimension, but the public pricing page is a contact form and the rate is free during public beta, so the number lives in sales quotes and a single AWS Marketplace listing.

The second variant is storage absorbed into a synthetic unit. Apify bills compute units at $0.13–$0.20 per CU with storage as one of four platform-usage components rolled into the CU; Dify, Genspark, and GitLab fold storage into message credits, credit tiers, or GitLab Credits respectively; and Vectara meters storage inside a credit unit and never publishes a per-GB overage rate. In these models storage consumption is real but is converted into the platform’s own currency rather than billed as raw GB. The customer still pays for data at rest, but the per-GB rate is invisible. This trades transparency for simplicity and is common where storage is a small fraction of total cost.

The sharpest counterexample to “storage is cheap and passive” runs in two directions. On the low end, Vast.ai and CoreWeave bill storage continuously — including on stopped instances or while no compute runs — so a customer who spins down a GPU to save on compute keeps paying for the attached disk indefinitely; deleting the resource is the only way to stop the meter. This inverts the usual mental model where pausing a resource pauses its cost, and it’s the most common surprise on the bill. On the high end, Kustomer’s $50/GB/month data-storage add-on shows that “storage is cheap” is a claim about infrastructure, not about SaaS: wrapped inside an enterprise support platform, the same gigabyte costs orders of magnitude more than raw object storage. The enterprise RAG case (Vectara) is the opposite failure again — storage is metered but its rate is never published, so buyers cannot self-estimate where a deployment lands and must negotiate the size band per deal.

What this means for buyers vs vendors

For buyers

Treat the headline per-GB rate as the least important storage variable. Ask three questions instead: What is the time basis (GB-month, GB-hour, or daily-average)? Does the meter keep running on stopped or idle resources — as it does on Vast.ai and CoreWeave? And what storage class am I actually buying — object storage at $0.015–$0.04/GB, GPU block disk at $0.05–$0.20, database storage at $0.25–$0.33, or a SaaS add-on at $50? The 3,000x spread in this corpus means the class you land in dominates the bill far more than your volume does. Storage compounds silently, so model it over the full retention period, not just month one, and use the guide to choosing the right usage metric to weigh whether storage or requests will dominate. Where rates are gated (turbopuffer, LanceDB) or bundled into credits (Vectara, Apify), insist on a worked estimate for your own data volume before committing.

For vendors

Per-GB storage works best as a secondary meter that captures persistent state without distorting your primary value metric — the pattern RunPod, Modal, CoreWeave, and E2B follow by keeping compute as the headline. Decide your time basis deliberately: daily-average (like Upstash) reduces bill-shock from spikes, while per-second continuous metering (like Vast.ai) maximizes capture but generates support tickets when customers don’t realize stopped instances still cost money. If you sell retrieval, expect the storage rate itself to keep deflating — Milvus/Zilliz’s 87% cut and Weaviate’s ~95% drop are the direction of travel, so plan to differentiate on architecture (object vs. RAM) rather than on the rate card alone. A small free allowance removes onboarding friction cheaply. For the mechanics of metering and prorating data at rest across a billing cycle, see the guides on usage-based pricing models and usage invoicing and billing cycles.

Company	Product	Pricing model	Billing units	Free tier	Verified
Apify	Apify Platform — web scraping and browser-automation cloud with an Actors marketplace	hybrid freemium	gb-hours credits bandwidth-gb	Yes	2026-06-03
Braintrust	LLM evaluation & observability platform	hybrid	tokens storage-gb scores	Yes	2026-07-14
Chroma	Open-source vector database + Chroma Cloud	pure-usage freemium	storage-gb bandwidth-gb api-calls	Yes	2026-06-09
Comet	AI/ML observability and experiment-tracking platform — Opik (LLM/agent observability) and Comet MLOps (experiment tracking)	freemium seat-based hybrid	seats gpu-hours storage-gb	Yes	2026-06-02
CoreWeave	GPU cloud & AI compute infrastructure	pure-usage commitment	gpu-hours cpu-hours storage-gb	No	2026-06-15
Dify	Dify Cloud + self-hosted LLM app development platform	subscription seat-based	credits seats documents	Yes	2026-07-14
E2B	Open-source cloud sandboxes for AI agents — secure, isolated micro-VMs that run LLM-generated code, coding agents, and computer-use workflows	freemium hybrid	cpu-hours gb-hours storage-gb	Yes	2026-06-02
Genspark	All-in-one AI agent workspace (Super Agent, AI Slides/Sheets/Docs, image/video/audio generation) on a credit-based model	freemium subscription seat-based	credits seats storage-gb	Yes	2026-06-02
GitLab	AI-native DevSecOps platform (source control, CI/CD, security, agents)	seat-based seat-plus-usage hybrid	seats credits cpu-hours	Yes	2026-06-21
Helicone	Open-source LLM observability & AI gateway	hybrid freemium	requests logs storage-gb	Yes	2026-06-09
Krisp	AI noise-cancellation, meeting transcription/notes, call-center voice AI, and a developer Voice AI SDK	seat-based	seats storage-gb media-minutes	Yes	2026-06-04
Kustomer	AI-first CRM and customer-service platform unifying omnichannel support, automation, and AI agents	hybrid seat-based outcome-based	seats resolutions media-minutes	No	2026-06-07
LanceDB	AI-native multimodal lakehouse	freemium pure-usage commitment	storage-gb vectors-indexed gpu-hours	Yes	2026-06-09
LangChain	Agent orchestration frameworks + LangSmith platform	hybrid seat-plus-usage freemium	seats traces workflow-executions	Yes	2026-06-10
Lightning AI	Cloud GPU/CPU Studio compute platform for building, training, and serving AI models, billed by the second with a credit pool.	hybrid freemium pure-usage	gpu-hours cpu-hours credits	Yes	2026-06-02
Milvus	Vector database (OSS) + Zilliz Cloud (managed)	pure-usage freemium commitment	gpu-hours storage-gb vectors-indexed	Yes	2026-06-09
Modal	Serverless compute and GPU platform — per-second billing for Python functions, batch jobs, and model serving	pure-usage freemium subscription	gpu-hours cpu-hours gb-hours	Yes	2026-07-14
Nebius	AI cloud & GPU compute infrastructure	pure-usage commitment	gpu-hours cpu-hours storage-gb	No	2026-06-15
Pinecone	Managed vector database (serverless)	pure-usage hybrid	requests storage-gb vectors-indexed	Yes	2026-06-09
Qdrant	Open-source vector database + Qdrant Cloud	pure-usage freemium	cpu-hours gb-hours storage-gb	Yes	2026-06-09
RunPod	GPU cloud marketplace — Secure Cloud and Community Cloud Pods, Serverless endpoints, and persistent storage	pure-usage hybrid commitment	gpu-hours storage-gb	No	2026-07-14
turbopuffer	Serverless vector and full-text search database on object storage	pure-usage commitment	storage-gb vectors-indexed gb-hours	No	2026-07-14
Upstash	Upstash (Redis, Vector, QStash, Search, Workflow)	pure-usage freemium hybrid	requests api-calls vectors-indexed	Yes	2026-07-14
Vast.ai	GPU rental marketplace — on-demand, interruptible (spot), and reserved cloud GPUs plus autoscaling serverless inference	pure-usage commitment	gpu-hours storage-gb bandwidth-gb	No	2026-07-14
Vectara	Enterprise RAG-as-a-Service and agent platform for trusted, grounded, auditable AI	commitment subscription	credits requests storage-gb	No	2026-06-02
Vellum	Personal AI assistant (ex LLM application development platform)	hybrid freemium	credits storage-gb	Yes	2026-06-10
Voyage AI	Embedding and reranker models (text, code, multimodal) for retrieval and RAG	pure-usage freemium	tokens storage-gb	Yes	2026-06-04
Weaviate	AI-native vector database (open-source core + Weaviate Cloud managed serverless, dedicated/Enterprise Cloud, BYOC)	pure-usage hybrid commitment	vectors-indexed tokens api-calls	Yes	2026-07-06
Weights & Biases	MLOps experiment tracking, W&B Weave LLM observability/evals, Models registry, and Serverless Inference	freemium hybrid seat-plus-usage	seats storage-gb traces	Yes	2026-07-14

Explore this theme in the knowledge graph

FAQ

What is per-GB storage pricing?

Per-GB storage pricing charges customers for the volume of data they keep on a platform, typically expressed as a rate per gigabyte per month (GB-month). It meters data at rest rather than compute time or requests.

How much does per-GB storage typically cost?

Rates vary enormously by storage class. Object-storage tiers run as low as $0.0147/GiB-month (Nebius) or $0.04/GB-month (Zilliz Cloud); GPU block and network disk sits around $0.05–$0.20/GB-month (RunPod, Modal, CoreWeave); database and vector storage runs $0.25–$0.33/GB-month (Upstash, Pinecone, Chroma); and enterprise SaaS add-ons can reach $50/GB-month (Kustomer).

Is storage billed on peak usage or average?

It depends on the vendor. Upstash meters Redis storage on a daily-average basis, while marketplace compute vendors like Vast.ai bill storage continuously per second the data exists — including while instances are stopped.

Why do compute vendors charge for storage separately?

Persistent data outlives the compute that created it. Vendors like RunPod, Modal, E2B, CoreWeave, and Nebius separate the storage meter so customers pay for state retained between sessions independently of per-second or per-hour compute.

Does per-GB storage pricing apply to vector databases?

Yes. Vector databases such as Pinecone (~$0.33/GB-mo), Chroma ($0.33/GiB-mo), Weaviate ($0.10–$0.12/GiB), Upstash Vector ($0.25/GB), and Zilliz Cloud ($0.04/GB-mo after its 2025 cut) all meter stored data per GB, since the embedding index is the persistent asset customers pay to keep online.

What hidden costs come with per-GB storage pricing?

Storage rates are small per unit but compound over time, and stopped or idle resources often keep accruing charges (Vast.ai, CoreWeave). Watch for storage that keeps metering on paused instances, and for rates bundled into abstract credits or compute units (Vectara, Apify) where the raw per-GB number is invisible.

Related billing units

Back to companies