Does turbopuffer have a free tier?

No. turbopuffer has no free tier — its cheapest entry point is the Launch plan's $16/month usage minimum (cut from $64 in June 2026). You pay the greater of your metered usage or that minimum.

How is turbopuffer priced?

Purely on usage across three dimensions: writes per GB, queries per unit of data scanned (base $1/PB), and stored data per GB-month. A tier-scaled monthly minimum ($16 / $256 / ≥$4,096) acts as a floor on that usage.

What's the difference between the Launch, Scale, and Enterprise tiers?

The tiers set your monthly minimum and your compliance/support envelope, not your base usage rates. Launch and Scale charge identical usage rates; Scale adds HIPAA-ready BAA, SSO, and audit logs. Enterprise adds single-tenancy, BYOC, and SLAs but applies a 35% premium on usage.

How much did turbopuffer's query pricing change in 2026?

In February 2026 the base queried-data rate dropped from $5/PB to $1/PB, with up to 94% reductions for the largest namespaces via marginal discounts (80% for 32–128 GB, 96% above 128 GB per query). The minimum billable data per query rose from 256 MB to 1.28 GB.

Does turbopuffer charge for storage and writes too?

Yes. Beyond queries, you pay per GB written and per GB-month of stored data. As of mid-2026 the pricing-page cost calculator surfaces these base rates as per-unit ceilings — storage up to $0.33/GB and writes up to $2.00/GB — though the exact figures still live inside the calculator widget rather than as static published numbers.

What is namespace pinning and how is it billed?

Pinning keeps a namespace warm for latency-sensitive workloads. Introduced in April 2026, pinned namespaces bill in GB-hours (scaling with size, replica count, and time pinned) instead of per-query rates, with minimums of 64 GB and 10 minutes.

turbopuffer Pricing

AI Summary

turbopuffer is a serverless vector and full-text search database built directly on object storage, priced purely on metered usage rather than nodes or provisioned memory.
Customers pay for three usage dimensions — writes per GB, queries per unit of data scanned, and stored data per GB-month — and pay the greater of their usage or a tier-scaled monthly minimum.
The three plan tiers (Launch at $16/mo, Scale at $256/mo, Enterprise at $4,096/mo or more) set the monthly minimum and the compliance and support envelope, not the base usage rates.
turbopuffer has no free tier; the lowest entry point is the Launch tier's $16 monthly usage minimum, cut from $64 in June 2026.
In February 2026 turbopuffer cut its base queried-data rate from $5 per petabyte to $1 per petabyte, an up-to-94% reduction for the largest namespaces, documented in a public pricing changelog.
turbopuffer powers retrieval for Cursor and Notion and reached an estimated $100M annualized run-rate in March 2026 on minimal outside capital.

Pricing summary

turbopuffer 2026 — usage pricing with a tier-scaled monthly minimum

Pure usage: every plan meters writes, queries, and stored data; the tier you pick sets the monthly minimum spend and the compliance/support envelope.

Monthly minimum

Launch

$16 /mo

Teams getting to production on multi-tenant search

Monthly minimum

Scale

$256 /mo

Production workloads needing SSO, audit logs, BAA

+35% usage premium

Enterprise

≥$4,096 /mo

Billion-scale, single-tenant or BYOC deployments

Minimums are floors on metered usage, not platform fees — you pay the greater of your usage or the tier minimum. The Launch minimum was cut from $64 to $16 in June 2026. Captured from turbopuffer.com/pricing, 2026-07-14.

About

turbopuffer is a serverless vector and full-text search database built directly on object storage (e.g. S3) with SSD caching, designed to make large-scale retrieval an order of magnitude cheaper than memory-resident vector databases. It serves AI and search teams running retrieval-augmented generation, semantic search, and hybrid (vector + full-text) workloads, with a multi-tenant architecture purpose-built for namespaces-per-customer patterns at billion-document scale.

The company positions itself against in-memory vector databases (Pinecone, Qdrant, Weaviate) by trading a small amount of cold-start latency for a storage cost structure anchored to cheap object storage rather than RAM. Its benchmarks advertise p50 warm-query latency around 14ms at 10M documents, and it markets a path “to billion-scale” handled directly with the customer’s engineering team.

turbopuffer monetizes purely on usage — writes, queries, and stored data — rather than on nodes, pods, or provisioned memory. The plan tier a customer selects (Launch, Scale, Enterprise) changes the monthly minimum spend and the security/support envelope, but the underlying per-unit usage rates are the same on Launch and Scale; only Enterprise adds a 35% usage premium in exchange for single-tenancy, BYOC, and SLAs.

Pricing summary : serverless vector search billed per write, query, and GB-month

turbopuffer uses pure usage-based pricing: there are no seats and no platform fee — you pay for metered writes, queries, and stored data, subject to a tier-scaled monthly minimum (Launch $16/mo, Scale $256/mo, Enterprise ≥$4,096/mo). The three published tiers differ in their minimum spend and their compliance/support entitlements, not in their base usage rates (Launch and Scale share rates; Enterprise adds a 35% premium). The Launch minimum was cut from $64 to $16 in June 2026.

Storage — billed per GB-month of logical stored data. The pricing-page cost calculator surfaces the base rate as a per-unit ceiling of $0.33/GB.
Writes — billed per GB written, with a calculator-shown ceiling of $2.00/GB. copy_from_namespace copies bill at a 50% discount on write costs.
Queries — billed per unit of data queried. The base queried-data rate is $1/PB (reduced from $5/PB in February 2026), with marginal volume discounts: 80% off bytes between 32–128 GB queried and 96% off above 128 GB per query. Minimum billable data per query is 1.28 GB.
Namespaces — multi-tenant by design; the calculator includes an allotment (e.g. 10 namespaces) before per-namespace charges apply.
Pinning — pinned namespaces (introduced April 2026) bill in GB-hours instead of per-query, scaling with namespace size, replica count, and time pinned (minimums of 64 GB and 10 minutes).

What makes this different: the monthly tier price is a floor on usage, not an add-on fee — customers pay the greater of their metered usage or the tier minimum, so the model behaves like consumption usage-based billing with a commitment floor rather than a classic seat-plus-overage SaaS plan.

Pricing by product

turbopuffer ships a single product (the search database) sold across three tiers. Tiers gate compliance and support; the metered usage dimensions are common to all.

turbopuffer database (plan tiers)

Tier	Price	Included	Key mechanics
Launch	$16 / mo minimum	All database features; multi-tenancy; SOC 2 + GDPR-ready DPA; community Slack & email	Floor on usage; self-serve sign-up (cut from $64 June 2026)
Scale	$256 / mo minimum	Everything in Launch plus HIPAA-ready BAA, SSO, audit logs, private Slack channel, 8–5 support hours	Same usage rates as Launch; higher floor buys compliance
Enterprise	≥ $4,096 / mo	Single-tenancy + BYOC; CMEK per namespace; private networking; 24/7 support; support + 99.95% uptime SLA	35% usage premium; sales-led, contact required

Usage rates (metered across all tiers)

Dimension	Rate	Notes
Storage	up to $0.33 / GB-month	Per-unit ceiling shown in the pricing-page cost calculator
Writes	up to $2.00 / GB written	Calculator ceiling; `copy_from_namespace` copies at a 50% write discount
Queries	$1 / PB queried (base)	80% marginal discount 32–128 GB; 96% above 128 GB; min 1.28 GB billable per query
Namespaces	per namespace beyond included allotment	Allotment (e.g. 10) included in the calculator before charges
Pinning	GB-hours (size × replicas × time)	Minimums 64 GB and 10 minutes; replaces per-query billing while pinned

Sales motions across products: PLG / self-serve sign-up for Launch and Scale; sales-led, quoted for Enterprise (single-tenancy, BYOC, SLAs).

Hidden costs : the monthly minimum floor and the per-query 1.28 GB scan minimum

turbopuffer’s pure-usage model is honest, but two mechanics quietly shape the bill: the tier minimum (you pay it even when usage is lower) and the 1.28 GB minimum billable data per query (small namespaces still scan a minimum chunk). The June 2026 cut of the Launch floor from $64 to $16 shrinks the first mechanic sharply — far fewer accounts now sit under the floor — while the pricing-page calculator now surfaces the base storage and write ceilings ($0.33/GB, $2.00/GB) that used to be gated.

Early-production RAG app (low traffic, under the floor)

A small team runs a single-tenant RAG index of ~3M documents with light query traffic — only a few dollars of metered usage a month. They still pay the Launch minimum, but that minimum is now $16, not $64.

Line item	Monthly cost
Metered usage (writes + queries + storage)	~$7 (illustrative)
Launch tier minimum floor	applies the difference
Effective bill	$16 (the minimum)

The lesson: at very low volume turbopuffer still behaves like a small subscription, but the June 2026 cut quartered that floor from $64 to $16 — so the band of accounts that pay the minimum rather than metered usage is now far narrower. A workload spending ~$20/mo of usage, which used to sit under the old $64 floor and pay it, now clears the $16 floor and bills purely on consumption. The floor is still a floor; it just catches many fewer accounts.

High-QPS multi-tenant search (many small queries)

A consumer app issues a very high rate of queries against many small per-tenant namespaces. Each query bills a minimum of 1.28 GB scanned, so query cost is driven by query count, not data size.

Line item	Monthly cost
Storage (per GB-month)	up to $0.33/GB (calculator ceiling)
Writes (per GB)	up to $2.00/GB (calculator ceiling)
Queries — billed at min 1.28 GB each	dominant line at high QPS
Effective bill	driven by the per-query floor

The lesson: high-QPS workloads on tiny namespaces are billed on the 1.28 GB per-query minimum, not on actual bytes scanned — so the headline $1/PB rate understates cost for query-heavy, data-light patterns. Latency-sensitive teams can switch such namespaces to pinning (GB-hours), trading the per-query floor for a time-based rate. For the storage-versus-RAM economics that make turbopuffer cheaper at rest, see our note on storage-based billing.

Want to estimate your own turbopuffer bill? Use the turbopuffer pricing calculator to model your monthly cost based on document count, write rate, query rate, and namespace count.

Pricing evolution : a public changelog of query-rate cuts and new billing modes

turbopuffer maintains a dated public pricing changelog (last updated June 18, 2026), so its rate history is fully recoverable. The through-line is a series of price cuts — on queries, and now on the Launch entry minimum — the opposite of the typical SaaS price-creep — plus the introduction of new billing modes (copy_from_namespace, pinning) rather than new tiers.

Cadence

Quarter	Price changes	Product / SKU additions	Notes
2024 Q3	0	1	Sept 2024 — `copy_from_namespace` introduced; copies bill at a 50% discount on write costs.
2025 Q3	1	0	July 2025 — query pricing for large namespaces cut up to 80% (80% marginal discount on bytes over 32 GB/query).
2026 Q1	2	0	Feb 2026 — base query rate cut $5/PB → $1/PB (up to 94% off largest namespaces); Mar 2026 — multi-vector-column billing clarified.
2026 Q2	1	1	April 2026 — namespace pinning introduced, billed in GB-hours with 64 GB / 10-minute minimums; June 2026 — Launch plan minimum cut from $64/mo to $16/mo.

Tracked range: 2024 Q3–2026 Q2. Quarters not listed were verified stable (0 price changes, 0 SKU additions) against the public pricing changelog.

Notable changes

2024-09 — Introduced copy_from_namespace, copies billed at a 50% discount on write costs (turbopuffer.com/docs/pricing-log).
2025-07 — Query pricing for large namespaces cut up to 80%: 80% marginal discount on bytes queried over 32 GB per query (turbopuffer.com/docs/pricing-log).
2026-02 — Base queried-data rate cut from $5/PB to $1/PB; 80% marginal discount 32–128 GB, 96% above 128 GB; minimum billable data per query raised from 256 MB to 1.28 GB (turbopuffer.com/docs/pricing-log).
2026-03 — Multi-vector-column billing clarified: filterable attributes billed once per vector column for writes and storage; non-filterable attributes billed once total (turbopuffer.com/docs/pricing-log).
2026-04 — Namespace pinning introduced, billed in GB-hours (size × replicas × time) with minimums of 64 GB and 10 minutes (turbopuffer.com/docs/pricing-log).
2026-06 — Launch plan monthly minimum cut from $64/mo to $16/mo for all Launch customers, new and existing; reflected from the July 1st invoice (turbopuffer.com/docs/pricing-log).

The February 2026 query-rate cut in detail

The Feb 2026 change is the most consequential: the base scan rate fell 5× ($5/PB → $1/PB), and the marginal-discount curve (80% for 32–128 GB, 96% above 128 GB per query) means the biggest namespaces saw effective reductions up to 94%. This is corroborated by the same period in which turbopuffer was scaling to a reported $100M run-rate (Sacra, March 2026) on workloads like Cursor and Notion — i.e. the cut coincided with, rather than constrained, rapid revenue growth, consistent with object-storage unit economics improving as volume grows. The simultaneous increase of the per-query minimum from 256 MB to 1.28 GB partly offsets the cut for tiny, high-QPS namespaces (see Hidden costs).

What’s unique : object-storage economics, a usage-floor tier model, and price cuts over time

Object storage instead of RAM. turbopuffer’s core differentiator is architectural: vectors and indexes live on object storage (e.g. S3) with SSD caching, not in provisioned memory. This trades a small amount of cold-start latency for a cost structure anchored to cheap storage, and it’s why the company can keep cutting query prices rather than raising them. Pricing follows architecture — there are no pods, nodes, or provisioned-memory SKUs.

The tier price is a floor, not a fee. Launch ($16), Scale ($256), and Enterprise (≥$4,096) are monthly minimums on metered usage, not platform fees stacked on top of usage. Customers pay the greater of their usage or the minimum — a commitment floor wrapped around pure consumption, rather than the seat-plus-overage shape most SaaS uses.

Compliance gates the tier, rates stay flat. Launch and Scale charge identical usage rates; the higher Scale floor buys HIPAA-ready BAA, SSO, and audit logs — not cheaper or pricier units. Only Enterprise changes the unit economics, adding a 35% usage premium in exchange for single-tenancy, BYOC, CMEK, and a 99.95% uptime SLA. This cleanly separates “what compliance do you need” from “how much do you use.”

Prices go down, publicly and on the record. Across 2024–2026 every metered change in the changelog was a cut or a discount (50% copy discount, 80% then 94% query reductions, and a 75% cut to the Launch entry minimum in June 2026). turbopuffer publishes these in a dated pricing changelog — unusually transparent for an infrastructure vendor and a deliberate trust signal.

New billing modes, not new tiers. Rather than adding plans, turbopuffer adds meters: copy_from_namespace (50% write discount), and namespace pinning (GB-hours). The pricing surface area grows by exposing real cost levers to customers, not by fragmenting the plan grid.

Strengths & weaknesses

Strengths	Weaknesses
Pure usage with a transparent, dated pricing changelog — rare honesty for infra pricing	Base storage and write rates are exposed only inside the calculator, not as static published numbers
Object-storage economics let prices fall over time (query rate cut 5× in Feb 2026)	Still no free tier — though the Launch entry minimum was cut 75% to $16/mo in June 2026, it remains a hard floor vs. free tiers at Pinecone/Qdrant/Weaviate
Tier = compliance + minimum, with identical usage rates on Launch and Scale (no rate penalty)	The 1.28 GB per-query minimum can make high-QPS, data-light workloads cost more than the $1/PB rate implies
Built-in cost levers: `copy_from_namespace` 50% discount and GB-hour pinning	Enterprise 35% usage premium is opaque until you model real volumes through the calculator
Public cost calculator lets buyers self-estimate before talking to sales	Pinning, branching, and multi-vector-column billing add modeling complexity for finance teams

Billing UX : an interactive cost calculator and a public pricing changelog

Cost calculator — an interactive widget on the pricing page lets you set vector dimensions, attribute size, plan tier, and per-dimension volumes (storage GB, writes/WPM, queries/QPM, namespaces) to produce an estimated monthly cost (with the Launch $16/mo floor applied). The sliders surface per-unit ceilings — storage up to $0.33/GB, writes up to $2.00/GB, queries up to $1.00/PB.
Plan selector — Launch / Scale / Enterprise toggle inside the calculator recomputes the estimate, surfacing the Enterprise 35% usage premium.
Pricing Changelog — a public, dated docs page (/docs/pricing-log, last updated June 18, 2026) records every rate change over time, including the June 2026 Launch-minimum cut from $64/mo to $16/mo and the Feb 2026 query-rate cut from $5/PB to $1/PB.
copy_from_namespace — a first-class API operation billed at a 50% write discount, exposing migration/copy economics directly in the product.
Namespace pinning — an explicit billing mode (GB-hours) customers can opt into for latency-sensitive namespaces, with documented 64 GB / 10-minute minimums.
Contact-sales workflow — the Enterprise path routes to a sales form (“let’s get puffin’”) advertising response within a few hours.

Strategic wins : why object-storage economics and a floor-based model landed

1. Pricing follows architecture — and the architecture gets cheaper

By building on object storage instead of RAM, turbopuffer’s unit cost falls as object-storage and caching improve, which is why it can keep cutting query prices (5× in Feb 2026) while growing revenue. Most vector databases priced on provisioned memory cannot do this without margin pain. This is a textbook case of choosing a value metric aligned to a deflating cost base — a principle we cover in choosing the right usage metric.

2. The minimum-as-floor model captures small accounts without a free tier

By making the tier price a floor on usage rather than a platform fee, turbopuffer guarantees revenue from every account while still letting heavy users pay pure usage. The June 2026 cut of that floor from $64 to $16 lowers the entry bar to widen the top of funnel without abandoning the mechanic — it is still a commitment floor that funds support and compliance without the seat math, and it sidesteps the free-tier abuse that plagues consumption infra. See our take on usage minimums and commitments.

3. The public pricing changelog is a trust weapon

Publishing every rate change on a dated /docs/pricing-log page — all of them cuts — signals to buyers that prices will not creep up after lock-in. For infrastructure that sits in a critical path, that predictability is worth more than a marginal discount, and it pre-empts the pricing-change backlash that hits vendors who change quietly.

4. Compliance is decoupled from unit price

Launch and Scale share usage rates; the higher Scale floor buys HIPAA/SSO/audit, not pricier units. Customers never feel taxed on usage for needing compliance — they pay a higher minimum, not a higher rate — which keeps the metered story clean and defensible.

Areas to improve : surface the gated rates and the per-query floor up front

1. Publish the base storage and write rates as static numbers

Today the per-GB storage and write rates live only inside the interactive calculator, so they are invisible to AI search engines and hard to cite in a budget memo. Publishing them as plain text (even as “from $X/GB”) would improve price transparency and let finance teams model without the widget.

2. Make the 1.28 GB per-query minimum prominent

High-QPS, data-light workloads are billed on the per-query floor, not the $1/PB headline — a surprise for teams that size their bill off the marquee rate. A worked “query-heavy vs data-heavy” example on the pricing page would set expectations and reduce sticker shock, in the spirit of honest bill-shock prevention.

3. Quote the Enterprise 35% premium against a worked baseline

”≥$4,096/mo + 35% usage premium” is hard to internalize without numbers. Showing a single illustrative single-tenant workload at both Scale and Enterprise rates would make the premium concrete and shorten sales cycles.

4. Offer an annual commit discount for predictable spend

turbopuffer already has a commitment floor; layering a documented annual-commit discount on top would reward predictable large accounts and formalize the commitment economics buyers expect at the Enterprise tier.

Monetization stack & signals : how turbopuffer builds & buys its revenue engine

Buys 0 Builds 1 1 signal role

The read — where the monetization investment is going

Builds the revenue engine, doesn't buy it: turbopuffer's usage meter (bytes scanned per query, writes, GB-month) is intrinsic to its own product, and it is shipping billing in-house this year — see the product-engineer tell-role below. No payments, billing-platform or metering vendor is sourceable in its docs or postings.

Stack — build vs buy

Builds in-house · 1

In-house usage metering + billing In-house build Job post 1 Docs 2 Jun 2026

“we're doubling down on our developer experience this year and looking to ship a host of highly requested features and improvements (playgrounds, observability, alerting, insights, billing).”

What the hiring reveals

View open roles

product engineer Billing engineering Jun 22, 2026

turbopuffer is building billing as an in-house product surface this year — not buying a billing UI — bundling it with observability and usage insights. The meter behind its $1/PB usage pricing is its own; monetization is a product-engineering build, not a packaging tweak.

“ship a host of highly requested features and improvements (playgrounds, observability, alerting, insights, billing) ... help customers understand, monitor, and optimize their tpuf workloads.”

2 more matched roles — supporting evidence

outbound growth Growth Jun 22, 2026
field growth marketer Growth Jun 22, 2026

Signals reviewed Jun 2026 · derived from public job posts, product docs

Job postings fill and close over time — once a posting is filled we keep it as a dated citation (the quoted evidence remains); use View open roles for current listings.

Key takeaways

Align the value metric to a deflating cost base. turbopuffer prices on object-storage-backed usage, so it can cut prices as its costs fall — a structural advantage RAM-priced competitors lack.
A minimum-as-floor beats a platform fee for consumption infra. Charging the greater of usage or a tier minimum captures small accounts and funds support without a free tier or seat math.
Decouple compliance from unit price. Selling HIPAA/SSO/SLA as a higher minimum rather than a higher rate keeps the metered story clean and avoids penalizing regulated buyers on usage.
A public, dated pricing changelog is a moat, not a liability. When every logged change is a cut, transparency becomes a reason to trust the vendor through lock-in.
Watch the per-unit minimums. A 1.28 GB per-query floor means the headline $1/PB rate understates cost for query-heavy, data-light workloads — always model the minimums, not just the marquee rate.

UBP implications

Object storage reframes vector-DB economics. Pricing anchored to storage instead of memory lets a usage-based vendor lower prices over time, shifting the competitive axis from features to unit cost — a pattern other usage-based infra categories will follow.
The “floor, not fee” pattern is a clean substitute for free tiers. For high-cost consumption products, a tier-scaled monthly minimum monetizes small accounts and discourages abuse while preserving pure-usage economics for heavy users.
Per-request minimums are the hidden lever in metered pricing. turbopuffer’s 1.28 GB per-query floor shows how a single minimum can dominate the bill for a whole workload class — every usage-based vendor should model and disclose its smallest billable unit.

Sources

turbopuffer pricing page — Launch/Scale/Enterprise tiers, minimums, and cost calculator (accessed 2026-07-14)
turbopuffer pricing changelog — dated record of every rate change 2024–2026 (accessed 2026-07-14)
turbopuffer Enterprise / contact sales — single-tenancy, BYOC, SLA enterprise path (accessed 2026-07-14)
turbopuffer documentation — Pinning — GB-hour pinning billing mechanics (accessed 2026-06-04)
See more usage-based pricing breakdowns in the UsagePricing blueprint corpus.

Bottom line

turbopuffer is one of the cleanest examples of pricing that follows architecture: by building search on object storage rather than RAM, it can charge pure usage, keep cutting query prices, and still hit a $100M run-rate. The tier price is a floor — not a fee — and the public pricing changelog turns transparency into a trust advantage. The catch is in the minimums: no free tier, a $16/mo entry floor (cut from $64 in June 2026), and a 1.28 GB per-query scan minimum that can dominate query-heavy bills. Model the floors, not just the $1/PB headline — and compare it against the rest of the blueprint corpus.

Pricing timeline : Major events on a vertical axis

Each milestone below corresponds to a public pricing change, product launch, or material adjustment. Major events use a filled marker; minor adjustments use a faded one.

Current snapshot — usage pricing with tier-scaled minimums

Jul 2026

Three tiers (Launch $16/mo, Scale $256/mo, Enterprise ≥$4,096/mo at a 35% usage premium) act as monthly minimums on metered usage. Queried-data base rate is $1/PB, min 1.28 GB billable per query; writes and storage metered per GB with copy_from_namespace at a 50% write discount. The pricing-page cost calculator now surfaces the per-unit ceilings: storage up to $0.33/GB, writes up to $2.00/GB, queries up to $1.00/PB.

captured 2026-07-14

Launch plan minimum cut from $64/mo to $16/mo

Jun 2026

turbopuffer reduced the Launch plan monthly minimum from $64 to $16 (documented June 18, 2026). Launch customers with $16 or less of usage in a month pay $16; heavier usage bills on usage. Applies to all Launch customers, new and existing, reflected from the July 1st invoice. Source: turbopuffer.com/docs/pricing-log.

Namespace pinning billed in GB-hours

Apr 2026

Introduced namespace pinning, which bills pinned namespaces in GB-hours instead of per-query rates; cost scales with namespace size, replica count, and time pinned, with minimums of 64 GB and 10 minutes. Source: turbopuffer.com/docs/pricing-log (last updated April 3, 2026).

Multi-vector-column billing clarified

Mar 2026

Filterable attributes are billed once per vector column for writes and storage (reflecting per-column ANN index cost); non-filterable attributes are billed once regardless of vector-column count. Source: turbopuffer.com/docs/pricing-log.

Base query rate cut from $5/PB to $1/PB (up to 94% off largest namespaces)

Feb 2026

Base queried-data rate dropped from $5/PB to $1/PB; added an 80% marginal discount between 32–128 GB and 96% above 128 GB per query; minimum billable data per query rose from 256 MB to 1.28 GB. Source: turbopuffer.com/docs/pricing-log.

Query pricing for large namespaces cut by up to 80%

Jul 2025

Introduced an 80% marginal discount on bytes queried over 32 GB per query, reducing the cost of scanning large namespaces. Source: turbopuffer.com/docs/pricing-log.

copy_from_namespace introduced at a 50% write discount

Sep 2024

turbopuffer added the copy_from_namespace operation, allowing data to be copied between namespaces at a 50% discount on write costs — making bulk migration and branching cheaper than re-ingesting. Source: turbopuffer.com/docs/pricing-log.

Trivia

· turbopuffer's $1/PB base query rate is the price for scanning a full petabyte of data per query — the result of a February 2026 cut from $5/PB that lowered query costs for the largest namespaces by up to 94%.
· turbopuffer has no free tier at all; unlike most vector databases its cheapest entry point is a $16/month usage minimum on the Launch plan (reduced from $64 in June 2026).
· turbopuffer reached an estimated $100M annualized run-rate in March 2026 having raised less than $1M in total primary capital, per its CEO — an unusually capital-efficient infrastructure business.

Questions & answers

Does turbopuffer have a free tier?: No. turbopuffer has no free tier — its cheapest entry point is the Launch plan's $16/month usage minimum (cut from $64 in June 2026). You pay the greater of your metered usage or that minimum.
How is turbopuffer priced?: Purely on usage across three dimensions: writes per GB, queries per unit of data scanned (base $1/PB), and stored data per GB-month. A tier-scaled monthly minimum ($16 / $256 / ≥$4,096) acts as a floor on that usage.
What's the difference between the Launch, Scale, and Enterprise tiers?: The tiers set your monthly minimum and your compliance/support envelope, not your base usage rates. Launch and Scale charge identical usage rates; Scale adds HIPAA-ready BAA, SSO, and audit logs. Enterprise adds single-tenancy, BYOC, and SLAs but applies a 35% premium on usage.
How much did turbopuffer's query pricing change in 2026?: In February 2026 the base queried-data rate dropped from $5/PB to $1/PB, with up to 94% reductions for the largest namespaces via marginal discounts (80% for 32–128 GB, 96% above 128 GB per query). The minimum billable data per query rose from 256 MB to 1.28 GB.
Does turbopuffer charge for storage and writes too?: Yes. Beyond queries, you pay per GB written and per GB-month of stored data. As of mid-2026 the pricing-page cost calculator surfaces these base rates as per-unit ceilings — storage up to $0.33/GB and writes up to $2.00/GB — though the exact figures still live inside the calculator widget rather than as static published numbers.
What is namespace pinning and how is it billed?: Pinning keeps a namespace warm for latency-sensitive workloads. Introduced in April 2026, pinned namespaces bill in GB-hours (scaling with size, replica count, and time pinned) instead of per-query rates, with minimums of 64 GB and 10 minutes.