Data Platform Pricing: Examples & Companies

21 companies in the corpus Updated full analysis
Definition

Data Platform Pricing is Pricing for data platforms — scraping, enrichment, search API, and knowledge-graph vendors.

Also known as: Data Collection PricingWeb Data Platform Pricing

What is it

Data Platform Pricing is pricing for data platforms — scraping, enrichment, search API, and knowledge-graph vendors.

These are the companies that sell the structured data AI systems need for grounding, enrichment, and training. The category spans four overlapping sub-types: web-extraction and proxy platforms (Bright Data, Oxylabs, ScraperAPI, Firecrawl, Apify, Browse AI); search and knowledge APIs (SerpApi, Linkup, Diffbot, Nomic); GTM enrichment and analytics (Clay, Julius AI, Powerdrill, Rows); and the data infrastructure underneath them — vector stores, metering layers, and compute (turbopuffer, Upstash, OpenMeter, Anyscale).

What unites the category is a structural fact: the cost driver is volume of data, not headcount. A single account can run millions of automated requests or store terabytes of vectors, so seat-based pricing has no relationship to the cost of service. Every company on this page therefore meters something — requests, bandwidth, records, credits, or storage — even when it wraps that meter in a monthly subscription tier.

The category is also consolidating. ScraperAPI’s parent acquired Traject Data; the proxy and scraping vendors are converging on near-identical per-GB and per-1,000-results rate cards. The differentiation is no longer the meter itself but the generosity of the included volume, the steepness of the overage curve, and how honestly the unit math is exposed on the public pricing page. For the strategy behind these choices, see our introduction to usage-based pricing and the data pipeline use-case breakdown.

How it works

Data platforms pick a meter that tracks their dominant cost, then wrap it in either pure pay-as-you-go or a subscription tier that pre-buys volume. The five common meters and their canonical examples:

Billing unitWhat it countsExample rateCompanies
Bandwidth (per GB)Data transferred through proxies/browsers$8 → $2.50/GB (residential)Bright Data, Oxylabs, Apify
Requests / results (per 1K)Successful API responses$0.50–$1.50 / 1K resultsOxylabs, Bright Data, SerpApi
Credits (per pool)Abstracted unit, ≈1 page or 1 call$0.001/credit (Diffbot)Firecrawl, ScraperAPI, Diffbot, Clay
Records (per 1K)Rows of ready-made data$2.50 / 1K recordsBright Data datasets
Storage + queryGB-month stored, data queried$1/PB queried (turbopuffer)turbopuffer, Upstash

Worked example — credit-pool scraping. Firecrawl’s Standard plan is $83/mo for 500,000 credits, where 1 credit ≈ 1 scraped page. A pipeline scraping 480,000 pages/month stays inside the pool and pays the flat $83. Cross to 520,000 pages and the 20,000 overflow auto-recharges at the plan’s per-credit rate — the model behaves like a subscription until you exceed the pool, then like pure usage.

Worked example — per-GB proxy. Oxylabs residential proxies start at $6/GB on the $30/mo Starter (5GB), declining to $2.50/GB on the 1TB Corporate tier. Moving 200GB of residential traffic costs roughly $500–$1,200 depending on the committed tier — the unit math is identical to Bright Data’s $8 → $2.50/GB ladder, which is why included-volume and overage steepness, not headline rate, decide the deal.

Worked example — pure usage with a floor. turbopuffer meters writes, queries (now $1/PB after a February 2026 cut from $5/PB), and per-GB-month storage, but every tier carries a monthly minimum — $64, $256, or ≥$4,096. A workload billing $40 of metered usage on the $64 tier still pays $64; the floor is the real entry price. See usage invoicing and billing cycles for how minimums and overage are reconciled on the invoice.

Companies using this

The corpus tracks 21 data platforms spanning web extraction, search APIs, enrichment, analytics, and the vector and metering infrastructure underneath. The table below lists each company with its pricing model, billing units, free-tier status, and last-verified date.

Patterns observed

The meter follows the cost driver, and changes per product. Bright Data is the clearest case: bandwidth-heavy residential proxies bill per-GB, static ISP/datacenter proxies bill per-IP, scraping and SERP APIs bill per 1,000 results, and ready-made datasets bill per 1,000 records — four billing units under one roof, each tracking the dominant cost of its product line. Oxylabs mirrors this exactly: per-GB and per-IP proxies, per-1K-results scraper APIs. The lesson is that “data platform pricing” is rarely a single model; it’s a portfolio of meters.

Credits are the abstraction of choice for scraping and enrichment. Firecrawl (1 credit ≈ 1 page), ScraperAPI (per-request multiplier on a credit allotment), Diffbot ($0.001/credit on prepaid pools), and Clay (Actions capacity plus a Data Credits pool) all use a credit layer. Credits let these vendors expose many features — crawl vs scrape, standard vs premium proxy, basic vs AI enrichment — at different internal rates without forcing the buyer to learn each one, and let the vendor change underlying economics without renegotiating.

Free volume is table stakes, but only on the legible meters. Diffbot gives 10,000 credits/month free, Firecrawl 1,000 credits, SerpApi a free monthly search allowance, and Linkup a recurring $20 balance (~4,000 searches). The free tier sits on the request/credit meter where marginal cost is low. Raw bandwidth and bulk datasets — where Bright Data charges from the first GB — rarely get a free allowance, because the cost of serving is real and immediate.

Subscription tiers are increasingly just pre-bought usage. ScraperAPI’s seven-tier ladder, Firecrawl’s six tiers, and Julius AI’s credit-pool subscriptions all present as flat monthly plans but are sized by an included usage allotment with overage beyond it. This hybrid framing — predictable floor, metered ceiling — is now the default for the category, blending the budgeting comfort of subscription with the cost-alignment of usage.

Counterexamples & variants

Storage-and-query infra prices nothing like a scraper. turbopuffer and Upstash are data platforms, but they store and serve data rather than extract it, so their meters are writes, queries, vectors indexed, and GB-month storage — not requests or records. turbopuffer’s tier-scaled monthly minimum (up to ≥$4,096) is a commitment mechanic that scraping APIs almost never use, because their buyers expect to start at zero. Treating these vector/cache platforms as interchangeable with scraping APIs would mislead on both the meter and the entry price.

The human-data marketplaces have no public meter at all. Mercor and micro1 sell data partnerships and human-labeling/RL-environment work to frontier AI labs, and both are sales-quoted with no visible price — Mercor discloses only the hourly expert pay, not the buyer take-rate. They sit in the data-platform category because the product is data, but the pricing motion is enterprise sales-led, the opposite of the self-serve per-request norm. They’re a reminder that “data platform” spans both a $0.005-per-request API and a multi-million-dollar custom data contract.

Open-core breaks the usage frame entirely. OpenMeter is the metering-and-billing platform other usage-based vendors run on, and it’s priced open-core: a free Apache-2.0 self-hosted edition plus a contact-sales managed cloud (now Kong Metering & Billing). There is no per-event public rate — the free tier is the entire software, and revenue comes from the managed and enterprise upsell. It’s a variant where the “data” being platformed is the customer’s own usage events, and the pricing logic inverts the rest of the category.

What this means for buyers vs vendors

For buyers

Forecast on your actual cost driver, not the headline tier. A scraping pipeline should model credits-per-page or successful-results-per-1K; a proxy workload should model GB-per-month; a vector store should model writes, queries, and stored GB plus the tier minimum. The headline rates across Bright Data and Oxylabs are near-identical, so the deal is decided by included volume and overage steepness — get both in writing. Watch for floors: turbopuffer’s monthly minimum means low-usage months still cost the tier price, and ScraperAPI’s concurrent-thread caps can throttle throughput regardless of credit balance. Use the free allowances — Diffbot’s 10,000 credits, Firecrawl’s 1,000, Linkup’s recurring $20 — to benchmark real per-job cost before committing to a tier.

For vendors

Pick the meter that tracks your dominant marginal cost, and don’t be afraid to run multiple meters if you sell multiple products — Bright Data’s four-unit portfolio proves buyers tolerate this when each meter is intuitive for its product. Use a credit abstraction when you need to expose many features at different internal rates or expect to change economics, as Firecrawl and Diffbot do — but keep the credit-to-output mapping legible (1 credit ≈ 1 page) or buyers will distrust it. Put a free allowance on your cheapest-to-serve meter to drive self-serve acquisition, and reserve commitment minimums for infra products like vector and cache stores where buyers expect them. If you’re building the metering itself, study OpenMeter’s open-core path; for the broader strategy, our usage-based pricing introduction and billing-cycle guide cover floor reconciliation and overage design.

Company Product Pricing modelBilling unitsFree tier Verified
AnyscaleManaged Ray platform for distributed AI training, inference, and batch processing (RayTurbo, Anyscale Compute Units)
pure-usagecommitmenthybrid
gpu-hourscpu-hourscredits
Yes2026-05-29
ApifyApify Platform — web scraping and browser-automation cloud with an Actors marketplace
hybridfreemium
gb-hourscreditsbandwidth-gb+2
Yes2026-06-03
Bright DataWeb data platform — proxy networks, scraping APIs, a managed scraping browser, SERP and unlocker APIs, ready-made datasets, and eCommerce insights
pure-usagehybridcommitment+1
bandwidth-gbrequestsrecords+1
Yes2026-06-04
Browse AINo-code web scraping and website-monitoring platform that turns any site into a structured dataset or API
freemiumhybridcommitment
creditsseats
Yes2026-06-04
ClayAI-powered GTM data-enrichment and outbound platform billed on Actions plus Data Credits
hybridfreemiumcommitment
creditsactions
Yes2026-06-02
CometAI/ML observability and experiment-tracking platform — Opik (LLM/agent observability) and Comet MLOps (experiment tracking)
freemiumseat-basedhybrid
seatsgpu-hoursstorage-gb
Yes2026-06-02
DiffbotWeb-extraction APIs (Extract, Crawl, Natural Language) plus a Knowledge Graph, metered on monthly credits
hybridfreemium
creditsapi-calls
Yes2026-06-04
FirecrawlWeb-scraping and data-extraction API for AI agents — scrape, crawl, map, search, and extract pages into clean markdown/JSON
subscriptionhybridfreemium
creditspages-renderedapi-calls+1
Yes2026-06-02
Julius AIJulius AI — AI data-analyst chat & notebooks
subscriptionfreemium
seatscredits
Yes2026-06-08
LinkupWeb search API for AI agents — Search, Fetch, and async Research endpoints with grounded, structured results
pure-usagefreemium
requestscreditsapi-calls
Yes2026-06-04
MercorAI talent marketplace + enterprise data partnerships for frontier AI labs
pure-usage
tasks
No2026-06-08
micro1Human-data engine, RL environments, and agent evaluation for frontier AI labs
pure-usage
tasks
No2026-06-08
NomicNomic Platform (AEC agentic workflows) + Atlas data-exploration app + Nomic Embed embedding/Developer API
hybridseat-basedcommitment+1
seatstokenscredits+2
Yes2026-06-04
OpenMeterOpen-source usage metering and billing platform for AI, agentic, and developer tools
freemium
eventsapi-calls
Yes2026-06-03
OxylabsWeb data collection: residential, datacenter, ISP & mobile proxies plus Web Scraper API and Web Unblocker
hybridpure-usagefreemium
bandwidth-gbipsrecords+1
Yes2026-06-04
PowerdrillAI-native data analytics platform that turns spreadsheets, PDFs, and databases into insights via specialized data agents
subscriptionfreemium
creditsseats
Yes2026-06-08
RowsRows AI spreadsheet
subscriptionhybrid
seatstasksapi-calls
Yes2026-06-08
ScraperAPIWeb scraping API that handles proxies, browsers, and CAPTCHAs behind a single endpoint
subscriptionpure-usage
creditsrequestsapi-calls
No2026-06-04
SerpApiReal-time search-results API (Google, Bing, and other engines)
subscriptionpure-usage
api-callsrequests
Yes2026-06-04
turbopufferServerless vector and full-text search database on object storage
pure-usagecommitment
storage-gbvectors-indexedgb-hours+1
No2026-06-04
UpstashUpstash (Redis, Vector, QStash, Search, Workflow)
pure-usagefreemiumhybrid
requestsapi-callsvectors-indexed+3
Yes2026-06-03

FAQ

What is data platform pricing?

Data platform pricing is the set of usage-based models used by scraping, enrichment, search-API, and knowledge-graph vendors. Companies typically meter per request, per GB of bandwidth, per record, or per credit, with tiered volume discounts and a free or trial allowance.

Why do data platforms charge per request or per GB instead of per seat?

The cost driver for a data platform is the volume of data extracted or delivered, not the number of users. A single account might run millions of automated requests, so per-request, per-GB, and per-record meters align the bill with the vendor's actual compute, proxy, and bandwidth cost.

Which billing unit is most common for web scraping APIs?

Per-request or per-credit billing dominates scraping APIs. ScraperAPI and Firecrawl sell monthly credit pools (1 credit ≈ 1 scraped page), while Bright Data and Oxylabs bill scraping endpoints per 1,000 successful results and proxies per GB or per IP.

Do data platforms offer free tiers?

Most do. Diffbot includes 10,000 credits per month free, Firecrawl gives 1,000 credits, SerpApi offers a free monthly search allowance, and Linkup refills a $20 balance every month — though pure proxy and high-volume datasets are usually paid from the first GB.

How do vector and analytics data platforms price differently from scrapers?

Vector and analytics platforms meter storage and compute rather than extraction volume. turbopuffer bills per write, per query, and per GB-month of storage with a tier-scaled monthly minimum, while Julius AI and Powerdrill use credit pools sized per subscription tier.

Trivia

  • Bright Data changes the meter per product: bandwidth-heavy residential proxies bill per-GB ($8 → $2.50), static ISP/datacenter proxies bill per-IP, scraping and SERP APIs bill per 1,000 results, and ready-made datasets bill per 1,000 records — one company, four different billing units.

  • turbopuffer cut its queried-data rate from $5/PB to $1/PB in February 2026, an 80% price drop on the query meter while keeping the storage and write meters intact.

  • Linkup refills a $20 prepaid balance every month for free — roughly 4,000 standard searches at $0.005 each — with no card required, making it one of the few pure-usage data APIs with a recurring free balance rather than a one-time trial.

See all pricing trivia

Related product categories

Back to companies