AI Summary
About
xAI is the foundation-model lab founded by Elon Musk in 2023 to build Grok, a family of frontier reasoning models. It monetizes two distinct surfaces: a public, per-million-token developer API (this page’s focus) for the Grok models and agentic tools, and a separate freemium consumer app (Free, SuperGrok, and up) that wraps the same models in a chat product. Enterprise — custom rate limits, dedicated infrastructure, SSO, and compliance — is sold through a sales motion.
xAI’s defining structural move was its March 2025 all-stock acquisition of X (formerly Twitter), which valued X at $33 billion ($45 billion less $12 billion of debt) and xAI at roughly $80 billion, forming a combined entity (X.AI Holdings Corp) worth about $113 billion. That merger folded X’s real-time social data, distribution, and audience into the lab — and is the reason live X-search shows up as a billable agentic tool on the API. xAI has since raised at escalating valuations (around $200 billion in a 2025 equity raise, with later rounds reported higher), funding an aggressive compute build-out anchored by Colossus, a Memphis supercluster xAI describes as the world’s largest, scaled to over 100,000 GPUs in under a year.
The model catalog has moved fast. The API opened in October 2024 with grok-beta; Grok 3 and Grok 3 mini followed in 2025, then Grok 4 (July 2025) and the cost-optimized Grok 4 Fast. By mid-2026 the published lineup consolidated on the grok-4.x generation — grok-4.3 as the flagship (1M-token context), grok-4.20 reasoning / non-reasoning / multi-agent variants, and grok-build-0.1 for coding — while older rungs were retired. Throughout, xAI has cut per-token prices steeply, positioning Grok as a price-aggressive frontier option versus closed-weight rivals like OpenAI and Anthropic and the open-weight Mistral AI.
Pricing summary : a public per-token API plus a freemium consumer app
xAI runs a two-surface model: pure usage-based pricing for the Grok developer API, billed per million tokens, and a separate freemium subscription ladder for the consumer Grok app. The dimensions are:
- API tokens — separate input and output rates per million tokens by model. grok-4.3 (flagship, 1M context) is $1.25 in / $2.50 out; grok-build-0.1 (coding, 256k context) is $1.00 in / $2.00 out. Cached input is billed at $0.20 / 1M tokens across the lineup — about 85% off the cache-miss rate.
- Agentic tools — billed per 1,000 calls outside the token meter: Web Search, X Search, and Code Execution at $5 / 1k calls each; Collections Search at $2.50 / 1k; File Attachments at $10 / 1k.
- Media & voice — images from $0.02 each, video from $0.05/second, text-to-speech at $15 per 1M characters, realtime voice at $0.05/minute, and speech-to-text from $0.10/hour.
- Consumer app seats — Free ($0), SuperGrok ($30/mo), plus SuperGrok Heavy, Business, and Enterprise — a flat-rate subscription surface distinct from the per-token API.
What makes this different: xAI publishes raw per-million-token billing and prices live X (Twitter) search as a metered agentic tool — turning a proprietary social-data feed, acquired in the X merger, into a per-call line item that closed-data rivals can’t replicate.
Pricing by product
Grok API — models (per million tokens, USD)
| Model | Input /M | Cached input /M | Output /M | Key mechanics |
|---|---|---|---|---|
| grok-4.3 | $1.25 | $0.20 | $2.50 | Flagship reasoning model; 1M-token context |
| grok-4.20 (reasoning) | $1.25 | $0.20 | $2.50 | Reasoning variant; 1M context |
| grok-4.20 (non-reasoning) | $1.25 | $0.20 | $2.50 | Lower-latency variant; 1M context |
| grok-4.20 (multi-agent) | $1.25 | $0.20 | $2.50 | Multi-agent orchestration; 1M context |
| grok-build-0.1 | $1.00 | $0.20 | $2.00 | Coding-focused; 256k context |
Cached input is billed at $0.20 / 1M tokens — roughly 85% off the cache-miss rate.
Grok API — agentic tools & media (USD)
| Service | Price | Key mechanics |
|---|---|---|
| Web Search | $5 / 1,000 calls | Live web retrieval as an agent tool |
| X Search | $5 / 1,000 calls | Live X (Twitter) search — xAI’s data moat |
| Code Execution | $5 / 1,000 calls | Sandboxed code-running tool |
| Collections Search | $2.50 / 1,000 calls | Retrieval over indexed collections |
| File Attachments | $10 / 1,000 calls | Document attachment processing |
| Image generation | $0.02–$0.05 / image | grok-imagine (quality tier higher) |
| Video generation | $0.050–$0.080 / second | grok-imagine video |
| Text to Speech | $15 / 1M characters | Voice synthesis |
| Realtime voice | $0.05 / minute | Live voice ($3/hr) |
| Speech to Text | $0.10–$0.20 / hour | REST vs streaming |
Consumer Grok app — subscription ladder
| Tier | Price | Included | Key mechanics |
|---|---|---|---|
| Free | $0 | Real-time web + X search, voice mode, connectors, SOC 2 | Entry point for the consumer app |
| SuperGrok | $30 / mo | Grok 4 model, higher rate limits, Expert mode, image + video generation | ”Popular” consumer tier |
| Business | Contact us | Team seat management, consolidated billing, RBAC, user analytics | Team admin controls |
| Enterprise | Contact us | Custom rate limits, SSO & SCIM, data residency, dedicated data plane, CMEK | Sales-led, quoted |
Sales motions across products: PLG / self-serve for the pay-as-you-go API and the Free + SuperGrok consumer tiers; sales-led for Business, Enterprise, and custom volume API pricing (dedicated infrastructure, SSO, compliance, data residency).
Hidden costs : What xAI users actually pay
xAI’s headline token rates are unusually low, but the real API bill is shaped by three things the per-model row doesn’t show: the output-token premium, the separately-billed agentic tools, and the cache-hit ratio that decides whether you pay $1.25 or $0.20 for input. Two archetypes show how the total assembles.
Archetype 1 — a developer running a live-research agent on the API. Answering questions with grok-4.3 (assume ~40M input + ~12M output tokens/month), plus 30,000 X-search calls and 10,000 code-execution calls a month, with roughly half the input served from cache.
| Line item | Monthly cost |
|---|---|
| grok-4.3 input — 20M tok @ $1.25/M (cache-miss) | $25.00 |
| grok-4.3 cached input — 20M tok @ $0.20/M | $4.00 |
| grok-4.3 output — 12M tok @ $2.50/M | $30.00 |
| X Search — 30,000 calls @ $5 / 1,000 | $150.00 |
| Code Execution — 10,000 calls @ $5 / 1,000 | $50.00 |
| Estimated total | ~$259/mo |
The lesson: on grok-4.3 the agentic tools dominate the bill — $200 of live-search and code-execution calls dwarfs the ~$59 of token cost. The token rates are cheap by frontier standards; the variable cost has shifted to the per-call tools. A high cache-hit ratio (here halving input to $0.20/M) further shrinks the token line, so prompt-caching discipline matters more than model choice for repeat-context workloads.
Archetype 2 — a 10-person team on SuperGrok. Ten seats at $30/user/mo on the consumer Grok app, using it for research and drafting rather than building on the API.
| Line item | Monthly cost |
|---|---|
| SuperGrok — 10 seats @ $30 | $300.00 |
| Business plan upgrade (admin/RBAC, est.) | quote |
| Estimated total | ~$300/mo + admin |
Here the surprise is the surface split: a team that grows past individual SuperGrok seats is steered toward the Business tier (team seat management, consolidated billing, RBAC) — which is contact-us rather than a published per-seat price, so the jump from $30 self-serve to a quoted team plan is the hidden step.
Want to estimate your own xAI bill? Use the xAI pricing calculator to model your costs based on token volume, agentic-tool calls, and seat count.
Pricing evolution : xAI pricing history and changes
xAI’s API has billed per million tokens since the grok-beta public beta opened in October 2024 — and the per-token price has fallen steeply as the model lineup advanced. The flagship rate dropped from $5 / $15 (grok-beta) to $3 / $15 (Grok 4) to $1.25 / $2.50 (grok-4.3), even as context windows grew from 128k to 1M tokens. The dated milestones below are reconstructed from primary announcements and contemporaneous press; per-snapshot reconstruction will be tightened with archived captures on a later pass.
Cadence
| Quarter | Price changes | Product / SKU additions | Notes |
|---|---|---|---|
| 2024 Q4 | 1 | 1 | 2024-10 Grok API public beta opens; grok-beta at $5 / $15 per 1M tokens, $25/mo free credits |
| 2025 Q1 | 0 | 1 | 2025-03 xAI acquires X in an all-stock deal; live X-search becomes a data asset |
| 2025 Q2 | 1 | 1 | 2025-06 Grok 3 ($3 / $15) and Grok 3 mini ($0.30 / $0.50) reach the API |
| 2025 Q3 | 1 | 2 | 2025-07 Grok 4 ($3 / $15, 256k context); Grok 4 Fast added ($0.20 / $0.50, up to 2M context) |
| 2026 Q2 | 1 | 1 | 2026-05 lineup consolidates on grok-4.x; flagship grok-4.3 at $1.25 / $2.50, cached $0.20; legacy models retired |
Tracked range: 2024 Q4–2026 Q2. Quarters not listed had no publicly announced price or SKU change. Dated milestones below cite primary/secondary sources; per-snapshot price reconstruction is a later pass.
Notable changes
- 2024-10 — Grok API public beta opens with grok-beta at $5 / $15 per 1M tokens and $25/month in free credits (TechCrunch, InfoQ).
- 2025-03 — xAI acquires X in an all-stock deal valuing X at $33B and xAI at ~$80B; the merger underpins live X-search as a billable tool (CNBC).
- 2025-06 — Grok 3 ($3 / $15) and Grok 3 mini ($0.30 / $0.50) reach the API, adding the first low-cost rung.
- 2025-07 — Grok 4 launches at $3 / $15 with a 256k context window; Grok 4 Fast follows at $0.20 / $0.50 with context up to 2M tokens.
- 2026-05 — The lineup consolidates on grok-4.x; flagship grok-4.3 lands at $1.25 / $2.50 with cached input standardized at $0.20/M, and legacy models (Grok 3, Grok 4 Fast) are retired.
The price-down march in detail
xAI’s pricing story is a sustained per-token markdown paired with capability gains. The grok-beta launch price of $5 / $15 was squarely in the frontier band of late 2024; by mid-2026 the flagship grok-4.3 charged $1.25 / $2.50 — a roughly 75% cut on input and 83% on output — while the context window grew nearly 8x (128k to 1M). Rather than hold a premium price and harvest margin, xAI used compute scale (Colossus) and the X-data advantage to compete on price, betting that cheap frontier inference plus a proprietary live-search tool wins developer share faster than a high sticker rate. The variable cost it does charge aggressively for is the agentic tools — $5 / 1k for live search and code execution — signaling where xAI thinks the durable value (and willingness to pay) actually sits.
What’s unique : xAI’s distinctive pricing mechanics
1. Live X-search as a metered tool. The March 2025 X acquisition turned a proprietary social-data feed into a billable agentic tool — X Search at $5 per 1,000 calls, priced identically to web search but backed by data no closed competitor can match. xAI prices the data access per action, not per token, making real-time social context an explicit line item rather than a free model feature.
2. Aggressive token markdowns on a frontier model. Where most labs hold flagship pricing steady and add cheaper minis, xAI cut the flagship itself — $5 to $1.25 on input across 18 months — while expanding context to 1M tokens. The strategy treats cheap frontier inference as a share-grab funded by compute scale, the inverse of premium-priced token economics at rivals.
3. Two surfaces, deliberately separate. Unlike peers that route consumer overage through the API meter, xAI keeps the per-token developer API and the flat-rate consumer Grok app ($0 Free, $30 SuperGrok) as distinct pricing systems. Developers get pure usage billing; consumers get predictable subscriptions — each optimized for its buyer rather than forced through one meter.
4. Cache-first input pricing. Cached input at $0.20/M (about 85% off the $1.25 cache-miss rate) is a structural discount that rewards repeat-context, agentic workloads — exactly the prompt-heavy patterns Grok’s tools encourage. The cache rate, not the headline rate, is the real price for production usage-based agents.
Strengths & weaknesses
| Strengths | Weaknesses |
|---|---|
| Fully public per-million-token API rates — no “contact sales” wall for inference | Agentic tools ($5 / 1k for search and code exec) can dwarf token cost on tool-heavy agents, making totals harder to predict |
| Flagship grok-4.3 at $1.25 / $2.50 undercuts most frontier rivals while offering a 1M-token context | Frequent model churn (grok-beta, 3, 4, 4 Fast, 4.x) and confusing version names make historical price tracking hard |
| Live X-search is a genuine data moat sold as a metered tool | Consumer Business tier is contact-us — no published per-seat price above $30 SuperGrok |
| Cached input at $0.20/M rewards repeat-context agentic workloads | Enterprise (custom rate limits, dedicated infra, compliance) is fully sales-gated with no public floor |
| Clean separation of developer API and consumer app keeps each surface predictable | Rapid retirement of models (Grok 3, Grok 4 Fast in May 2026) can strand integrations on deprecated SKUs |
| Steep, sustained price cuts signal a credible cost-leadership position | API surface is bot-protected / JS-rendered, so transparent as numbers but harder to archive |
Billing UX : usage tracking and overage controls
- API Console usage dashboard — developers monitor token consumption, tool calls, and spend per model through the xAI Console, with prepaid credits and pay-as-you-go billing.
- Free credits — xAI seeded the 2024 public beta with $25/month in free API credits and continues to offer promotional credits to onboard developers.
- Cached-input metering — input served from cache is billed at the lower $0.20/M rate automatically, so prompt-caching discipline directly lowers the bill without a separate plan.
- Per-call tool metering — agentic tools (web search, X search, code execution, collections, file attachments) are tracked and billed per 1,000 calls, separate from the token meter.
- Consumer seat management (Business) — team seat management, consolidated billing, and role-based access control sit on the Business tier of the Grok app.
- Enterprise controls — custom rate limits, SSO & SCIM, advanced audit controls, custom data retention, customer-managed encryption keys, and a dedicated data plane are available on Enterprise.
- No-training & compliance — SOC 2 (Type I & II) compliance is available even on Free; Enterprise adds no-training guarantees and custom retention.
Strategic wins : Why xAI’s pricing decisions worked
1. Monetizing the X data moat as a tool
By pricing live X-search as a $5-per-1k-call agentic tool, xAI turned the most distinctive asset from its X acquisition into a metered, recurring revenue line rather than a free model feature. The per-action price makes real-time social context a value metric competitors can’t copy, anchoring xAI’s differentiation in data access rather than raw model quality. See how outcome-shaped pricing is moving the meter from tokens toward actions.
2. Cost leadership funded by compute scale
Cutting the flagship from $5 to $1.25 on input while growing context to 1M tokens is a deliberate share-grab. By using Colossus-scale compute to drive marginal inference cost down, xAI competes on price where rivals hold premiums — a bet that cheap frontier inference accelerates developer adoption. This mirrors the broader shift away from premium per-token pricing as inference commoditizes.
3. Separating the developer and consumer meters
Keeping the per-token API and the flat-rate consumer app distinct lets each be priced for its buyer: developers get transparent usage billing, consumers get a predictable $30 subscription. Avoiding a forced single-meter design means neither surface compromises — a discipline in choosing the right usage metric per audience that multi-product vendors often miss.
Areas to improve : Gaps in xAI’s pricing approach
1. Surface agentic-tool cost in the headline
Live search and code execution at $5 / 1k calls routinely dominate the bill on agentic workloads, yet they sit below the token table. As Grok becomes a tool-using agent platform, a combined “estimated cost per agent run” view in the Console would prevent the bill-shock and unpredictability that hits when tool calls outrun token spend.
2. Publish a Business floor price
The jump from self-serve SuperGrok ($30) to the contact-us Business tier hides the per-seat team price, forcing a sales call for a mid-market team that just needs RBAC and consolidated billing. A published Business starting price would shorten evaluation, as some peers now do — compare how other AI companies stage enterprise transparency.
3. Stabilize the model lineup and version naming
Rapid model churn and overlapping names (grok-4.3 vs grok-4.20 vs grok-build-0.1) plus the May 2026 retirement of Grok 3 and Grok 4 Fast make it hard to plan around a price. Clearer deprecation timelines and a stable naming scheme would reduce integration risk and make historical token-pricing comparisons legible.
Key takeaways
- Price the data moat, not just the model. xAI sells live X-search as a $5-per-1k-call tool, turning a proprietary feed from the X merger into a metered value metric rivals can’t replicate. The durable differentiation is data access, priced per action.
- Cheap frontier inference is a share strategy. Cutting the flagship from $5 to $1.25 on input while growing context 8x signals cost leadership funded by compute scale — adoption over margin.
- Two surfaces beat one forced meter. Keeping the per-token developer API and the flat-rate consumer app separate lets each be priced for its buyer, instead of laundering consumer overage through an API meter.
- The cache rate is the real price. Cached input at $0.20/M (vs $1.25 cache-miss) means prompt-caching discipline, not model choice, decides production cost for repeat-context agents.
- Tools, not tokens, are where the bill lives. On agentic workloads the $5 / 1k tool calls dwarf token spend — an early signal that the meter is shifting from inference toward actions.
UBP implications
- Proprietary data becomes a per-action meter. xAI shows that a distinctive data asset (live X content) can be priced as a per-call agentic tool rather than bundled free into the model. UBP practitioners with unique data should consider metering access to it as its own unit.
- As inference commoditizes, the value migrates to tools. When flagship token rates fall 75% in 18 months, the variable cost — and the willingness to pay — shifts to the agentic tools layered on top. UBP design should follow the value to the action, not the token.
- Match the meter to the buyer, not the product. Running a pure per-token API for developers and a flat subscription for consumers — two meters, deliberately — lets each audience get the pricing model it actually wants, a reminder that one universal unit isn’t always the right answer.
Sources
- xAI API overview (accessed 2026-06-11)
- xAI documentation — models (accessed 2026-06-11)
- xAI documentation — pricing (accessed 2026-06-11)
- xAI pricing page (consumer plans) (accessed 2026-06-11)
- xAI news — API public beta (accessed 2026-06-11)
- TechCrunch — xAI launches an API (accessed 2026-06-11)
- CNBC — Musk says xAI acquired X at $33B (accessed 2026-06-11)
- Browse the pricing blueprint corpus
Bottom line
xAI prices Grok on two deliberately separate surfaces: a fully public per-million-token developer API (grok-4.3 at $1.25 in / $2.50 out, cached $0.20) with agentic tools metered at $5 per 1k calls, and a freemium consumer app ($0 Free, $30 SuperGrok). The defining moves are a sustained 75% per-token markdown that bets cheap frontier inference wins developer share, and pricing live X-search — the asset from its $33B X acquisition — as a metered tool no closed rival can match. The friction is fast model churn and sales-gated Business/Enterprise tiers; the strength is transparent, aggressively cheap token rates paired with a genuine data moat.
Want to compare xAI against other foundation-model providers? See OpenAI and Mistral AI, or browse the full pricing blueprint.
Pricing timeline : Major events on a vertical axis
Each milestone below corresponds to a public pricing change, product launch, or material adjustment. Major events use a filled marker; minor adjustments use a faded one.
Live snapshot: grok-4.3 at $1.25 / $2.50, tools at $5 / 1k
Captured current USD pricing: grok-4.3 $1.25 input / $0.20 cached / $2.50 output per 1M tokens (1M context); grok-build-0.1 $1.00 / $2.00 (256k); agentic Web/X Search and Code Execution at $5 / 1k calls; consumer Free $0 and SuperGrok $30/mo on the Grok app. (Per-token API rates from docs.x.ai; consumer card from live capture.)
Lineup consolidates; legacy models retired
xAI retires Grok 3, Grok 4 Fast, and several variants, consolidating the published API on the grok-4.x generation (grok-4.3 plus grok-4.20 reasoning / non-reasoning / multi-agent and grok-build-0.1). Cached-input pricing is standardized at $0.20/M across the lineup. (Source: docs.x.ai, third-party trackers, 2026-05.)
Grok 4 Fast introduced (then later retired)
xAI ships Grok 4 Fast at $0.20 / $0.50 per 1M tokens with context up to 2M — a deliberate cost-optimized rung that prices well below the flagship. (Grok 4 Fast and several Grok 3 variants were later retired in May 2026 as the lineup consolidated on the grok-4.x generation.)
Grok 4 launches at $3 / $15 per 1M tokens
xAI releases Grok 4 (released 2025-07-09) with a 256k-token context window, priced at $3 per 1M input and $15 per 1M output tokens — matching the frontier-flagship pricing band of the era. Grok 4 becomes the headline model behind the SuperGrok consumer tier.
xAI acquires X (Twitter) in all-stock deal
xAI acquires X in an all-stock transaction valuing X at $33B ($45B less $12B debt) and xAI at roughly $80B, forming a combined ~$113B entity (X.AI Holdings Corp). The merger folds X's real-time data and distribution into xAI — the structural basis for live X-search as a billable agentic tool. (Source: CNBC, 2025-03-28.)
Grok 3 family announced; API access expands
xAI unveils Grok 3 and Grok 3 mini (publicly released on the API in June 2025), adding a cheaper mini tier alongside the flagship. Grok 3 is priced at $3 / $15 per 1M tokens and Grok 3 mini at $0.30 / $0.50 — the first time xAI offers a low-cost model rung on the API.
Grok API public beta launches (grok-beta at $5 / $15)
xAI opens the Grok developer API in public beta with a preview model, grok-beta (128k context, function calling, system prompts), billed at $5 per 1M input tokens and $15 per 1M output tokens — establishing per-token metering as the core API primitive. Every developer gets $25/month in free credits through end of 2024. (Source: TechCrunch, InfoQ, 2024-10/11.)
- · xAI's Grok API per-token price has fallen dramatically: from $5 / $15 per 1M tokens at the Oct 2024 grok-beta launch to $1.25 / $2.50 on grok-4.3 by mid-2026 — a roughly 75% cut on input.
- · Live X (Twitter) search is a billable agentic tool ($5 per 1,000 calls), a data moat made possible by xAI's March 2025 all-stock acquisition of X that valued the social platform at $33 billion.
- · Cached input on grok-4.3 costs $0.20 per 1M tokens — about 85% cheaper than the $1.25 cache-miss rate — so prompt-heavy, repeat-context workloads pay a fraction of the headline price.
Questions & answers
- What is xAI's pricing model?
- xAI bills the Grok developer API per million tokens — grok-4.3 at $1.25 input / $2.50 output, cached input $0.20 — with agentic tools (web search, X search, code execution) metered separately at $5 per 1,000 calls. A separate consumer Grok app is freemium ($0 Free, $30/mo SuperGrok).
- How much does the Grok API cost per million tokens?
- The current flagship grok-4.3 is $1.25 per 1M input tokens and $2.50 per 1M output tokens, with cached input at $0.20/M. The grok-build-0.1 coding model is $1.00 in / $2.00 out. Media and voice are priced separately (e.g. images from $0.02, TTS $15 per 1M characters).
- Does xAI offer a free tier?
- Yes, on two surfaces. The consumer Grok app has a Free plan at $0/month with real-time web and X search, voice mode, and connectors. On the API side, xAI ran $25/month in free credits during the 2024 public beta and continues to offer promotional credits.
- How does Grok API pricing compare to OpenAI and Anthropic?
- At $1.25 in / $2.50 out, grok-4.3 undercuts most frontier flagships on output tokens. Grok 4 launched in July 2025 at $3 / $15 per 1M tokens; xAI then cut per-token rates dramatically, positioning Grok as a price-aggressive frontier option with a 1M-token context window.
- What does xAI charge for agentic tools and live search?
- Agentic tools are billed per 1,000 calls, separate from tokens: Web Search, X Search, and Code Execution at $5 / 1k calls each; Collections Search at $2.50 / 1k; File Attachments at $10 / 1k. Live X (Twitter) search is a distinctive data advantage following the xAI–X merger.
- How much is SuperGrok?
- The consumer SuperGrok subscription is $30/month, unlocking the Grok 4 model, higher rate limits, Expert mode, and image and video generation. xAI also sells SuperGrok Heavy, Business, and Enterprise tiers above it. The developer API is billed and packaged separately.