Sharpens 9 companies · First observed July 2025 · Updated July 2026 Explore in the graph

Per-action pricing for agent tools

Quick answer

Search- and agent-native vendors now bill discrete tool actions — a web search, a code run, a research task — as their own unit, on top of (or instead of) tokens. Six corpus companies do it, including a general-model API (Anthropic) and browser-agent infra (Browserbase); the billable unit is shifting from the token to the action.

6 vendors bill agent tool calls per action

What's happening — and why

What's happening: when an AI agent uses a tool — searches the web, opens a page, runs code, kicks off deep research — some vendors now charge for that action directly, per call, separate from the tokens the model generates.

Why: agent workloads make tool calls a real, attributable cost (an API hit, compute, a third-party fee) that doesn't map cleanly to tokens. Pricing the action lets vendors recover it and signal it to buyers. It's early — mostly search/agent-native APIs — but it points at a future where the unit of value is the task an agent completes, not the text it emits.

How it works

Tools the agent calls — search, code, research — are each metered as their own billable action.

Evidence over time

10 supporting · 4 counter — hover or tap a point for detail, click to jump to the row.

supporting evidence counterexample

Evidence

Company	Date	What happened
Groq	Jan 2026	Built-in tools priced per use — web search $5–$8/1K, website visits $1/1K, code execution $0.18/hour, billed on top of token rates.
Perplexity AI	Jan 2026	API restructured around a Search API plus an Agentic Research tier — research priced per task, above raw per-call search.
You.com	Mar 2026	Research API launched with effort tiers (lite $6.50 → exhaustive $300) — agentic research billed per call by effort, separate from token rates.
Exa	Apr 2026	Per-endpoint cards with a dedicated Agent endpoint; base Search at $7/1k, Deep Search $15/1k — each action billed per call.
Anthropic	May 2026	Web search tool billed $10 per 1,000 searches and code execution at $0.05/hour per container — discrete per-action fees on top of tokens, on a general-model API.
Browserbase	Mar 2026	Fetch API launched at $1/$4/$7 per 1k calls, adding per-call Search/Fetch metering on top of browser-hours — action pricing on browser-agent infra.
Composio	Jul 2025	Core meter renamed to 'tool calls' (after 'executions' 2024 and 'API calls' Jan 2025); premium tools (search APIs, E2B sandboxes, ML inference) bill at exactly 3x the standard per-call rate — $0.897 vs $0.299 per 1K overage calls on the $29 plan.
Zapier	Jun 2026	Meters Agents by 'activities' and Chatbots by bot count, on top of platform tasks — agentic actions as a separate billable dimension from workflow steps.
Vapi	Jun 2026	Voice agent platform bills $0.05/min hosting plus per-tool passthrough (STT, LLM, TTS, telephony at cost) — a multi-action billing model where each voice agent call spans multiple tool invocations priced separately.
Anthropic	Jun 2026	Claude Managed Agents bill $0.08 per session-hour of runtime (millisecond-metered, running-only) on top of tokens — a session-time meter for agent workloads, replacing the Code Execution container-hour model. Adds a wall-clock dimension to the per-action billing already in place ($10/1k web searches, code execution per container).

Counterexamples

OpenAI · May 2026 — Built-in tools fold into token usage rather than a discrete per-call line item.
Google · Apr 2026 — Gemini grounding/tool use is billed within token and request usage, not as a separate per-action SKU.
Groq · Jul 2026 — Added a Browser Automation built-in tool at $0.08/hour, joining its existing per-use tool card — web search ($5-$8 per 1,000 requests), website visits ($1 per 1,000), and code execution ($0.18/hour). Discrete agent tools metered by action or by hour on an LPU inference API.
Google · Jul 2026 — AlphaEvolve on Vertex AI prices the agent as the base Gemini model rate PLUS an explicit 2x agent surcharge (3x all-in — e.g. Gemini 3.1 Pro $6/$36 vs $2/$12 base). The agent premium as a MULTIPLE of the underlying token rate rather than a flat per-action fee — a token-scaled agent charge, distinct from Google's own grounding/tool use which stays folded into token/request usage.

Trivia

Composio's billable unit changed identity three times in roughly twelve months — 'executions' (2024), 'API calls' plus authenticated users (January 2025), then 'tool calls' (July 2025) — and the final form prices premium tool calls at exactly 3.0x the standard rate ($0.897 vs $0.299 per 1K), the corpus's first premium-vs-standard tiering *inside* the action unit itself.
Groq was the first corpus vendor to price discrete agent tool calls as their own line item (January 2026), breaking the convention that web search was just a model capability bundled into token usage — its $5-$8 per 1,000 searches made the action unit legible as a cost driver separate from the conversation tokens around it.
You.com's effort-tier Research API (lite $6.50 → exhaustive $300 per call, launched March 2026) produces the widest single-parameter cost range in the corpus: a 46× spread from a single configuration knob. No other billing dimension in the corpus moves cost by that ratio within one API endpoint, making "effort" the most financially consequential agent parameter the corpus has documented.
Anthropic's May 2026 decision to price web search at $10 per 1,000 calls and code execution by the container-hour on its general-model API — rather than folding tool use into tokens as OpenAI and Google do — was the decisive event that graduated per-action pricing from a search-native curiosity to an established billing unit, because it meant a frontier general-purpose API now had discrete per-action SKUs.

See all pricing trivia

For buyers

Where tools are billed per action, forecast on the agent's plan — how many searches, page visits and code runs per task — not just token volume. A verbose agent can rack up tool fees that dwarf its token cost; cap tool calls per run where the vendor allows it.

For vendors

To price actions you need per-tool metering and a clear line item for each (search, visit, execution, research), plus a free monthly allotment to ease adoption. Make the per-call price legible so buyers can model an agent run end to end.

Outlook — what to watch

Anthropic has already moved tool use out of tokens into explicit per-action fees (web search $10/1k, code execution per hour); watch whether OpenAI and Google follow — if they do, per-action becomes the default. The natural endpoint is pricing the completed task, which edges into outcome-based pricing.

Bottom line

Per-action tool pricing has graduated from emerging to established: six vendors — search-native plus a general-model API (Anthropic) and browser infra (Browserbase) — now meter tool calls directly. The action is overtaking the token as the unit that matters.

FAQ

What is per-action (agent-tool) pricing?

Charging for each tool an AI agent invokes — a web search, a page visit, a code execution, a research task — as its own billed unit, separate from the tokens the model generates.

Which vendors price agent tools per action?

In the corpus: Groq (built-in web search / visits / code execution), Perplexity (Agentic Research tier), You.com (Research API effort tiers), Exa (per-endpoint + Agent), plus Anthropic (web search $10/1k, code execution per hour) and Browserbase (Fetch/Search per 1k). OpenAI and Google still fold tool use into tokens.

How do I budget for an agent that uses tools?

Estimate tool calls per task (searches, visits, runs), not just tokens — tool fees can exceed token cost for search-heavy agents. Use any per-run caps or free allotments the vendor offers.

All trends