Pick one workflow
A coding agent, browser agent, support agent, research agent, or internal automation that already runs repeatedly.
CAUM Live gives teams a private-content-safe meter for loops, retry pressure, stagnation, exact cycles, token exposure, and cost exposure while agents run.
The point is not to prove a universal waste rate. The point is to prove that CAUM can observe one recurring workflow, keep private content out, and surface reviewable structural evidence when the workflow starts looping or retrying.
A coding agent, browser agent, support agent, research agent, or internal automation that already runs repeatedly.
Tool family, phase, status, state hash, timestamps, token counters, cost counters, and latency. No raw content.
CAUM returns structural tiers, public signals, hard-alert evidence, budget exposure percentages, and hash-linked receipts.
Event-level counters can be tiny. A buyer cares about the recurring agent budget. CAUM should frame the pilot as reviewable structural exposure against monthly agent spend, then let the customer prove any realized savings with their own controls.
5% reviewable exposure is about $28/month or $337/year. This is a scenario, not guaranteed savings.
5% reviewable exposure is about $500/month or $6,000/year. The customer decides which controls to apply.
5% reviewable exposure is about $2,500/month or $30,000/year. CAUM supplies evidence, not a savings guarantee.
The integration should be boring. CAUM does not need prompts, model outputs, code, files, or business data to measure structural movement.
# Minimal CAUM Live event session.event(structural_event( event="tool_call", tool="browser", phase="retry", status="error", state_id="local_hash_bucket_17", input_tokens=420, output_tokens=80, cost_usd=task_cost_counter, latency_ms=1200, ))
State IDs should be generated locally. Do not send raw tool arguments, prompts, completions, source files, customer messages, secrets, or PHI/PII.
A good pilot is not one where CAUM flags everything. A good pilot shows that normal work stays understandable, controlled retry loops are visible, and review-only signals are not sold as confirmed waste.
CAUM Live is for teams already spending recurring money on agents. PDF Receipt remains the low-friction entry point; Live is the product when a workflow repeats often enough that loop/retry exposure matters.
Run a PDF Receipt on one historical trace to prove the structural readout is understandable.
Instrument one recurring workflow with CAUM Live and review hard alerts, cost counters, and receipts.
If the same pattern repeats, monitor more workflows and connect customer-owned review gates or retry ceilings.
Bring one agent trace or one running workflow. CAUM will observe structure only, produce receipts, and keep the claim boundary intact.