Cost & Usage

The Cost & Usage report provides visibility into the resources consumed by Agent Assist. Use it to understand spending patterns, attribute costs to individual agents and features, and forecast future usage.

KPI Cards

The top-level summary cards show period totals:

| Metric | Description |
| --- | --- |
| Search Queries | Total number of knowledge search queries triggered |
| LLM Tokens | Total input + output tokens consumed. The card shows the input/output breakdown (e.g., "45.0K in / 4.4K out"). If prompt caching is enabled, cached tokens are shown separately with a savings indicator (e.g., "21.4K cached (90% savings)"). |
| STT Minutes | Total minutes of audio processed by the speech-to-text provider. Voice calls only; zero for chat-only deployments. |
| Real-time Events | Total messages published for real-time event delivery |
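As an illustration of the LLM Tokens card label, the following minimal sketch formats raw token counts into the abbreviated form used in the example above. The function names and the "M" suffix for millions are assumptions for illustration, not the product's actual implementation.

```python
def format_tokens(count: int) -> str:
    """Abbreviate a token count, e.g. 45000 -> '45.0K'.

    The 'M' suffix for millions is an assumption; the documentation
    only shows the 'K' form.
    """
    if count >= 1_000_000:
        return f"{count / 1_000_000:.1f}M"
    if count >= 1_000:
        return f"{count / 1_000:.1f}K"
    return str(count)


def token_card_label(input_tokens: int, output_tokens: int) -> str:
    """Build the input/output breakdown string shown on the card."""
    return f"{format_tokens(input_tokens)} in / {format_tokens(output_tokens)} out"


print(token_card_label(45_000, 4_400))  # prints "45.0K in / 4.4K out"
```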

Estimated Total Cost

A banner card shows the total estimated cost across all services (LLM, STT, events) for the selected period. If a budget threshold is configured, it is displayed alongside the total, with a visual indicator when spending exceeds the limit.

Token Usage by Feature

A stacked bar chart breaks down token consumption by feature and direction (input vs output):

| Feature | Description |
| --- | --- |
| Classifier | Tokens used by the LLM classifier to evaluate each utterance |
| Answer Generation | Tokens used for knowledge search context assembly and answer streaming |
| Summary | Tokens used for conversation summarization |
| Analysis | Tokens used for conversation quality analysis |
| Coaching | Tokens used for coaching tip generation (only when coaching is enabled) |

Cost Breakdown

A cost breakdown by service category shows dollar amounts and percentages:

| Category | Description |
| --- | --- |
| LLM | Cost of all LLM token consumption |
| STT | Cost of speech-to-text processing minutes |
| RAG | Estimated cost of knowledge search queries |
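One way such a breakdown could be derived is sketched below: each category's cost is usage multiplied by a configured unit price, and its percentage is its share of the sum. The `Pricing` fields and every rate in them are made-up placeholders, and the formula itself is an assumption rather than the report's documented calculation.

```python
from dataclasses import dataclass


@dataclass
class Pricing:
    # Hypothetical placeholder rates, not real provider prices.
    usd_per_1k_input_tokens: float = 0.003
    usd_per_1k_output_tokens: float = 0.015
    usd_per_stt_minute: float = 0.02
    usd_per_search_query: float = 0.001


def cost_breakdown(input_tokens, output_tokens, stt_minutes, search_queries, pricing):
    """Return {category: (usd_amount, percent_of_total)} for LLM, STT, and RAG."""
    costs = {
        "LLM": input_tokens / 1000 * pricing.usd_per_1k_input_tokens
        + output_tokens / 1000 * pricing.usd_per_1k_output_tokens,
        "STT": stt_minutes * pricing.usd_per_stt_minute,
        "RAG": search_queries * pricing.usd_per_search_query,
    }
    total = sum(costs.values())
    # Guard against division by zero when there is no usage in the period.
    return {
        category: (amount, amount / total * 100 if total else 0.0)
        for category, amount in costs.items()
    }


breakdown = cost_breakdown(45_000, 4_400, 100, 500, Pricing())
for category, (amount, percent) in breakdown.items():
    print(f"{category}: ${amount:.2f} ({percent:.1f}%)")
```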

Token Efficiency

Compares accepted vs rejected suggestions to identify wasted tokens:

| Metric | Description |
| --- | --- |
| Accepted count + avg latency | Suggestions the agent found helpful |
| Rejected count + avg latency | Suggestions the agent rejected. High-latency rejected suggestions represent wasted tokens. |
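The two metrics can be reproduced from per-suggestion records, as in the sketch below. The `Suggestion` record shape and the sample values are hypothetical; the report's underlying data model is not documented here.

```python
from statistics import mean
from typing import NamedTuple, Optional


class Suggestion(NamedTuple):
    accepted: bool      # True if the agent accepted the suggestion
    latency_ms: float   # time taken to produce the suggestion


def summarize(suggestions, accepted: bool) -> tuple[int, Optional[float]]:
    """Return (count, average latency in ms) for one outcome.

    The average is None when there are no suggestions with that outcome.
    """
    latencies = [s.latency_ms for s in suggestions if s.accepted == accepted]
    return len(latencies), (mean(latencies) if latencies else None)


# Hypothetical sample data.
sample = [
    Suggestion(True, 400.0),
    Suggestion(True, 600.0),
    Suggestion(False, 1500.0),
    Suggestion(False, 900.0),
    Suggestion(True, 500.0),
]

print(summarize(sample, accepted=True))   # (3, 500.0)
print(summarize(sample, accepted=False))  # (2, 1200.0)
```

In this sample, rejected suggestions take more than twice as long on average as accepted ones, which is the pattern the card is meant to surface.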

Burn Rate Projection

Projects the daily average and estimated monthly cost from current usage patterns. The projection is highlighted in red if the projected monthly cost exceeds the configured budget threshold.
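The exact projection method is not documented here. A simple linear version, assuming month-to-date cost is extrapolated at a constant daily rate, might look like the following sketch (all function names are hypothetical):

```python
import calendar
from datetime import date
from typing import Optional


def project_monthly_cost(cost_to_date: float, as_of: date) -> tuple[float, float]:
    """Return (daily_average, projected_monthly_cost).

    Assumes cost_to_date covers day 1 of the month through as_of, inclusive,
    and that spending continues at the same daily rate.
    """
    days_elapsed = as_of.day
    days_in_month = calendar.monthrange(as_of.year, as_of.month)[1]
    daily_average = cost_to_date / days_elapsed
    return daily_average, daily_average * days_in_month


def exceeds_budget(projected: float, threshold: Optional[float]) -> bool:
    """True when a threshold is configured and the projection is above it."""
    return threshold is not None and projected > threshold


daily, monthly = project_monthly_cost(120.0, date(2025, 6, 10))
print(daily, monthly)                 # 12.0 360.0
print(exceeds_budget(monthly, 300.0))  # True
```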

Daily Usage Trend

A daily bar chart shows knowledge search query volume over the selected period. Hover over a bar to see the exact date and count. Use this to spot usage spikes tied to specific events or campaigns.

Per-Agent Usage

A table shows resource consumption by individual agent:

| Column | Description |
| --- | --- |
| Agent | Agent display name |
| Search Queries | Number of knowledge search queries triggered |
| LLM Tokens | Total tokens consumed |
| STT Minutes | Audio processing minutes (voice only) |
| Est. Cost | Calculated cost based on usage and pricing |
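A per-agent table like this amounts to grouping usage records by agent and summing each counter. The sketch below assumes a hypothetical record format (a dict per conversation); Est. Cost is left out because it depends on the configured pricing.

```python
from collections import defaultdict


def usage_by_agent(records):
    """Sum search queries, LLM tokens, and STT minutes per agent."""
    totals = defaultdict(
        lambda: {"search_queries": 0, "llm_tokens": 0, "stt_minutes": 0.0}
    )
    for record in records:
        row = totals[record["agent"]]
        # Missing fields count as zero (e.g. no STT minutes for chat).
        row["search_queries"] += record.get("search_queries", 0)
        row["llm_tokens"] += record.get("llm_tokens", 0)
        row["stt_minutes"] += record.get("stt_minutes", 0.0)
    return dict(totals)


# Hypothetical sample records.
records = [
    {"agent": "Alice", "search_queries": 3, "llm_tokens": 1200, "stt_minutes": 4.5},
    {"agent": "Bob", "search_queries": 1, "llm_tokens": 800},
    {"agent": "Alice", "search_queries": 2, "llm_tokens": 900, "stt_minutes": 2.0},
]

for agent, row in usage_by_agent(records).items():
    print(agent, row)
```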

Filters

| Filter | Description |
| --- | --- |
| Queue | Filter all metrics to a specific CCaaS queue |
| Date range | Custom start and end date pickers |

Budget Alert Threshold

Set a monthly spending threshold. When the estimated cost exceeds this amount, the system alerts you. Configure the threshold directly on this page or in Tenant Configuration.

WARNING

Cost data reflects estimates based on token counts and configured pricing. Refer to your LLM provider invoices for actual billed amounts.

OmniBots Agent Assist