> ## Documentation Index > Fetch the complete documentation index at: https://docs.agenticenv.ai/llms.txt > Use this file to discover all available pages before exploring further. # Token Usage > Aggregate prompt, completion, and reasoning token counts across LLM rounds in a run Each LLM completion can report token counts via [`interfaces.LLMUsage`](https://pkg.go.dev/github.com/agenticenv/agent-sdk-go/pkg/interfaces#LLMUsage) on [`LLMResponse.Usage`](https://pkg.go.dev/github.com/agenticenv/agent-sdk-go/pkg/interfaces#LLMResponse). OpenAI, Anthropic, and Gemini clients populate the fields when the provider returns them. ## Usage fields | Field | Description | | -------------------- | ----------------------------------------- | | `PromptTokens` | Input tokens | | `CompletionTokens` | Output tokens | | `TotalTokens` | Sum when reported by provider | | `CachedPromptTokens` | Prompt cache hits (when supported) | | `ReasoningTokens` | Extended thinking tokens (when supported) | ## Run [`AgentRunResult.LLMUsage`](https://pkg.go.dev/github.com/agenticenv/agent-sdk-go/pkg/agent#AgentRunResult) is the **sum across all LLM calls** in that run — including tool rounds. Use it for cost estimates, quotas, and logging: ```go theme={null} result, err := a.Run(ctx, prompt, nil) if err != nil { return err } if result.LLMUsage != nil { fmt.Printf("prompt=%d completion=%d total=%d\n", result.LLMUsage.PromptTokens, result.LLMUsage.CompletionTokens, result.LLMUsage.TotalTokens, ) } ``` ## Stream The same aggregate appears on `LLMUsage` in the `AgentEventTypeRunFinished` event — `Result` is an `*AgentRunResult`: ```go theme={null} for ev := range eventCh { if ev.Type() != agent.AgentEventTypeRunFinished { continue } finished, ok := ev.(*agent.AgentRunFinishedEvent) if !ok || finished.Result == nil || finished.Result.LLMUsage == nil { continue } u := finished.Result.LLMUsage fmt.Printf("total tokens: %d\n", u.TotalTokens) } ``` OpenAI streaming with `include_usage` surfaces totals on `RUN_FINISHED`. Example: [Stream](/examples/stream). ## RunAsync Token usage is on the final `AgentRunAsyncResult` when the run completes — same aggregate as `Run`. ## Per-request usage For per-LLM-call breakdowns, use [`AfterLLMHook`](/features/hooks) and inspect `in.Response.Usage` on each iteration. ## Example SHOW\_LLM\_USAGE footer ## Related ReasoningTokens from extended thinking Usage on RUN\_FINISHED events Provider-specific usage reporting Export usage as metrics