Documentation

FAQ: Why did this model cost more than expected? in the NexoRouter documentation.

FAQ: Why did this model cost more than expected?

Model cost depends on the selected model, input tokens, output tokens, and the request pattern. Usage Logs are the source of truth for request-level cost.

Check in this order

  1. Confirm the exact model ID in Usage Logs.
  2. Check prompt and completion tokens.
  3. Check whether the app retried automatically.
  4. Check whether an agent loop made multiple requests.
  5. Check whether max_tokens allowed a long completion.
  6. Compare the model against Pricing.

Common causes

CauseFix
More output tokens than expectedSet a smaller max_tokens during testing.
Agent loop or tool retriesUse a low-budget key for agents and inspect Usage Logs.
Expensive model selected by a toolRestrict the key model scope or configure one model explicitly.
Automatic fallback chainStart with one model only until cost is understood.
Long context promptTrim input, summarize history, or choose a model intentionally.

Cost-control setup

EnvironmentRecommended key setup
Local experimentLow budget, one or a few low-cost models.
StagingSeparate key, realistic model scope, moderate budget.
ProductionSeparate key, approved models only, monitored spend.
FAQ: Why did this model cost more than expected? — NexoRouter