Documentation

Reasoning Models in the NexoRouter documentation.

Reasoning Models

Status: Use model-specific testing.

Reasoning models can be slower and more expensive than standard chat models. They also vary in how clients expose reasoning controls.

First test

Use a small prompt and a longer timeout:

Explain the tradeoffs of retrying a failed API request in three bullets.

Client timeout

Start with at least 60 seconds for ordinary chat models. For slower reasoning-style models, allow up to 180 seconds before adding retries.

Cost control

  • Use a key budget for experiments.
  • Keep prompts short.
  • Avoid running agent loops until the text response path is verified.
  • Watch Usage Logs for token usage and cost.
Reasoning Models — NexoRouter