Documentation
LlamaIndex in the NexoRouter documentation.
LlamaIndex
Status: Candidate chat setup, not yet verified by NexoRouter.
LlamaIndex can use OpenAI-compatible chat models, but many LlamaIndex workflows also need embeddings. Keep chat and embeddings separate until NexoRouter publishes stable embeddings support.
Chat LLM example
import os
from llama_index.llms.openai import OpenAI
llm = OpenAI(
api_key=os.environ["NEXOROUTER_API_KEY"],
api_base="https://api.nexorouter.com/v1",
model="deepseek-v4-flash",
)
response = llm.complete("Write one short production checklist.")
print(response)
Verification path
- Run one plain chat completion or completion-style call.
- Confirm the request appears in Usage Logs.
- Add indexes, retrievers, agents, or tools only after chat is proven.
- Use a separate verified embeddings provider if your index requires embeddings.
Boundaries
- RAG indexes normally require embeddings.
- Agent workflows depend on tool calling and model behavior.
- Streaming and structured outputs need separate tests.