LlamaIndex

Status: Candidate chat setup, not yet verified by NexoRouter.

LlamaIndex can use OpenAI-compatible chat models, but many LlamaIndex workflows also need embeddings. Keep chat and embeddings separate until NexoRouter publishes stable embeddings support.

Chat LLM example

import os
from llama_index.llms.openai import OpenAI

llm = OpenAI(
    api_key=os.environ["NEXOROUTER_API_KEY"],
    api_base="https://api.nexorouter.com/v1",
    model="deepseek-v4-flash",
)

response = llm.complete("Write one short production checklist.")
print(response)

Verification path

Run one plain chat completion or completion-style call.
Confirm the request appears in Usage Logs.
Add indexes, retrievers, agents, or tools only after chat is proven.
Use a separate verified embeddings provider if your index requires embeddings.

Boundaries

RAG indexes normally require embeddings.
Agent workflows depend on tool calling and model behavior.
Streaming and structured outputs need separate tests.

LlamaIndex

Chat LLM example

Verification path

Boundaries

Related pages