Skip to content

Companion — getting started

The house the son comes home to. Open the door, pair an engine, start talking.

Companion is the chat client. It ships no model of its own — you pair it to an engine (a local cluster, a single Mac, or a cloud key) and it gives you a consistent window with history, memory, projects, skills and MCP on top.

Companion is installed for you by the OdyssAI Configurator (profile Serveur) — see Install the stack. After that it’s serving on http://<server-ip>:3100.

Only have one Mac and Telemak? You still need Companion for a chat window — run the Configurator’s Serveur profile on that same Mac (or any Mac on the LAN); it brings up Companion without requiring a cluster.

Open http://<server-ip>:3100. On a fresh install the first screen is Create your operator account — pick a name, email and password. That account is the admin; change the default password on first login.

Settings → Infrastructure → Engine.

You haveAdd this endpoint
An OdyssAI-X clusterhttp://<server-ip>:8000
A Telemak Machttp://<telemak-ip>:8003
A cloud keyAdd Cloud Provider → paste OpenRouter / Anthropic / OpenAI key
Ollama / LM Studio / vLLMtheir local OpenAI endpoint

Click Test endpoint. Companion reads the engine’s capability contract (/.well-known/inference-engine.json), learns which models support tools, vision and thinking, and loads the catalog. You can pair several engines at once — Companion keeps a separate catalog per engine and lets you route per message.

Open a new chat, pick a model from the model picker (top of the conversation), and type. You get the full conversation surface: edit / regenerate, attachments, voice, code-block helpers.

The footer under each answer (the StatsRow) shows TTFT, tokens/s, token counts, cache hits, and the model that actually answered — handy when you route through CoeOS, which shows e.g. CoeOS · python — MiniMax-M3.

Companion remembers across conversations through Némo, a per-user knowledge graph. Toggle it in Settings → Memory, or per-project. The agent can append to it on its own (companion_remember), and you can curate it by hand. Per-project memory keeps a project’s facts scoped to that project. See Memory.

  • Organise chats into Projects that share a system prompt and memory.
  • Extend the agent with Skills and MCP servers (Notion, Linear, GitHub, Tavily, your own).
  • Turn Companion into a brain for an external coding agent with Agents tokens — Cline, Continue.dev, Claude Desktop and Cowork can call back into Companion’s memory and tools.

The assistant inside Companion calls itself Némo. Némo is not a model — it’s the persona and orchestrator above whatever model you’ve routed: the memory, the relationship, the editor’s voice. The model is the substrate; Némo is what the substrate says when you address it.

  • The chat window — anatomy of a conversation.
  • Model picker — choosing models, semantic routing, Easy / Advanced / Expert modes.
  • Engine pairing — gateway / hybrid / legacy rails, multiple engines.
  • Memory — Némo, per-project memory, what gets injected.
  • Agents tokens — make Companion an MCP brain for your IDE.