Skip to content

All Opilot commands are accessible via the Command Palette (Ctrl+Shift+P / Cmd+Shift+P). Type "Ollama" to filter to extension commands. Most commands are also accessible through the sidebar panels and context menus.

Authentication

Ollama: Manage Ollama Auth Token

Opens a prompt to set a bearer token sent as Authorization: Bearer <token> with every Ollama API request.

Use this when your remote Ollama instance is behind an authentication proxy. Leave blank to clear the token.

Model Lifecycle

Ollama: Start Ollama Model

Starts (warms up) a local model so it is ready to respond without a cold-start delay. Also available via the button on a local model item.

Ollama: Stop Ollama Model

Unloads a running model from memory to free VRAM/RAM. Also available via the button on running model items.

Ollama: Pull Ollama Model

Downloads a model from the Ollama library by name (e.g. llama3.2:3b). Opens a progress notification with live download progress. Available via the button in the panel header or the library context menu.

Ollama: Delete Ollama Model

Permanently removes an installed model and its weights from disk. Prompts for confirmation. Available via the 🗑 context menu on a local model item.

Model Parameters

Ollama: Open Model Settings

Opens the per-model settings webview panel. Also available via the toolbar button in the Local Models view.

Use it to tune model-specific generation options:

  • Temperature
  • Top-P
  • Top-K
  • Context window (num_ctx)
  • Max tokens (num_predict)
  • Thinking toggle (think)
  • Thinking budget (think_budget)

Changes are applied immediately and persisted per model.

Ollama: Refresh Local Ollama Models

Re-fetches the list of locally installed models from the Ollama API. Triggered automatically every ollama.localModelRefreshInterval seconds.

Ollama: Filter Local Models

Opens an input box to filter the local models list by model name. The filter persists until cleared.

Ollama: Clear Local Models Filter

Removes the active filter from the local models list.

Ollama: Show as Flat List / Show as Tree (Local)

Toggles local model grouping between:

  • Flat list — all models in alphabetical order
  • Tree view — models grouped by model family (e.g. llama, qwen, mistral)

The current mode is reflected by the icon in the panel header.

Ollama: Collapse All Local Models

Collapses all expanded model family groups in the local models panel (tree view only).

Ollama: Refresh Ollama Library

Re-fetches the public model list from the Ollama Library API.

Ollama: Filter Library Models

Opens an input box to filter the library model list by name or tag.

Ollama: Clear Library Models Filter

Removes the active filter from the library list.

Ollama: Show as Flat List / Show as Tree (Library)

Toggles library model view between flat list and family-grouped tree.

Ollama: Collapse All Library Models

Collapses all expanded groups in the library panel.

Ollama: Pull Ollama Model (from Library)

Downloads the selected library model tag to your local Ollama instance. Equivalent to ollama pull <model>:<tag> in the terminal.

Ollama: Open Ollama Model Page

Opens the model's page on ollama.com/library in your browser.

Ollama: Login to Ollama Cloud

Opens the Ollama Cloud authentication flow in your browser. Required to access private cloud models.

Ollama: Refresh Ollama Cloud Models

Re-fetches your Ollama Cloud model catalog.

Ollama: Filter Cloud Models

Opens an input box to filter the cloud models list.

Ollama: Clear Cloud Models Filter

Removes the active filter from the cloud models list.

Ollama: Show as Flat List / Show as Tree (Cloud)

Toggles cloud model view between flat list and grouped tree.

Ollama: Collapse All Cloud Models

Collapses all expanded groups in the cloud panel.

Ollama: Run Ollama Cloud Model

Starts the selected cloud model.

Ollama: Stop Ollama Cloud Model

Stops the selected running cloud model.

Ollama: Open Ollama Cloud Model Page

Opens the model's page on the Ollama Cloud web interface.

Ollama: New Modelfile

Opens an interactive wizard to create a new Modelfile, then opens it in the editor. See Modelfile Manager for details.

Ollama: Edit Modelfile

Opens the selected Modelfile in the VS Code editor.

Ollama: Build Model from Modelfile

Runs ollama create for the currently active or selected Modelfile. Progress is shown in a streaming VS Code notification.

Ollama: Open Modelfiles Folder

Reveals the modelfiles directory (ollama.modelfilesPath or ~/.ollama/modelfiles) in your OS file manager.

Ollama: Refresh Modelfiles

Rescans the modelfiles directory and updates the panel.

Diagnostics

Ollama: Check Server Health

Runs an immediate connectivity check against the configured ollama.host and reports whether the server is reachable. The same check is available from the status bar heartbeat item.

Ollama: Dump Performance Snapshot

Outputs a performance timing snapshot to the Opilot output channel. Useful when profiling response latency or troubleshooting provider initialization timing.

Ollama: Refresh Ollama Models

Re-fetches all model lists (local + library + cloud) in one operation.

Released under the MIT License