Installation

Veto supports two integration modes. Choose the one that fits your setup.

Claude Code Plugin

Veto installs as a Claude Code plugin that intercepts tool calls before execution.

Prerequisites

  • Claude Code installed and working
  • Python 3.8+ (uses only stdlib — no pip dependencies)
  • A Veto account with an API key (from Settings → API Keys in the dashboard)

Install the plugin

Install from the Claude Code plugin marketplace:

Linux / macOS:

/plugin marketplace add damhau/veto-claude-plugin
/plugin install veto-linux

Windows:

/plugin marketplace add damhau/veto-claude-plugin
/plugin install veto-windows

Then run the setup command:

/veto:setup

This will prompt you for:

  1. API key — your Veto API key from the dashboard
  2. Fail policy — open (allow on error) or closed (deny on error)

Configuration is saved to ~/.veto/config.json.

Configuration options

  • server_url — Veto server URL (default: https://api.vetoapp.io)
  • api_key — your Veto API key from the dashboard (no default)
  • fail_policy — what to do when the server is unreachable: open (allow) or closed (deny) (default: open)
  • timeout — request timeout in seconds (default: 25)
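
Based on those fields, a saved ~/.veto/config.json might look like this (the exact file shape is an assumption for illustration):

```json
{
  "server_url": "https://api.vetoapp.io",
  "api_key": "<your-api-key>",
  "fail_policy": "open",
  "timeout": 25
}
```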

How the plugin works

The plugin registers a PermissionRequest hook in Claude Code. Every tool call is intercepted before execution and sent to the Veto API for evaluation. The server returns allow, deny, or ask (escalate to human review).
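
The evaluation round-trip described above, including the fail policy, can be sketched in stdlib-only Python. The endpoint path, payload shape, and response field are illustrative assumptions, not the plugin's actual wire format:

```python
import json
import urllib.request
import urllib.error

def evaluate_tool_call(tool_name, tool_input, config):
    """Ask the Veto server for a decision: 'allow', 'deny', or 'ask'.

    The /v1/evaluate path and the request/response shapes are hypothetical;
    only the decision values and the fail-policy fallback come from the docs.
    """
    payload = json.dumps({"tool": tool_name, "input": tool_input}).encode()
    req = urllib.request.Request(
        config["server_url"] + "/v1/evaluate",  # hypothetical endpoint
        data=payload,
        headers={
            "Authorization": "Bearer " + config["api_key"],
            "Content-Type": "application/json",
        },
    )
    try:
        with urllib.request.urlopen(req, timeout=config.get("timeout", 25)) as resp:
            return json.load(resp)["decision"]
    except (urllib.error.URLError, OSError):
        # Server unreachable: apply the configured fail policy
        return "allow" if config.get("fail_policy", "open") == "open" else "deny"
```

Note how fail_policy only matters on the error path: a reachable server always decides, and the fallback applies only when the request cannot complete.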

Verify the installation

/veto:status

This checks the connection to the server and reports the active rule count.


LLM Proxy

Veto includes a built-in LLM proxy powered by LiteLLM. It intercepts tool calls in the LLM response stream before they reach the coding agent, enforcing your rules at the network level. No plugin required — any tool that supports a custom base URL works out of the box.

The proxy is already deployed as part of Veto (both SaaS and self-hosted). You just need to enable it and configure your coding tool.

Step 1 — Enable the proxy

In the Veto dashboard, go to Settings → LLM Proxy and click Enable LLM Proxy.

Choose a mode:

  • Passthrough — users keep their own API keys (e.g. Claude Max); the proxy forwards their credentials to the upstream provider. Best for teams where each developer has their own subscription.
  • BYOK — the org provides LLM API keys; all users share the org's keys via a virtual key. Best for teams with a shared Anthropic/OpenAI account.

You can switch between modes at any time from the dashboard.

Step 2 — Get your proxy key

Passthrough mode: Each user generates a personal proxy key from Settings → LLM Proxy Keys in the dashboard.

BYOK mode: A virtual key is shown once when you enable the proxy. Copy it immediately — it won't be shown again. You can rotate it from the dashboard at any time.

Step 3 — Configure Claude Code

Passthrough mode — users keep their own ANTHROPIC_API_KEY and route through the proxy:

export ANTHROPIC_BASE_URL="https://proxy.vetoapp.io"
export ANTHROPIC_CUSTOM_HEADERS="x-litellm-api-key: Bearer <your-proxy-key>"

BYOK mode — users replace their API key with the virtual key:

export ANTHROPIC_API_KEY="<your-virtual-key>"
export ANTHROPIC_BASE_URL="https://proxy.vetoapp.io"

For self-hosted deployments, replace https://proxy.vetoapp.io with your LiteLLM proxy URL (default: http://localhost:4000).

Configure other tools

The proxy is compatible with any tool that supports a custom base URL:

Aider (Passthrough):

export ANTHROPIC_API_BASE="https://proxy.vetoapp.io"
export ANTHROPIC_EXTRA_HEADERS="x-litellm-api-key: Bearer <your-proxy-key>"
aider --model anthropic/claude-sonnet-4-20250514

Aider (BYOK):

export ANTHROPIC_API_KEY="<your-virtual-key>"
export ANTHROPIC_API_BASE="https://proxy.vetoapp.io"
aider --model anthropic/claude-sonnet-4-20250514

Cursor (BYOK): In Cursor Settings → Models → OpenAI API Key:

  • API Key: <your-virtual-key>
  • Base URL: https://proxy.vetoapp.io

How the proxy works

  1. The coding tool sends a request through the Veto proxy to the LLM provider
  2. The LLM response streams back through the proxy
  3. The guardrail intercepts the stream:
    • Text-only responses — passed through immediately (zero added latency)
    • Tool call responses — buffered, evaluated against your Veto rules, then either forwarded (allowed) or replaced with a denial message (blocked)
  4. Keepalive pings are sent during evaluation to prevent connection timeouts
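
The buffering step can be sketched as a small filter over the response stream. The chunk shapes, the evaluate callback, and the denial text are simplified assumptions, not the guardrail's actual implementation:

```python
def guard_stream(chunks, evaluate):
    """Filter an LLM response stream through a policy check.

    Text chunks pass through immediately; a tool-call chunk is held back,
    evaluated, and either forwarded or replaced with a denial message.
    """
    for chunk in chunks:
        if chunk["type"] == "text":
            yield chunk  # passed through as-is: no added latency for plain text
        elif chunk["type"] == "tool_call":
            if evaluate(chunk["name"], chunk["input"]) == "allow":
                yield chunk  # forward the allowed tool call unchanged
            else:
                # Replace the blocked call with a text chunk the agent can read
                yield {
                    "type": "text",
                    "text": "Tool call '%s' denied by Veto policy." % chunk["name"],
                }
```

Because the generator yields text chunks as soon as they arrive, only tool calls pay the evaluation cost, which is why keepalive pings are needed during that window.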

Self-hosted deployment

See the Architecture page for the full deployment topology. Veto can be deployed via Docker Compose or Kubernetes.

Docker Compose (quickstart)

cp .env.example .env
# Edit .env with your settings
docker-compose up -d

This starts:

  • PostgreSQL — database
  • Redis — caching and sessions
  • Veto Server — FastAPI backend (port 8000)
  • Dashboard — Next.js frontend (port 3000)
  • LiteLLM Proxy — with Veto guardrail (port 4000)