Overview
SudoDog is a free benchmark and monitoring platform for AI agents. Test your agents, get a score, compare on the leaderboard, and optionally upgrade to continuous monitoring.
- Benchmark (Free) - Run sudodog-benchmark to test your agent and get a score from 1-100 with recommendations
- Dashboard (Free Beta) - Continuous monitoring, security alerts, cost tracking for teams
Quick Start
Installation
Install the SudoDog CLI from PyPI. Works on Linux, macOS, and Windows.
Run Your First Benchmark
Start your AI agent, then run the benchmark tool. It auto-detects running agents.
The benchmark will:
- Scan for running AI agents on your machine
- Let you select which agent to test
- Monitor it for 30 seconds
- Analyze with AI and give you a score
- Provide recommendations and a shareable report link
Benchmark Tool
The benchmark tool tests your AI agent and provides a score from 1-100 with a letter grade (A+ to F).
What It Measures
How efficiently your agent uses CPU resources
Peak and average memory consumption
Number and pattern of API calls made
Consistency and quality of responses
How well your agent handles errors
Overall resource utilization efficiency
What You Get
1-100 score with letter grade (A+ to F)
Shareable link with full analysis
AI-powered suggestions for improvement
Compare against other agents
Embed in your GitHub README
Badge for Your README
After running a benchmark, you get a badge you can embed in your GitHub README:
sudodog-benchmark
The benchmark command tests and scores your AI agents.
Usage
Options
| Option | Description |
|---|---|
--version | Show version number |
--json | Output results as JSON |
--no-submit | Run locally without submitting to API |
--duration N | Monitoring duration in seconds (default: 30) |
Examples
sudodog-scan
Scan your machine for running AI agents. Detects LangChain, AutoGPT, CrewAI, Claude Code, and custom agents.
Usage
Options
| Option | Description |
|---|---|
--json | Output as JSON |
--watch | Continuously watch for new agents |
--interval N | Watch interval in seconds (default: 5) |
--quiet | Suppress non-essential output |
--api-key KEY | API key for dashboard reporting |
Examples
sudodog run
Run an agent with continuous monitoring. Connects to the dashboard for real-time visibility.
Usage
Options
| Option | Description |
|---|---|
--name NAME | Display name for the agent in dashboard |
--agent-id ID | Custom agent identifier |
--docker | Run inside a Docker container |
--cpu-limit N | CPU limit (e.g., 1.0 = 1 core) |
--memory-limit N | Memory limit (e.g., 512m, 1g) |
--no-network | Disable network access |
--guardrails POLICY | Apply guardrail policy |
Examples
Connecting to Dashboard
To see your agent in the dashboard, first initialize with your API key:
sudodog integrate
Integrate SudoDog with other tools like Claude Code or Docker.
Claude Code Integration
Add SudoDog as a hook to Claude Code for automatic monitoring:
This adds SudoDog hooks to your Claude Code configuration, enabling observability without changing your workflow.
Docker Integration
Get configuration for Docker-based agent monitoring:
This outputs MCP configuration for adding SudoDog to your Docker setup.
sudodog config
View and modify SudoDog configuration.
Usage
Options
| Option | Description |
|---|---|
--show | Show current configuration |
--set KEY=VALUE | Set a configuration value |
Examples
Adding Agents to Dashboard
There are two ways to add agents to the SudoDog dashboard for continuous monitoring.
Method 1: Auto-Detection (Recommended)
The dashboard can detect agents running on machines where you've installed the CLI:
- Install the CLI:
pip install sudodog - Run the scanner with your API key:
sudodog-scan --api-key YOUR_KEY - Go to "Add Agent" in the dashboard
- Click "Add to Dashboard" on any detected agent
Method 2: Manual Setup
Run agents with the sudodog run command:
Getting Your API Key
1. Log in to the dashboard at dashboard.sudodog.com
2. Go to "Add Agent" page
3. Click "Create API Key"
4. Copy and store your key securely
Mission Control
Mission Control is the central monitoring hub for all your AI agents. It displays real-time metrics including:
Total number of running AI agents
Real-time cost tracking across all providers
Active security incidents requiring attention
Overall system health indicator (0-100)
Security Center
Monitor and protect your AI agents from security threats. The Security Center detects:
Malicious prompts attempting to manipulate agent behavior
Unauthorized data transfer attempts
Access attempts from unknown sources
Unusual patterns that deviate from normal operation
Cost Intelligence
Track and optimize your AI spending across all providers and models.
See exactly how much each agent costs
Compare costs across OpenAI, Anthropic, AWS, etc.
Get notified when approaching limits
AI-powered recommendations to reduce costs
Guardrails
Guardrails let you define security policies that block dangerous operations before they execute.
Key Features
Block access to sensitive files and directories
Prevent connections to unauthorized domains
Stop dangerous shell commands before execution
Redact PII and secrets from agent output
CLI Usage
API Reference
Benchmark API (Public)
| Method | Endpoint | Description |
|---|---|---|
| POST | /api/v1/benchmark | Submit benchmark results |
| GET | /api/v1/benchmark/{id} | Get benchmark report |
| GET | /api/v1/benchmark/leaderboard | Get public leaderboard |
| GET | /api/v1/benchmark/frameworks | Get framework statistics |
| GET | /api/v1/badge/{id}.svg | Get badge image |
Dashboard API (Authenticated)
| Method | Endpoint | Description |
|---|---|---|
| GET | /api/v1/agents | List all agents |
| POST | /api/v1/telemetry | Submit telemetry data |
| GET | /api/v1/costs/summary | Get cost summary |
| GET | /api/v1/security/events | List security events |
| GET | /api/v1/guardrails/policies | List guardrail policies |
| GET | /api/v1/agents/shadow/list | List detected shadow agents |
| POST | /api/v1/agents/shadow/{id}/wrap | Add shadow agent to dashboard |
Supported Frameworks
SudoDog works with any AI agent framework. Here are some we automatically detect:
System Requirements
CLI
- Python 3.8+
- pip package manager
- Linux, macOS, or Windows
Dashboard
- Modern web browser
- Chrome 90+
- Firefox 88+
- Safari 14+
Self-Hosted
- Docker 20+
- PostgreSQL 13+
- 4GB RAM minimum
Support
Need help? We're here for you.
Email: support@sudodog.com
GitHub: github.com/SudoDog-official/sudodog
SudoDog Docs