Dashboard
The Khaos Dashboard is your command center for exploring evaluation runs, comparing agent behavior across versions, and managing projects and team settings. Access it at khaos.exordex.com.
Getting Started
To start using the dashboard, sync your first evaluation run:
# Run evaluation with cloud sync
khaos run <agent-name> --eval quickstart --sync
# Or sync manually after a run
khaos run <agent-name>
khaos synckhaos sync will automatically prompt for login if you're not authenticated.Dashboard Views
The dashboard provides four main views for exploring your agent evaluations:
Projects Index
Your home screen showing all projects in your namespace. Features include:
- Project cards - Quick overview of each project's recent activity
- Search & filter - Find projects by name or tags
- Recent runs - See latest evaluation results at a glance
- Project creation - Create new projects for different agents
Project Detail
Deep dive into a specific project to see:
- Agent overview - All agents discovered from your evaluations
- Run history - All evaluation runs for this project
- Latest impact report - Quick access to your most recent comparison
- Version comparison - Compare any two runs side by side
Run Detail
Explore individual evaluation runs with:
- Score summary - Overall, security, and resilience scores
- LLM trace viewer - Full conversation traces per test case
- MCP telemetry - MCP server interactions and tool calls
- Resilience breakdown - Fault injection results and recovery metrics
- Security findings - Vulnerability details and remediation hints
- Cost analysis - Token usage and estimated costs per case
Comparison View
The heart of Khaos - comparing two runs side by side:
- Four-lens delta - Changes across structural, resilience, security, and functional dimensions
- Output diff - Line-by-line comparison of agent outputs
- Cost projection - Estimated impact at scale
- Regression detection - Automatic flagging of degraded metrics
Project Identity
Khaos uses owner-scoped project identifiers everywhere:
Format: owner_slug/project_slug
Examples:
myteam/customer-support-agent
johndoe/code-assistant
acme-corp/internal-botLLM Trace Viewer
For pack evaluations, the LLM Trace tab provides detailed conversation inspection:
- Per-case traces - See exactly what happened in each test case
- Collapsed identical traces - Quickly spot differences between cases
- Phase filtering - Filter by baseline, resilience, or security phase
- Token breakdown - Prompt and completion tokens per message
- Timing data - TTFT (time to first token) and total duration
# Generate pack runs with LLM traces
khaos run <agent-name> --eval full-eval --synckhaos run <agent-name> --eval full-eval --sync to get comprehensive LLM traces in the dashboard.API Tokens
Generate project-scoped API tokens for CI/CD integration and programmatic access:
- Navigate to your project in the dashboard
- Open the Tokens page from the project sidebar
- Click Generate New Token and select the scopes you need
- Copy the token (it won't be shown again)
# Use token in CI/CD
export KHAOS_API_TOKEN=your-project-token
export KHAOS_PROJECT_SLUG=owner/project
khaos ci my-agent --eval security --syncTeam Settings
On Team plans, manage team members and permissions:
- Invite members - Add team members by email
- Role management - Assign admin, developer, or viewer roles
- Project access - Control which projects each member can access
- Audit log - Track team activity and changes
| Role | View Runs | Sync Runs | Manage Project | Manage Team |
|---|---|---|---|---|
| Viewer | Yes | No | No | No |
| Developer | Yes | Yes | No | No |
| Admin | Yes | Yes | Yes | Yes |
Sharing & Collaboration
All dashboard URLs are shareable. Send a comparison link to your team and they'll see the same impact report, as long as they have access to the project.
- Shareable URLs - Every run and comparison has a stable permalink
- Team visibility - All team members with project access can view all runs and comparisons
- PR reviews - Paste comparison URLs directly into pull request descriptions
URL Structure
Dashboard URLs follow a predictable pattern for easy navigation and sharing:
# Projects list
https://khaos.exordex.com/projects
# Project detail
https://khaos.exordex.com/{owner}/{project}
# Evaluations list
https://khaos.exordex.com/{owner}/{project}/evaluations
# Run detail
https://khaos.exordex.com/{owner}/{project}/evaluations/{run-id}
# Comparison view
https://khaos.exordex.com/{owner}/{project}/evaluations/compare?baseline={run-id-1}&candidate={run-id-2}
# Agents list
https://khaos.exordex.com/{owner}/{project}/agents