Claude Code Integration

CROAK integrates natively with Claude Code through slash commands. This is the recommended way to use CROAK for an interactive, guided experience.

Setup

  1. Initialize your project — this automatically sets up Claude Code integration:

    npx croak-cv init
  2. Open your project in Claude Code (VS Code with Claude extension, or Claude Code CLI)

  3. Start with the Router — type /croak-router to get guidance on next steps

When you run croak init, CROAK creates:

  • .claude/skills/croak-*/SKILL.md — Skill files for each agent
  • CLAUDE.md — Project context file that Claude Code reads automatically

Claude Code discovers these files and makes them available as slash commands.

Agent Commands

Each command activates a specialized AI persona with domain expertise, guardrails, and a knowledge base.

| Command | Agent | What It Does |
| --- | --- | --- |
| /croak-router | Dispatcher | Start here. Pipeline coordinator that guides you through the workflow |
| /croak-data | Scout | Scan directories, validate images, manage vfrog SSAT or classic annotations |
| /croak-training | Coach | Configure training across local GPU, Modal, or vfrog platform |
| /croak-evaluation | Judge | Evaluate models, analyze errors, generate reports |
| /croak-deployment | Shipper | Deploy to vfrog inference, Modal serverless, or edge devices |

Workflow Commands

End-to-end pipelines that chain multiple agent steps together.

| Command | Description |
| --- | --- |
| /croak-data-preparation | Full data pipeline: scan, validate, annotate, split, export |
| /croak-model-training | Training pipeline: recommend, configure, execute, handoff |
| /croak-model-evaluation | Evaluation pipeline: evaluate, analyze, diagnose, report |
| /croak-model-deployment | Deployment pipeline: export, optimize, deploy, verify |

Example Session

You: /croak-router

Claude: Dispatcher here! I see this is a new CROAK project.
Current stage: uninitialized

Let me help you get started. Do you have images ready to train on?

You: Yes, I have 500 product images in ~/photos/products

Claude: Great! Let me hand you off to Scout (Data Agent) to scan
and validate them.

You: /croak-data

Claude: Scout reporting for duty! I'll help you prepare your dataset.
Let me scan ~/photos/products...
[Runs: croak scan ~/photos/products]

Found 500 images. 487 valid, 13 have issues...

Agent Details

Dispatcher (Router)

The pipeline coordinator. Tracks your progress through the workflow stages, recommends next actions, and routes you to the right specialist agent.

Commands: status, init, reset, next, help, vfrog setup

Scout (Data)

Data quality specialist. Scans directories for images, validates annotations, checks class balance, and manages the annotation workflow — either vfrog SSAT or classic import.

Commands: scan, validate, convert, split, annotate, stats, visualize

Quality guardrails:

  • Minimum 100 images, 50+ per class
  • Maximum 10:1 class imbalance
  • Annotation coverage checks
  • Corrupt image detection
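The guardrail logic amounts to a few threshold checks over per-class image counts. A minimal sketch (thresholds come from the list above; the function name and report format are illustrative, not CROAK's actual API):

```python
from collections import Counter

# Thresholds from Scout's documented guardrails.
MIN_IMAGES = 100
MIN_PER_CLASS = 50
MAX_IMBALANCE = 10  # largest class may be at most 10x the smallest


def check_guardrails(labels):
    """Return a list of guardrail violations for a list of per-image class labels."""
    issues = []
    counts = Counter(labels)
    if len(labels) < MIN_IMAGES:
        issues.append(f"only {len(labels)} images (minimum {MIN_IMAGES})")
    for cls, n in counts.items():
        if n < MIN_PER_CLASS:
            issues.append(f"class '{cls}' has {n} images (minimum {MIN_PER_CLASS})")
    if counts and max(counts.values()) > MAX_IMBALANCE * min(counts.values()):
        issues.append("class imbalance exceeds 10:1")
    return issues


# A dataset that passes the size check but fails per-class count and balance:
print(check_guardrails(["bottle"] * 120 + ["can"] * 8))
```

When Scout reports "13 have issues", it is this kind of check (plus corrupt-file detection) doing the work.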

Coach (Training)

Training specialist. Recommends model architectures based on your dataset, estimates training cost and time, configures hyperparameters, and manages experiment tracking via MLflow or Weights & Biases.

Supported architectures: YOLOv8, YOLOv11, RT-DETR

Supported providers: Local GPU, Modal.com, vfrog platform
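Coach's cost and time estimates are back-of-envelope arithmetic over dataset size, epochs, and provider pricing. A sketch of that calculation (the throughput and price figures below are made-up placeholders, not CROAK or provider values):

```python
def estimate_training(num_images, epochs, imgs_per_sec, usd_per_gpu_hour):
    """Estimate GPU-hours and cost for a training run.

    imgs_per_sec and usd_per_gpu_hour are placeholder inputs; real values
    depend on the architecture, image size, and chosen provider.
    """
    hours = (num_images * epochs) / imgs_per_sec / 3600
    return hours, hours * usd_per_gpu_hour


hours, cost = estimate_training(
    num_images=500, epochs=100, imgs_per_sec=40, usd_per_gpu_hour=1.10
)
print(f"~{hours:.2f} GPU-hours, ~${cost:.2f}")
```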

Judge (Evaluation)

Evaluation specialist. Calculates metrics (mAP, precision, recall, F1), performs error analysis, identifies failure patterns, and generates detailed performance reports.

Metrics: mAP@50, mAP@50-95, per-class precision/recall, confusion matrices
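The per-class precision, recall, and F1 figures follow directly from true/false positive and false negative counts. A minimal sketch (real mAP additionally integrates precision over recall across IoU thresholds, which is omitted here):

```python
def prf1(tp, fp, fn):
    """Precision, recall, and F1 from raw detection counts."""
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = (
        2 * precision * recall / (precision + recall)
        if precision + recall
        else 0.0
    )
    return precision, recall, f1


p, r, f1 = prf1(tp=90, fp=10, fn=30)
print(f"precision={p:.2f} recall={r:.2f} f1={f1:.2f}")
```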

Shipper (Deployment)

Deployment specialist. Exports models to optimized formats, handles quantization, and deploys to your target environment.

Export formats: ONNX, TensorRT, CoreML, TFLite, TorchScript

Deploy targets: vfrog inference API, Modal serverless, edge devices

Agent Handoffs

Agents communicate through validated handoff contracts. When one agent completes its work, it produces a structured artifact that the next agent can pick up. For example:

  1. Scout validates the dataset and produces a DatasetArtifact
  2. Coach receives the artifact, trains, and produces a ModelArtifact
  3. Judge evaluates the model and produces an evaluation report
  4. Shipper deploys using the model and evaluation results

Handoff files are stored in .croak/handoffs/ and can be inspected for debugging.
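An artifact of this kind is just structured data one agent serializes and the next deserializes and validates. The sketch below shows the general shape with a hypothetical `DatasetArtifact`; the actual field names and schema used in .croak/handoffs/ may differ, so inspect the JSON files there to see the real contract:

```python
import json
from dataclasses import asdict, dataclass


@dataclass
class DatasetArtifact:
    # Illustrative fields only -- not CROAK's actual schema.
    path: str
    num_images: int
    classes: list
    split: dict  # e.g. {"train": 0.8, "val": 0.1, "test": 0.1}


artifact = DatasetArtifact(
    path="datasets/products",
    num_images=487,
    classes=["bottle", "can"],
    split={"train": 0.8, "val": 0.1, "test": 0.1},
)

# Scout writes the artifact as JSON; Coach reads it back before training.
payload = json.dumps(asdict(artifact))
restored = DatasetArtifact(**json.loads(payload))
print(restored.num_images)
```

Because the handoff is a plain file rather than in-memory state, you can delete or edit it to replay a pipeline stage while debugging.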

Next Steps