Claude Code Integration¶

Sovara integrates with Claude Code to accelerate agent development. Instead of manually inspecting logs or stepping through debuggers, Claude Code can directly query your agent's dataflow graph, understand what happened, and help you iterate faster.

Sovara x Claude Code

Why Use This Integration?¶

Keep context clean: Agent runs produce verbose logs that quickly pollute Claude's context window. With so-tool, Claude queries only the specific nodes it needs.
Structured access: Claude gets structured JSON data (inputs, outputs, graph topology) rather than parsing raw logs.
Edit and rerun: Claude can programmatically edit an LLM's input or output and trigger a rerun to test hypotheses.

Setup¶

Run the interactive setup command:

so-tool install-skill

This will:

Ask for your project directory (with tab-completion)
Copy the Sovara skill file to .claude/skills/sovara/SKILL.md
Optionally add Bash permissions to .claude/settings.local.json so Claude can run so-tool commands without prompts

After setup, restart Claude Code to load the new skill.

Available Commands¶

Once set up, Claude Code can use these commands:

Record an Agent Run¶

so-tool record agent.py                    # Record and block until complete
so-tool record --timeout 60 agent.py       # With 60s timeout
so-tool record -m module_name              # Run as Python module
so-tool record --run-name "my run" agent.py  # With custom name

Query Session State¶

# List recent experiments
so-tool experiments --range :10

# Get session overview (graph topology with nodes and edges)
so-tool probe <session_id>

# Get full node details
so-tool probe <session_id> --node <node_id>

# Get multiple nodes
so-tool probe <session_id> --nodes <id1,id2,id3>

# Get truncated preview (20 char strings)
so-tool probe <session_id> --node <node_id> --preview

# Filter keys with regex
so-tool probe <session_id> --node <node_id> --key-regex "messages.*content"

# Only show input or output
so-tool probe <session_id> --node <node_id> --input
so-tool probe <session_id> --node <node_id> --output

Edit and Rerun¶

Edit commands use flattened key notation (e.g., messages.0.content) and always create a new run:

# Edit an output key and rerun
so-tool edit-and-rerun <session_id> <node_id> --output <key> <value>

# Edit an input key and rerun
so-tool edit-and-rerun <session_id> <node_id> --input <key> <value>

# With custom run name
so-tool edit-and-rerun <session_id> <node_id> --output <key> <value> --run-name "variant A"

# With timeout
so-tool edit-and-rerun <session_id> <node_id> --output <key> <value> --timeout 60

Examples:

# Change the model's response content
so-tool edit-and-rerun abc-123 node-1 --output "choices.0.message.content" "New response text"

# Modify a prompt message
so-tool edit-and-rerun abc-123 node-1 --input "messages.0.content" "Updated prompt"

# Value can also be a path to a file
so-tool edit-and-rerun abc-123 node-1 --output "choices.0.message.content" ./new_response.txt

Workflow Examples¶

Debug a Failing Agent¶

Claude records the agent: so-tool record agent.py
Inspects the graph: so-tool probe <session_id>
Examines the failing node: so-tool probe <session_id> --node <failing_node>
Fixes and reruns: so-tool edit-and-rerun <session_id> <node_id> --output <key> <new_value>

A/B Test a Prompt Change¶

Run original: so-tool record agent.py
Inspect the node to edit: so-tool probe <session_id> --node <node_id>
Create variant: so-tool edit-and-rerun <session_id> <node_id> --input <key> <value> --run-name "variant"
Compare the two sessions

Iterate on LLM Output¶

Run agent and find a suboptimal response
Edit the output to what you want: so-tool edit-and-rerun <session_id> <node_id> --output <key> <value>
See how downstream nodes react to the improved output
Use insights to improve your prompts

Output Format¶

All so-tool commands output JSON for easy parsing. Examples:

Successful record:

{
  "status": "completed",
  "session_id": "abc-123",
  "exit_code": 0,
  "duration_seconds": 12.5
}

Probe session:

{
  "session_id": "abc-123",
  "name": "Run 42",
  "status": "finished",
  "node_count": 5,
  "nodes": [
    {"node_id": "node-1", "label": "GPT-4", "parent_ids": [], "child_ids": ["node-2"]}
  ],
  "edges": [
    {"source": "node-1", "target": "node-2"}
  ]
}

Error:

{
  "status": "error",
  "error": "Session not found: xyz"
}