
Harpoon

Lightweight agent orchestration runtime for LLM pipelines.

Harpoon lets you define multi-step AI workflows as directed acyclic graphs (DAGs) using simple YAML manifests and prompt templates. It supports prompt chaining, conditional branching, parallel execution, and Claude Agent SDK integration.

Installation

npm install -g harpoon-cli

# Or in your project
npm install harpoon-cli

Requires Node.js >=20.0.0

Quick Start

Create a new project:

harpoon project init my-project
cd my-project

This creates:

my-project/
├── agent.tml          # Project manifest (TML format)
└── prompts/
    └── example.prompt # Prompt template

Validate and run:

harpoon project validate
harpoon project run --dry-run -i '{"text": "Hello world"}'

TML - Text Markup Language

Remember HTML? Well, this is just the TML for marking up agents.

Harpoon uses .tml files as the conventional manifest format. TML files are valid YAML with a semantic file extension.

Auto-discovery order (when given a directory):

  1. agent.tml (primary)
  2. harpoon.tml (secondary)
  3. trident.tml (legacy)
  4. trident.yaml (legacy)

Explicit file paths work with any filename:

harpoon project run ./my-workflow.tml
harpoon project run ./pipelines/support.tml

Project Structure

Manifest (agent.tml)

harpoon: "0.1"
name: my-project
description: Project description

defaults:
  model: anthropic/claude-sonnet-4-20250514
  temperature: 0.7
  max_tokens: 1024

nodes:
  input:
    type: input
    schema:
      text: string, Input text to process

  output:
    type: output
    format: json

edges:
  e1:
    from: input
    to: process
    mapping:
      content: text

  e2:
    from: process
    to: output
    mapping:
      result: output

Prompt Templates (prompts/*.prompt)

---
id: process
name: Process Text
description: Processes input text

input:
  content:
    type: string
    description: The content to process

output:
  format: json
  schema:
    result: string, The processed result
---
You are a helpful assistant.

Process this: {{content}}

Return JSON with a "result" field.
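
At run time the body below the front matter is rendered with values supplied by the incoming edge mapping. Conceptually the substitution works like this simplified sketch (renderTemplate is illustrative only, not Harpoon's actual implementation):

```typescript
// Naive {{name}} substitution over a template body (illustration only).
function renderTemplate(template: string, vars: Record<string, string>): string {
  return template.replace(/\{\{(\w+)\}\}/g, (match: string, name: string) =>
    name in vars ? vars[name] : match // leave unknown placeholders untouched
  );
}
```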

Node Types

Type      Description
input     Entry point with schema validation
output    Exit point, collects final outputs
prompt    LLM prompt node (from .prompt files)
agent     Claude Agent SDK node with tool access
tool      TypeScript tool node for imperative code
branch    Sub-workflow execution with looping
map       Parallel fan-out over collections
trigger   Cross-workflow signal emission

Agent Nodes

Agent nodes use Claude Agent SDK for autonomous tool use:

nodes:
  analyzer:
    type: agent
    prompt: prompts/analyzer.prompt
    execution_mode: sdk  # 'sdk' or 'cli' (default: cli)
    allowed_tools:
      - Read
      - Write
      - Bash
    mcp_servers:
      github:
        command: npx
        args: ["@modelcontextprotocol/server-github"]
    max_turns: 50
    permission_mode: acceptEdits

Execution Modes:

  • cli - Uses Claude CLI with existing subscription (default)
  • sdk - Uses Claude Agent SDK with pay-per-token billing

Tool Nodes

Tool nodes execute TypeScript code for deterministic operations:

nodes:
  processor:
    type: tool
    handler: tools/processor.ts
    input:
      text: string, Input text

  # Or inline JavaScript
  validator:
    type: tool
    inline: |
      export default async (input) => {
        return { isValid: input.length > 0 };
      };
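
A handler file exports a default async function, mirroring the inline form above. A hypothetical tools/processor.ts matching the declared input schema (the exact handler contract — mapped inputs in, output object out — is assumed from the inline example):

```typescript
// tools/processor.ts (hypothetical) — deterministic text processing.
// Assumption: Harpoon calls the default export with the mapped input
// object and uses the resolved return value as the node's output.
interface ProcessorInput {
  text: string;
}

export default async function processor(input: ProcessorInput) {
  // Normalize whitespace and report a word count.
  const normalized = input.text.trim().replace(/\s+/g, ' ');
  const wordCount = normalized === '' ? 0 : normalized.split(' ').length;
  return { text: normalized, wordCount };
}
```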

Branch Nodes

Branch nodes execute sub-workflows with optional looping:

nodes:
  refine:
    type: branch
    workflow: ./refine-workflow.tml
    condition: "needs_refinement == true"
    loop_while: "quality_score < 0.9"
    max_iterations: 5

Map Nodes

Map nodes parallelize operations over collections:

nodes:
  process_items:
    type: map
    source: items  # Input array field
    item_var: item  # Variable name for each item
    prompt: prompts/process-item.prompt
    concurrency: 5  # Max parallel operations

edges:
  e1:
    from: input
    to: process_items
    mapping:
      items: data

Trigger Nodes

Trigger nodes emit signals to coordinate cross-workflow execution:

nodes:
  notify_downstream:
    type: trigger
    signal: workflow-ready
    condition: "success == true"

Edges

Edges connect nodes and map outputs to inputs:

edges:
  e1:
    from: classify
    to: respond
    mapping:
      intent: intent           # target_var: source_field
      confidence: confidence
    condition: "confidence > 0.5"  # Optional condition

CLI Commands

# Project management
harpoon project init [path] [-t, --template <name>]  # Create new project (template: minimal, standard)
harpoon project validate [path]                       # Validate project
harpoon project graph [path] [options]                # Visualize DAG
harpoon project runs [path]                           # List past runs
harpoon project signals [path]                        # View/manage signals
harpoon project schedule [path] [options]             # Generate cron/systemd/launchd configs

# Execution
harpoon project run [path] [options]
  -i, --input JSON             # Input data as JSON
  -f, --input-file PATH        # Input from file
  -e, --entrypoint NODE        # Starting node
  -o, --output FORMAT          # json, text, pretty (default: pretty)
  --dry-run                    # Simulate without LLM calls
  --trace                      # Show execution trace
  --resume ID|latest           # Resume from checkpoint
  --start-from NODE            # Start from specific node (requires --resume)
  --run-id ID                  # Custom run identifier
  -v, --verbose                # Verbose output
  --no-artifacts               # Don't save run artifacts
  --artifact-dir PATH          # Custom artifacts directory
  --telemetry/--no-telemetry   # Enable/disable telemetry (default: enabled)
  --telemetry-format FORMAT    # json or text
  --telemetry-file PATH        # Write telemetry to file
  --telemetry-stdout           # Write telemetry to stdout
  --telemetry-level LEVEL      # debug, info, warn, error

# Orchestration options
  --input-from PATH            # Load inputs from file, alias:name, or run:id
  --emit-signal                # Emit orchestration signals (started/completed/failed/ready)
  --publish-to PATH            # Publish outputs to a well-known path
  --wait-for SIGNAL            # Wait for signal file(s) before starting
  --timeout SECONDS            # Timeout for --wait-for (default: 300)

# DAG visualization
harpoon project graph --format mermaid --open        # Open in mermaid.live
harpoon project graph --direction LR                 # Set diagram direction (LR, TB, etc.)

# Scheduling
harpoon project schedule ./my-project --format cron  # Generate cron expression
harpoon project schedule ./my-project --format systemd  # Generate systemd timer
harpoon project schedule ./my-project --format launchd  # Generate launchd plist

Validation

Harpoon validates your workflow at multiple levels:

Basic Validation

harpoon project validate ./my-project

Checks:

  • Manifest syntax and required fields
  • DAG structure (no cycles, valid node references)
  • Edge mapping warnings (source/target field mismatches)

Strict Mode

harpoon project validate ./my-project --strict

In strict mode, warnings become errors. Use this in CI/CD to catch:

  • Edge mappings that reference non-existent output fields
  • Edge mappings that target unexpected input fields

Example Output

Project: my-project
  Prompts: 2
  Tools: 1
  Edges: 3
  Nodes in execution order: 4

Warnings:
  ⚠ Target field 'wrong_name' not expected by 'process' (prompt).
    Expected inputs: ['content'] (edge: e1)

✓ Validation passed

Common Validation Errors

Tool nodes must reference a handler file (or use inline:), not a bare module:

# ❌ Wrong - 'module' is not a recognized tool reference
nodes:
  my_tool:
    type: tool
    module: mymodule

# ✅ Correct - 'handler' points at the TypeScript implementation
nodes:
  my_tool:
    type: tool
    handler: tools/mymodule.ts

TypeScript API

import { loadProject, run } from 'harpoon-cli';

// Load and execute
const project = await loadProject('./my-project');
const result = await run(project, {
  inputs: { text: 'Hello world' },
  dryRun: false,
  verbose: true,
});

// Check results
if (result.success) {
  console.log(result.outputs);
} else {
  console.log(`Failed: ${result.error}`);
}

// Access trace
for (const node of result.trace.nodes) {
  console.log(`${node.id}: ${node.tokens}`);
}

Features

  • DAG Execution: Automatic topological ordering and dependency resolution
  • Parallel Execution: Independent nodes run concurrently, map nodes fan out
  • Conditional Edges: Skip nodes based on runtime conditions
  • Multiple Providers: Anthropic and OpenAI support
  • Execution Modes: CLI mode (existing subscription) and SDK mode (pay-per-token)
  • Checkpoints: Resume interrupted runs from last successful node or specific node
  • Artifacts: Automatic persistence of runs, traces, and outputs in .harpoon/
  • Dry Run: Test pipelines without LLM calls
  • Mermaid Visualization: Generate interactive DAG diagrams, open in browser
  • Telemetry: Configurable telemetry with streaming output and multiple formats
  • Workflow Orchestration: Chain workflows with signals, wait conditions, and shared outputs
  • Scheduling: Generate cron, systemd, and launchd configurations

Workflow Orchestration

Harpoon supports file-based workflow orchestration for chaining workflows, scheduled execution, and event-driven triggers.

Signal Files

Workflows can emit signal files to indicate state transitions:

# Emit signals when running
harpoon project run ./my-workflow --emit-signal

Creates signal files in .harpoon/signals/:

  • {workflow}.started - Workflow began execution
  • {workflow}.completed - Workflow finished successfully
  • {workflow}.failed - Workflow encountered an error
  • {workflow}.ready - Outputs are available for downstream workflows

Input Chaining

Load inputs from previous workflow outputs:

# From a file path
harpoon project run --input-from ../upstream/.harpoon/outputs/latest.json

# From an alias
harpoon project run --input-from alias:upstream-workflow

# From a specific run
harpoon project run --input-from run:abc123

Wait Conditions

Block execution until signals are present:

# Wait for upstream workflow to complete
harpoon project run ./downstream \
  --wait-for ../upstream/.harpoon/signals/upstream.ready \
  --timeout 600

# Wait for multiple signals
harpoon project run ./final \
  --wait-for ./step1/.harpoon/signals/step1.ready \
  --wait-for ./step2/.harpoon/signals/step2.ready

Publishing Outputs

Publish outputs to well-known paths for downstream consumption:

harpoon project run ./my-workflow --publish-to ./outputs/latest.json

Manifest Configuration

Configure orchestration in the manifest:

orchestration:
  publish:
    path: .harpoon/outputs/latest.json
    alias: my-workflow

  signals:
    enabled: true

Examples

See the examples/ directory:

  • hello-world/ - The classic get-your-feet-wet demo
  • tools-demo/ - Run TypeScript code to get deterministic outputs
  • branching-demo/ - Evaluate a condition and intelligently choose a path
  • looping-demo/ - Sub-workflows allow repetitive execution without violating the DAG
  • workflows-demo/ - Workflows are composable
  • map-demo/ - Parallelize operations over collections
  • trigger-demo/ - Coordinate workflows with signal emission

License

MIT