Skip to main content

Documentation Index

Fetch the complete documentation index at: https://gump.build/docs/llms.txt

Use this file to discover all available pages before exploring further.

Cheap2Sota

Start cheap. Escalate only on failure.

When to use

For most everyday tasks. The workflow starts with the cheapest agent and only escalates to more powerful models if gates fail. Ideal for keeping costs low without sacrificing reliability.
gump run cheap2sota --spec spec.md

The workflow

name: cheap2sota
max_budget: 8.00

steps:
  - name: decompose
    type: split
    get:
      prompt: |
        Decompose {spec} into independent tasks.
        Each task must be implementable and testable in isolation.
    run:
      agent: claude-sonnet
    gate: [schema]
    each:
      - name: impl
        type: code
        get:
          prompt: |
            Implement: {task.description}
            Files: {task.files}
        run:
          agent: qwen
          guard:
            max_turns: 60
        gate: [compile, test]
        retry:
          - attempt: 3
            agent: claude-haiku
          - attempt: 4
            agent: claude-sonnet
          - attempt: 5
            agent: claude-opus
          - exit: 6

  - name: quality
    gate: [compile, lint, test]

How it works

Gump starts by asking Claude Sonnet to decompose the spec into independent tasks. Each task is then implemented by Qwen, the cheapest agent in the pool. If the gates (compile + test) fail, Gump retries with the same agent first — sometimes a second attempt with the error context is enough. From attempt 3, Gump escalates progressively: Claude Haiku, then Sonnet, then Opus. Each retry receives the failed diff and the gate stderr as context. The final gate (compile + lint + test) checks the overall integrity after all tasks.

Typical metrics

Most tasks pass on the first or second attempt with Qwen. On a 3–5 task spec, average cost is 0.500.50–2.00 with no escalation, and 33–5 with one or two escalations. Duration: 1–3 minutes.

Customize

Replace qwen with opencode.
Add hitl: before_gate on the decompose step.
Reduce max_budget to 3.00.
Add coverage: 80 to the quality gate.
By default, sessions are kept across retries. If you want a fresh start on escalation, add session: new to the escalation entries (automatic when agent changes).