Skip to content
@TheColliery

TheColliery

Quality tooling for AI coding agents - mined, sorted, and shipped.

TheColliery

TheColliery Banner

Quality tooling for AI coding agents — mined, sorted, and shipped.

Suite Downloads Developers Mostly Cross-Agent License


🏗️ What is TheColliery?

A colliery is the complete coal mining operation: the mine, the tipple, and the workspace that turns a raw underground seam into clean, burning fuel.

TheColliery is a series of small, sharp, offline-first tools designed to keep AI coding agents honest, secure, and blazing fast. We build the infrastructure that helps agents execute safely, coordinate concurrently, and optimize their token budgets without sacrificing quality.


⛏️ The Series Suite

The Colliery structures its tools by the processing stages of raw digital coal:

Project Stage Status Concept
CoalMine Extraction Live ver Nine quality-canary skills (code-health, grounding, supply-chain, resilience, observability, testability, scaling, drift, completeness) that equip agents for raw, safe code extraction.
CoalTipple Sorting Live ver A model/effort router: delegation (down, to save tokens) and escalation (up, for quality), a qualityBar staircase, and a fail-safe model-ranking Lock. Claude Code only — routing actuates only where an agent can pick a spawned worker's model + effort (built and validated across the 2.1.x line).
CoalBoard Governance Live ver A consensus & debate board: on an error-not-allowed task — or any hard problem worth several lenses — with consent, diverse epistemic lenses (empirical/source-grounded · formal · a show-me skeptic) debate in parallel, a judge synthesizes on VERIFIED inputs, an independent solver breaks ties, and the human signs off. Bounded cost + zero-breakage. Cross-agent — validated on Claude Code + Antigravity; other concurrent-subagent platforms design-supported.
CoalHearth Session Storage Live ver A session warm-resume engine: a hook journals session state every step, so an interrupted session resumes from a recovery block instead of a manual rebuild — plus an advisory near-limit budget nudge. Reduces the work lost to a session limit (it does not prevent one). Claude Code (Phoenix-13 hooks).
CoalFace Active Front Beta ver The fan-out discipline: a mandatory scout sizes the swarm, workers return anchor-edit orders as text, one writer applies behind a snapshot + QC + domain gate, and the whole run is bounded by the estimated solo cost (bandwidth dial with TCP-style backoff). Disciplines the fan-outs an agent would already do — it does not make models smarter.
CoalWash Washing 🔵 Blueprint A memory-defragment engine (the 6th sibling, in design): cleans + compacts an agent's persistent memory — notes and governance — under a hard zero-fact-loss guarantee (a code-core scout → gate → atomic-apply pipeline, with a 2-agent outsider+insider judgment and human-gated deletes). Cross-agent. Spec in design; build deferred.
CoalLedger Records 🔵 Blueprint A docs-health canary suite (the 7th sibling, in design): the docs-side mirror of CoalMine's code canaries — grounding (docs match their source-of-truth), completeness, rot, consistency, structure (AST rendering-fidelity), quality (incl. language mechanics), + an optional leak check. Cross-agent + multilingual; the engine reuses CoalMine. Spec in design; build deferred.
CoalPortal Gateway Reserve A remote web-to-CLI gateway (an idea held on the shelf): drive a local session from a phone. Superseded by first-party Claude Code Remote Control (/rc) and cut before build — kept as a reserve concept for a narrow air-gapped / zero-trust niche /rc may not serve. Whether it revives is undecided — a plan on record, not an active build.

📦 Install — pick your DLC

Each tool installs on its own, or grab them together — like choosing DLC on a store page: take the whole pack, or just the pieces you want.

One-step selector (needs node + git on PATH; the claude CLI for the plugins):

curl -fsSL https://raw.githubusercontent.com/TheColliery/.github/main/install.mjs -o colliery-install.mjs
node colliery-install.mjs all       # the whole suite (all 5)
node colliery-install.mjs 3         # just CoalBoard
node colliery-install.mjs 1 2 3     # CoalMine + CoalTipple + CoalBoard
node colliery-install.mjs 4 5       # CoalHearth + CoalFace (CoalFace beta)
node colliery-install.mjs coalmine  # by name, too

Or install each tool directly — each repo's README has the authoritative, always-current steps:

# Tool How
1 CoalMine a Claude Code pluginclaude plugin marketplace add HetCreep/CoalMine then claude plugin install coalmine@coalmine; other agentsnode scripts/install.mjs <agent> (Antigravity/Cursor/Codex/…; auto-trigger hooks are Claude-Code-only — see the README)
2 CoalTipple a Claude Code pluginclaude plugin marketplace add TheColliery/CoalTipple then claude plugin install coaltipple@coaltippleClaude Code only (routing can't actuate where an agent can't pick a worker's model)
3 CoalBoard a Claude Code pluginclaude plugin marketplace add TheColliery/CoalBoard then claude plugin install coalboard@coalboard; Antigravity (validated) — copy skills/coalboard into ~/.gemini/config/skills (see the README)
4 CoalHearth a Claude Code pluginclaude plugin marketplace add TheColliery/CoalHearth then claude plugin install coalhearth@coalhearthClaude Code only (Phoenix-13 hooks)
5 CoalFace a Claude Code plugin (beta) — claude plugin marketplace add TheColliery/CoalFace then claude plugin install coalface@coalface; other agents — copy skills/coalface into the agent's skills root (cross-agent contract; hooks/commands are Claude-Code-only)

📊 Benchmarks

Headline results — small, dated samples. Full tables, tested versions, + caveats live in each tool's benchmark docs (the single source of truth, never duplicated):

Tool Result
CoalMine 7 canaries × 4 engines, 3-5 repeated runs each: 100% recall on 6/7 suites for every engine · 0 decoy false alarms batch-wide · drift-canary 88% median (the discriminating suite) — measured 2026-07-03
CoalTipple ON-vs-OFF paired (4 tasks × 3 tiers × K=3, 2026-07-03): routing ON = 4/4 task quality on both baselines, OFF = 3/4 on both (each failing a different task) · ON is ~23% cheaper from an Opus main, cost-neutral from a Sonnet main · never-delegate-sensitive-down re-confirmed on the new lineup — see the record
CoalBoard solo-vs-board (2026-07-03): Opus 4.8 solo 4/5 · board 5/5 (strong solo catches the reasoning traps unaided; board's edge = the version-sensitive FACT T3). Cross-vendor mirror — Gemini 3.5 Flash solo ~4/15 · board 5/5 (weak solo → board recovers ALL traps). Board margin scales inversely with solo strength; T3 (run-the-check) is the win at both ends. Board = solo + ground-truth execution — see the record
CoalHearth interruption damage (2026-07-03, v1.0.0): on a 10-file mid-refactor, warm resume and cold restart BOTH finished correctly with a <1% token delta — at small scale a strong model rebuilds state from the tree, so CH's token saving is a large-session effect; its irreducible value is state fidelity — the in-flight sub-agent record (CH 1, cold 0) a cold restart cannot reconstruct — see the record
CoalFace fan-out cost (2026-07-03, beta.2): fanning out costs more raw tokens than solo (per-sub baseline × N — ad-hoc 4.2×, CF 5.3×), but cheap-tier workers make it −15% in dollars vs a solo Opus main — CF's wallet is a $-via-cheap-tier bound, not a token saving; scout+digest amortizes only above a shared-context threshold. Benchmark ≠ graduation, CF stays beta — see the record

See the benchmark records.


📜 The 5 Doctrine Layers

Every tool inside TheColliery is governed by our core constitution — the full doctrine spells out every Phoenix-13 and Quantum-11 point. The lines below are one-line digests:

  1. 🌐 Works in Every Mine (Cross-Agent): Vendor-agnostic by design — CoalMine, CoalBoard, and CoalFace run on any concurrent-subagent agent (Claude Code, Antigravity, Cursor, Codex, Cline, …). Validated on Claude Code; CoalBoard is also validated on Antigravity (2026-06-22); every other platform is design-supported (re-verify subagent support on yours). (CoalTipple and CoalHearth are the deliberate exceptions, Claude Code only — CoalTipple's model/effort routing only actuates where an agent can pick a spawned worker's model; CoalHearth's warm-resume runs on Claude Code hooks.)
  2. 🦅 Phoenix 13 Compliance: Immortal hooks, all 13 commandments — fail-silent · zero-dependency · zero-latency · zero-garbage · zero-side-effects · stateless · offline · deterministic · portable · sandboxed · future-proof · self-healing · zero-noise.
  3. 🔬 Quantum 11 Performance: All 11 principles — maximum output · zero visible errors · single brand · minimum power + consent · essential accessories · error correction · determinism · isolation · measurement · trust · entanglement.
  4. 🛡️ Antivirus/ESET Heuristics: Adaptive freshness checks, signature validation, and secure credential handling.
  5. 🔌 Single Power Button: Absolute minimal setup — a single command installs and runs the conductor.

🧩 The 5 Standard Systems

Beyond the doctrine, each tool is built like a proper program — it ships the same five standard systems, so any tool feels familiar the moment you open it (the full doctrine has the detail):

  1. ⚙️ Config — every knob a user might reasonably change lives in a .<tool>.json file (project overrides global); nothing load-bearing is hard-coded out of reach.
  2. 🌍 Language — output auto-detects the conversation's language (English is the always-safe fallback — zero setup) and can be locked via config. Prose is translated; technical terms — commands, paths, identifiers, config keys, model/tier/severity names — are kept verbatim (a translated command is a broken command).
  3. 🔄 Self-Update — a tool checks for and applies its own updates (consent-gated, offline-graceful), and every version transition leaves no old-version leftover — stale caches and renamed files are swept, never abandoned.
  4. 🐛 Problem-Report — one command offers to file an issue upstream, with any memory content scrubbed first; nothing is ever submitted without your say-so.
  5. 📊 Measurement — a /stats command shows what the tool actually did and what it was worth — tokens saved, issues caught, or fidelity preserved; a quality tool should prove its own impact.

More tools and skills are still deep underground. 🔦

Pinned Loading

  1. .github .github Public

    TheColliery -- quality tooling for AI coding agents. Holds the series landing page, the shared doctrine (Phoenix-13 hooks + Quantum-11), the one-step DLC installer, and the suite benchmarks.

    JavaScript

Repositories

Showing 5 of 5 repositories

Top languages

Loading…

Most used topics

Loading…