App Automaton / github.com/appautomaton
300+ 30 repos GitHub
30 repos · 300+ stars Claude Code · Codex · Gemini · OpenCode MIT licensed

Skills, harnesses,
and on-device tools.

App Automaton is an open-source workshop for engineering with coding agents — Claude Code, Codex, Gemini, and OpenCode. Portable SKILLs, stage-gated runtimes, and a quiet streak of pure-MLX work for Apple Silicon.

$ gh repo clone appautomaton/agent-designer
Clone · symlink · run Workflow-first · text, not code
github.com/appautomaton · Open source · Workflow-first
30
Public repos
18
SKILLs published
4
Agent CLIs supported
300+
Stars (org-wide)
100%
Workflow-first
Anatomy of a SKILL

One folder. One markdown file.
Three layers of disclosure.

Every App Automaton skill has the same shape — YAML frontmatter the agent always sees, a workflow body it loads on invocation, and a sibling tree of references and scripts pulled only as the path narrows.

skills/pdf/SKILL.md · main
loaded · 3-tier
Source · SKILL.md markdown · 18 lines
1---
2name: pdf
3description: "Extract text & forms,
4 fill, merge, OCR — and split PDFs."
5tags: [pdf, forms, ocr]
6---
7
8# PDF Skill
9
10## Quick start
111. Pick a path: extract /
12 forms / merge / ocr.
132. Run `uv run scripts/<path>.py`.
143. Read `references/<path>.md` only
15 if the quick path is unclear.
16
17## Safety
18Never overwrite the input file.
Disclosure · live 0 / 8 steps
L1 metadata~140 tok L2 bodyon invoke L3 refson demand portableclaude · codex · gemini · opencode
What's in the org

Six repos to know.

App Automaton ships about thirty repositories. These six map the territory — designing skills, distributing them, the runtimes that hold them together, and what runs locally on the laptop.

agent-designer

Where the skills are designed.

Codex- and Claude-style SKILLs, an MCP tool catalog, and bridge skills that let one agent delegate to another with session continuity.

gh repo clone appautomaton/agent-designer
document-SKILLs

Documents, sheets, slides, PDFs.

Four file-format skills — docx, xlsx, pptx, pdf — adapted from Anthropic's official set. uv plus PEP 723 inline metadata; no requirements.txt, no virtualenv.

ln -s ./docx ~/.claude/skills/docx
webmaton

The web, the way a researcher reads it.

Five skills for deep search with grounded citations, deterministic HTML→Markdown, and persistent browser sessions over Playwright, nodriver, and Chrome DevTools.

ln -s ./agentic-search ~/.claude/skills/agentic-search
automaton

A stage-gated harness.

frame → plan → review → execute → verify → resume. The npx installer drops the workflow into a project as plain markdown skills, with durable state under .agent/.

npx @appautomaton/automaton install --all .
automux

Multi-agent in tmux.

Spawn coders, planners, and reviewers in split panes. Each in its own git worktree. Coordination happens through files — never by scraping the terminal.

automux spawn login-page coder --provider claude
mlx-speech

Local speech, on Apple Silicon.

Eight TTS and ASR models in pure MLX — Fish S2 Pro, VibeVoice, Step-Audio, Cohere ASR. Voice cloning and emotion tags. No cloud, no PyTorch.

mlx-speech tts --model fish-s2-pro --text "..."
Quickstart

Three commands.

Clone, symlink, point an agent at it. The shape stays the same under Claude Code, Codex, Gemini, and OpenCode.

terminal
~/skills
$ git clone https://github.com/appautomaton/document-SKILLs
 
$ cd document-SKILLs
$ for s in docx pdf pptx xlsx; do
ln -s "$(pwd)/$s" ~/.claude/skills/$s
done
 
$ claude "build a PDF report from sales.csv"
✓ skill `pdf` invoked