§ 06The comparison
Same job. Different machinery.
Anthropic ships a pptx skill, and parts of it are genuinely good — the design guidance reads like a designer wrote it. The difference is architectural: their guardrails are advice to the model. deckhand's are machinery in the tool — and advice can be ignored on exactly the slide that matters. We didn't leave that as an opinion — we raced them, blind-judged, four rounds — § 07.
the moment
Anthropic's pptx skill
deckhand
Read the deck
markitdown text dump; thumbnails for picking layouts. Geometry and formatting live in the raw XML.
Full inventory — ids, geometry in inches, formatting runs, image names, detected issues. --brief: one line per shape.
Make an edit
Unpack the zip, hand-edit slide3.xml raw — the skill's own docs list the OOXML pitfalls to dodge while you type angle brackets.
Declare the op. set-text inherits the old text's formatting; swap-image keeps frame and crop. The XML is the tool's problem.
Get it wrong
Validation runs at pack time — after all the editing. Errors arrive late, one repack per discovery, and the tree on disk is already half-edited.
Patch rejected before a byte is written — every error at once, with the slide's real shape inventory in the message. The agent repairs in one pass.
Text overflows its box
Nothing notices. The deck ships with the overflow.
The linter reports it in inches with the exact fix command — and after repair, anything still broken is reported as residue, never claimed fixed.
Re-theme the deck
Unpack and hand-edit hex values across every slide's XML, repack, hope nothing else changed.
One replace-color op per palette mapping, one atomic patch. A missed color errors out listing the deck's actual palette, with counts.
Create from scratch
Write PptxGenJS code. The output is a new file on a blank theme — your template's master, layouts, and brand aren't in the room.
Write the slide as HTML; html2patch compiles it into ordinary ops — including straight into your branded template's layouts.
Verify the result
Thumbnail grids for template analysis; for real QA, assemble your own soffice + pdftoppm pipeline.
Built in: render any slide, crop and zoom any region, diff the structural changelog, lint the geometry. Trust is earned per edit.
Who owns it
Proprietary license — no derivative works, no redistribution.
MIT. Fork it, extend it, ship it inside your own product, run it on any model, any platform.
If you're the agent
You never hold raw XML in context, never burn tokens re-reading a deck to find a shape, and every mistake comes back as a correction you can act on — in one round trip.
If you're the human
The file that comes back always opens, is never half-edited, and still looks like your deck — because the edits ran inside your template instead of around it.
If you're the platform
It's a CLI under an MIT license. Wire it into any agent product — Claude, or anything else that can run a shell command — and keep the verification loop.