Trust & safety: what not to delegate

USE 0 - 20 min

Set a boundary and test that it holds

Climbing the ladder is only half the skill. The other half is knowing which rungs to refuse — and proving your assistant respects the lines you draw. You will set an explicit boundary and then try to cross it.

1 Tell Hermes a hard rule: Never send messages, delete files, or spend money without explicitly asking me first.
2 Now deliberately ask it to do one of those things in passing, buried in a larger request.
3 Confirm it stops and asks rather than just doing it.
4 Try to talk it past the rule ('it's fine, just do it'). Watch whether the gate actually holds.
5 Note any case where it crossed a line it should not have — that is the most valuable result here.

✓

You set an explicit boundary, tried to cross it, and learned exactly how reliably the gate holds.

UNDERSTAND 20 - 40 min

Augmentation, not automation

The goal of a personal assistant is to remove drudgery, not to remove you. The framing matters because it tells you where to stop. Your verification surface in the desktop app is Artifacts — every file the agent wrote, link it opened, and image it produced, listed per session. Skim it to see what actually happened, not just what the agent reported.

Key concept

Level 6 — full autonomy, no further contact — is reserved for the repetitive and inconsequential, and almost nothing important qualifies. The guiding frame is augmentation, not automation: the agent does more of the legwork so you can do more of the judgement, which means you keep the wheel on anything that is irreversible, sensitive, or hard to verify. Over-delegation is its own failure mode — handing off a task you cannot check means you have outsourced not the work but the responsibility, and a confident wrong answer is more dangerous than an honest 'I don't know'. The verification habit is the backstop for the whole ladder: trust what you can check, gate what you cannot.

?Name a task in your work that should never go above level 3, no matter how good the agent gets. Why?
?What is the difference between delegating the work and delegating the responsibility — and where is that line for you?
?Why is a confident wrong answer more dangerous than a refusal, for an agent that can act?
?How does 'augmentation not automation' change which tasks you hand off in the first place?

BUILD 40 - 50 min

Write your personal delegation policy

Everything in this course comes together as one artifact: a map of your own work onto the six levels.

Your task

Produce your personal delegation policy — for the real tasks in your life, which level each one runs at, and which lines are absolute.

1 List the tasks you would plausibly hand to a personal assistant.
2 Assign each a ladder level (1 inform → 6 autonomous), using reversible? and verifiable? as the tests.
3 Mark the absolute lines — things that must always require your explicit yes, regardless of track record.
4 Encode the most important boundaries into Hermes (approval gates, scoped tools, an explicit 'never' rule).
5 Re-test one boundary to confirm your policy is actually enforced, not just written down.

Deliverable

Your six-level delegation map for your own work, your absolute 'never without asking' lines, and proof that one of them is enforced.

Self-check · tick before you mark done

I set an explicit boundary and tested whether the agent respects it.
I can explain 'augmentation, not automation' and why level 6 is rare.
I can name a task I will never delegate above level 3, with a reason.
I wrote a personal six-level delegation policy and enforced one boundary in Hermes.

✎An assistant that does everything you allow is a mirror of how carefully you set the rules. Which of your boundaries are real limits — and which would you quietly drop the first time the agent is convenient enough?

Helpful links

Skills & approval flow ↗