01 / Reframe
A better representation is this: an agent is a work unit placed inside context, roles, artifacts, and feedback loops.
02 / Resources
A real workflow is not a chat trace. It is a set of objects that can be identified, assigned, checked, and reused; feedback makes them loop.
Context: goals, sources, constraints, decisions, and current state.
Roles: Reader, Planner, Reviewer, and Writer are work boundaries.
Artifacts: a document, task, judgment, code change, or reusable record.
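One way to make these objects concrete is to give each a small typed record. The sketch below is illustrative only: the class names `Context`, `Role`, and `Artifact` follow the text, but every field name is an assumption rather than something the source defines.

```python
from dataclasses import dataclass, field


@dataclass
class Context:
    """Goals, sources, constraints, decisions, and current state."""
    goals: list[str]
    sources: list[str] = field(default_factory=list)
    constraints: list[str] = field(default_factory=list)
    decisions: list[str] = field(default_factory=list)
    current_state: str = ""


@dataclass
class Role:
    """A work boundary such as Reader, Planner, Reviewer, or Writer."""
    name: str


@dataclass
class Artifact:
    """A document, task, judgment, code change, or reusable record."""
    artifact_id: str        # lets the object be identified (hypothetical field)
    kind: str               # e.g. "document", "task", "judgment"
    owner: Role             # lets the object be assigned
    body: str
    checked: bool = False   # set once someone has checked it
    reuse_count: int = 0    # incremented each time it is reused


# Example values are invented for illustration.
ctx = Context(goals=["Summarize the design doc"], constraints=["One page"])
plan = Artifact("A-1", "task", Role("Planner"), "Outline the three sections")
plan.checked = True
```

Keeping the four verbs (identified, assigned, checked, reused) as explicit fields is a design choice for this sketch; a real system might track them in separate logs instead.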
03 / Flow
Work is not a single exchange but a loop you can run again: goals go in, judgment comes back.
Name what to solve and where the edges are.
Hand the agent the material, limits, and past decisions.
Run the tools and produce a checkable artifact.
Human tradeoffs become the next round's context.
04 / Baseline
Do not optimize the prompt first. Make the workflow repeatable, recordable, and comparable.
    git clone <repo> && cd agent-workflow
    pnpm install
    cp .env.example .env
    pnpm eval --case before
    pnpm run session -- --record
Every improvement needs the same inputs and the same judgment surface.
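A baseline is only useful if later runs can be compared against it. As a rough sketch, assume each run is recorded as a per-case score. The `RunRecord` type, the case IDs, and the scores below are invented for illustration and are not output from the commands above. Under that assumption, a comparison might refuse to proceed unless both runs cover the same inputs:

```python
from dataclasses import dataclass


@dataclass
class RunRecord:
    label: str                 # e.g. "before" or "after"
    scores: dict[str, float]   # case ID -> score on the shared rubric


def compare(before: RunRecord, after: RunRecord) -> dict[str, float]:
    """Return the per-case score change, insisting on identical case sets."""
    if before.scores.keys() != after.scores.keys():
        raise ValueError("runs must cover the same cases to be comparable")
    return {case: after.scores[case] - before.scores[case]
            for case in before.scores}


# Made-up numbers, purely to show the shape of the comparison.
before = RunRecord("before", {"R1": 0.5, "R2": 0.75})
after = RunRecord("after", {"R1": 0.75, "R2": 0.75})
delta = compare(before, after)
```

Raising on mismatched case sets is one way to enforce "the same inputs"; the "same judgment surface" part still depends on grading both runs with the same rubric.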
05 / Evals
Evaluation is not a school test. It tells the workflow which part is actually improving.
| ID | Task | What it tests | Grader |
|---|---|---|---|
| R1 | Extract context | Missing constraints | human spot-check |
| R2 | Generate plan | Executable steps | structure match |
| R3 | Draft artifact | Fit for audience | human score |
| R4 | Review risk | Key assumptions | checklist |
| R5 | Handoff summary | Reusable next time | reuse rate |
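The table can be carried into code as a list of cases, each paired with a grader. The following is a minimal sketch, not the project's eval harness: `EvalCase`, `checklist_grader`, and the sample checklist are hypothetical. Only a checklist grader in the spirit of row R4 is automated here, because the human-graded rows need a person.

```python
from dataclasses import dataclass
from typing import Callable, Optional

# A grader takes the agent's output and returns a score between 0 and 1.
Grader = Callable[[str], float]


def checklist_grader(required: list[str]) -> Grader:
    """Build a grader that scores the fraction of required items mentioned."""
    def grade(output: str) -> float:
        text = output.lower()
        hits = sum(1 for item in required if item.lower() in text)
        return hits / len(required)
    return grade


@dataclass
class EvalCase:
    case_id: str
    task: str
    tests: str
    grader: Optional[Grader]  # None means a human grades this case


cases = [
    EvalCase("R1", "Extract context", "Missing constraints", None),
    EvalCase("R4", "Review risk", "Key assumptions",
             checklist_grader(["assumption", "rollback"])),  # hypothetical checklist
]

# Invented sample output to grade.
review_text = "Main assumption: traffic stays flat. No fallback plan yet."
scores = {c.case_id: c.grader(review_text) for c in cases if c.grader is not None}
```

Substring matching is a deliberately crude stand-in for a checklist; it shows where a grader plugs in, not how a good one should judge.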
06 / Dashboard
A strong agent system is not more fluent. It has shorter context, steadier judgment, and more reusable artifacts.
Background compresses into reusable chunks.
Every run leaves something inspectable.
Human tradeoffs enter the next round.
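These signals can be tracked per run. The sketch below assumes a hypothetical `RunLog` record and uses word count as a crude proxy for context length; none of these names or measures come from the source, and judgment steadiness is approximated only by how many earlier decisions are carried forward.

```python
from dataclasses import dataclass


@dataclass
class RunLog:
    context: str            # the context handed to the agent
    artifact_saved: bool    # did the run leave something inspectable?
    decisions_reused: int   # earlier human decisions carried into this run


def dashboard(runs: list[RunLog]) -> dict[str, float]:
    """Summarize context size, inspectability, and reuse across runs."""
    n = len(runs)
    return {
        "avg_context_words": sum(len(r.context.split()) for r in runs) / n,
        "inspectable_share": sum(r.artifact_saved for r in runs) / n,
        "avg_decisions_reused": sum(r.decisions_reused for r in runs) / n,
    }


# Invented example runs.
runs = [
    RunLog("goal sources constraints decisions", True, 0),
    RunLog("goal constraints", True, 2),
]
summary = dashboard(runs)
```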
07 / Doer vs Tutor
The same agent behavior can help you form a mental model, or let you bypass one.
As a doer, it removes search, but also removes understanding, tradeoffs, and internalization.
As a tutor, it lowers extraneous load while leaving the key judgment to you.
08 / Feedback
A workflow improves because human judgment becomes future context, not because the agent remembers more.
    artifact = agent.run(context)
    review = human.check(artifact)
    memory.add(review.decision)
    rules.add(review.risk)
    next.run(memory, rules)
Why this tradeoff was chosen.
What could invalidate the result.
What should be reused next time.
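The loop can be made runnable with stand-ins for the agent and the human. This is a sketch under assumptions: `run_agent` and `human_check` are hypothetical placeholders, and the `Review` fields mirror the three notes above. A real setup would call a model and collect an actual person's review.

```python
from dataclasses import dataclass


@dataclass
class Review:
    decision: str  # why this tradeoff was chosen
    risk: str      # what could invalidate the result
    reuse: str     # what should be reused next time (recorded, not fed back here)


def run_agent(context: str, memory: list[str], rules: list[str]) -> str:
    """Placeholder agent: a real one would send this combined text to a model."""
    return "\n".join([context, *memory, *rules])


def human_check(artifact: str) -> Review:
    """Placeholder for a person's review; the values are invented."""
    return Review(
        decision="Chose the shorter summary to fit one page",
        risk="Source doc may be out of date",
        reuse="Keep the three-section outline",
    )


memory: list[str] = []
rules: list[str] = []

artifact = run_agent("Summarize the design doc", memory, rules)
review = human_check(artifact)
memory.append(review.decision)   # human judgment becomes future context
rules.append(review.risk)        # known risks constrain the next run
next_artifact = run_agent("Summarize the design doc", memory, rules)
```

The second run sees the reviewer's decision and risk even though the placeholder agent itself keeps no state, which is the point of the section: the improvement lives in the recorded judgment.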
09 / Shift
Same AI, a different representation — and a very different long-term result.
Models, plugins, commands, and buttons; it scatters as you learn.
Objects, mechanisms, feedback, and delivery; it clarifies as you use it.
10 / Inner Map
A good agent workflow does not only finish the task. It helps you recognize the next situation faster, judge the path better, and ship more steadily.