Everyone you know "uses AI." Almost none of them run a stack — a set of layers that work together so that output compounds instead of evaporating after each chat session. The difference shows up within a month: the chat-user has a history full of clever one-offs; the stack-runner has systems quietly producing work while they sleep.
Here's the thing the tool-review industry won't tell you: the brand names barely matter. Stacks fail because of missing layers and wrong build order, not wrong vendors. This guide gives you the architecture. Slot your preferred tools into it.
The five layers
Layer 1 — The engine (an agent-grade assistant)
One frontier AI assistant on a paid plan — the model that does your actual thinking and executing. Selection criteria, in order: (1) can it complete multi-step tasks, not just answer questions; (2) can it read your files and folders; (3) can it use tools — browse, run code, produce real documents; (4) does it hold enough context for your biggest recurring job. The $20-per-month tier of any frontier assistant clears this bar in 2026; agent-grade capability is now table stakes.
The one-engine rule: pick ONE and go deep. Tool-hoppers relearn interfaces; stack-builders accumulate templates, examples, and context files that make every future task better. Switching engines quarterly costs you your compounding.
Layer 2 — The memory (your context library)
The highest-ROI hour of your entire AI journey is writing what we call context docs — plain files the engine can be pointed at:
- WHO.md — your role, company, goals, current projects, the acronyms of your world.
- VOICE.md — three examples of your writing plus rules ("short sentences, no corporate throat-clearing").
- STANDARDS.md — what good output looks like for your recurring deliverables, with pasted examples.
- STACK.md — your saved delegation briefs (see the $2 Test).
These four files turn every future prompt from a cold start into a warm handoff. They are the difference between an intern on day one and an intern in month six — except you write the six months in an afternoon.
Layer 3 — The scheduler (recurrence)
A task that runs only when you remember it isn't a system; it's a hobby. You need one mechanism that fires work on a schedule: an assistant with built-in scheduled tasks, an automation platform, or even calendar blocks that trigger you to trigger the saved brief. Criteria: (1) survives you being busy, (2) delivers output somewhere you already look (inbox, chat), (3) fails loudly, not silently.
Layer 4 — The capture net
Your stack is only as good as what reaches it. You need a single, frictionless place where raw material lands: a notes inbox, a "read later" queue, a voice-memo habit, forwarding rules that sweep receipts and reports into one folder. One place. If capture is scattered across six apps, Layer 2 starves and every delegation starts with twenty minutes of archaeology.
Layer 5 — The review loop
The layer everyone skips, and the reason most stacks quietly rot. Once a week, fifteen minutes (fold it into your weekly review): Which automated outputs did I actually use? Which briefs need better examples? What new candidate earned a $2 Test? Kill anything you ignored twice — an automation you don't read is negative value, because it trains you to ignore the whole stack.
Budget tiers
- $0/month — the proof of concept. Free tiers of a frontier assistant now include agent-grade models. Context docs cost nothing. Scheduling = recurring calendar reminders. This tier is fully capable of your first three systematized workflows.
- $20–40/month — the working stack. One paid assistant plan (bigger context, priority capability) plus perhaps one automation tool. This is where most professionals should live. Expect the setup to pay for itself inside the first week — one reclaimed hour covers the month.
- $100/month — the operations team. Paid engine + automation platform + a transcription tool + a specialized vertical tool for your field. Justified only when Layer 5 shows you're actually consuming everything the smaller stack produces.
Spending rule: upgrade a layer only when it's the binding constraint. Most people buy Layer-1 upgrades when their real bottleneck is an empty Layer 2. A $200/month stack with no context docs loses to a free stack with good ones. Every time.
Build order: the four-week install
- Week 1 — Engine + Memory. Pick your assistant. Write the four context docs (one afternoon). Run your first $2 Test using them.
- Week 2 — First recurrence. Take the week-one win, save the brief, schedule it. You now own one L3 workflow. Feel the difference between "used AI" and "runs a system."
- Week 3 — Capture. Consolidate to one inbox for raw material. Set up 2–3 forwarding/sweep rules. Point your recurring brief at the captured material.
- Week 4 — Review. First weekly review of the stack. Kill, keep, or improve each running piece. Add one new $2 Test. This meeting with yourself is now permanent.
Why this order: engine-without-memory produces generic output (you'll quit); recurrence-before-a-proven-win schedules garbage; capture-before-recurrence gives you a tidy pile of material nothing consumes. Build the consumer before the supply chain.
The four mistakes that waste month one
- Tool safari. Trying nine tools shallowly instead of one deeply. The stack is boring on purpose.
- The all-in-one fantasy. Waiting for a single app to be all five layers. Assemble; don't wait.
- Automating judgment. The stack prepares decisions — drafts, digests, options. The ones with your name on them stay yours.
- No kill switch. Stacks accumulate zombie automations. The weekly review exists to shoot them. A lean stack you trust beats a sprawling one you've learned to skim past.
What "working" looks like
Ninety days in, a healthy stack has: four context docs that get referenced daily, five to ten scheduled workflows, a capture inbox that empties weekly, and a review habit that's killed at least two automations. The visible symptom is calendar white space. The invisible one is that your output stopped scaling with your hours — which is the entire point.
Worked example: a consultant's stack, layer by layer
Abstract architecture becomes obvious when you watch one person install it. Meet a solo strategy consultant — billable hours are everything, admin is death:
- Layer 1 (engine): one $20/month frontier assistant. Her selection test was a single task: "read this client folder and draft the kickoff memo." The first tool that nailed it won. Total evaluation time: one afternoon, two tools.
- Layer 2 (memory): WHO.md lists her niche, her three active clients (codenames), and her methodology. VOICE.md holds two past proposals and the rule "no consultant-speak: never write 'leverage synergies.'" STANDARDS.md contains her best-ever client memo. Writing them took 90 minutes; they've been read by the engine hundreds of times since.
- Layer 3 (scheduler): Monday 7 a.m. — industry digest per client, delivered before her weekly check-in calls. Friday 4 p.m. — a draft of each client's weekly status note from her meeting notes folder. Both were $2 Test wins first.
- Layer 4 (capture): one folder per client, and a rule: every call gets a voice memo dumped to it. The Friday automation reads the week's memos. No memo, no status note — the system's hunger enforces the capture habit.
- Layer 5 (review): fifteen minutes inside her Friday shutdown. In month two she killed a daily news automation she never read, upgraded the Monday digest's examples, and added a proposal-skeleton workflow that now saves her three hours per pitch.
Net effect: roughly six reclaimed hours per week, or — at consulting rates — the stack pays for itself several hundred times over. Nothing above required code, and every piece maps to a layer you can install this week.
The upgrade triggers
Don't shop; wait for triggers. Engine upgrade when you hit context limits on your biggest recurring job twice in a week. Automation-platform purchase when you're manually triggering more than three scheduled briefs. Transcription tool when voice capture becomes your main input. Vertical tool when a domain task (legal review, design, code) hits the $2 threshold weekly but general models keep failing your GOOD LOOKS LIKE bar. Money follows proven bottlenecks — never the other direction.
