Advanced Agent Stack

Codex × Claude Code × Hermes: turn three agents into one working system

This page is for people who already use AI coding tools and want a cleaner operating model: Codex implements, Claude Code reasons and reviews, and Hermes keeps workflows memorable, schedulable, and auditable.

What this guide covers

If you only need to fix a bug

Use Codex for a narrow patch, then finish with tests and diff review.

If you are breaking down a large requirement

Use Claude Code or Hermes to plan the risk and scope before implementation.

If you repeat the same process every week

Convert the proven workflow into a Hermes skill, cron job, or handoff routine.

01 / Blueprint

The right mental model: not three chat boxes, but a three-layer system

Many teams install Codex, Claude Code, Cursor, and Hermes, yet still paste the same context every day. The advanced move is to divide the system into execution, judgment, and operations.

Codex|Execution layer

Best for working inside a repository: reading files, editing code, adding tests, running commands, and summarizing pull requests. Treat it as an implementation specialist.

Claude Code|Judgment layer

Best for long reasoning, architecture tradeoffs, requirement breakdowns, interactive code review, subagents, hooks, and MCP workflows. Treat it as a senior reviewer or architect.

Hermes|Operations layer

Best for durable preferences, reusable skills, Telegram / CLI / API triggered work, scheduled checks, handoffs, and turning successful work into repeatable procedures. Treat it as the agent operating system.

808 takeaway: Codex does not need to replace Claude Code; Claude Code does not need to replace Hermes. The stable pattern is: Codex patches, Claude Code reasons, Hermes remembers and operates.

02 / Division of Labor

Task split: do not ask every agent to do the same job

SituationSmall bug or feature
CodexImplement directly, add tests, run lint
Claude CodeSecond review when needed
HermesCapture reusable lessons or verification results
SituationUnfamiliar large repo
CodexScan code, find entry points, list risks
Claude CodeBreak down architecture and decisions
HermesTurn repo rules into reusable onboarding
SituationLong task or multiple branches
CodexPatch inside a branch or worktree
Claude CodePlan task slices and review diffs
HermesSchedule, track, hand off, and notify
SituationRepeated workflow
CodexRun templated implementation tasks
Claude CodeImprove prompts and subagents
HermesSave the workflow as a Hermes skill
SituationRisky deployment
CodexPrepare a change set, do not deploy by default
Claude CodeReview risk and rollback plan
HermesRequire human approval and preserve audit trail

03 / Shared Memory

Let all three tools read the same project operating manual

The practical lesson from official docs and community practice is consistent: advanced agent workflows are not powered by magical prompts. They work because the project has durable context. Codex reads AGENTS.md, Claude Code reads CLAUDE.md, and Hermes uses skills, memory, and session handoff.

Codex

AGENTS.md

Store repository structure, startup commands, tests, definition of done, and hard limits. Keep the root file short and link to docs for details.

Claude Code

CLAUDE.md

Store interaction habits, architecture principles, common commands, and review standards. Claude Code also supports memory, hooks, subagents, and MCP.

Hermes

Skills / Memory / Handoff

Save repeatable procedures as skills, long-term preferences as memory, and task progress as handoff so Telegram, CLI, and cron can continue the work.

Minimal repository context structure
your-project/
├─ AGENTS.md          # Codex / general coding-agent rules
├─ CLAUDE.md          # Claude Code interaction and architecture rules
├─ docs/agent/
│  ├─ architecture.md # system architecture and data flow
│  ├─ testing.md      # test, lint, and build commands
│  └─ release.md      # deploy, rollback, and human-approval gates
└─ .hermes/plans/     # optional: Hermes plans / handoffs

04 / Recipes

Three hybrid workflows you can actually run

A

Plan → Patch → Review: the most stable daily development loop

Ask Claude Code or Hermes for a concise plan, let Codex implement in an isolated branch or worktree, then finish with artifact-only review. This works well for small features, bug fixes, and documentation changes.

Human instruction
Please do not edit files yet.
1. Read AGENTS.md / CLAUDE.md / docs/agent/* first.
2. Propose a plan in 3 steps or fewer.
3. List files to change and how to verify them.
4. Wait for confirmation before implementation.
B

Use Codex as a specialist tool from Claude Code

Community workflows often expose Codex CLI through MCP so Claude Code can call Codex as an implementation specialist. This is an advanced pattern; command names may change, so verify your installed version with official docs and --help.

Community-style example; verify against your installed version
# 1. Install and sign in to Codex
codex

# 2. If your version supports it, start the MCP server
codex mcp-server

# 3. Register it in Claude Code
claude mcp add --transport stdio codex -- codex mcp-server

# 4. Check it inside Claude Code
/mcp
Important: validate this in a toy repository or read-only task before granting write access in a production repository.
C

Use Hermes as the long-term operations layer

Claude Code and Codex are strongest inside the current session. Hermes is strongest across sessions: memory, cron, Telegram / Discord gateway, skills, and handoff. Use it for daily checks, PR summaries, release checklists, and issue triage.

Hermes workflow-design prompt
Please turn this repository release process into a Hermes skill:
- trigger conditions
- files to read first
- verification commands to run
- actions requiring human approval
- handoff / report to leave at completion

05 / Templates

Copy-ready templates

Minimal AGENTS.md

AGENTS.md
# Project Agent Guide

## What this project is
One-sentence product, user, and data-flow summary.

## Commands
- Install: `...`
- Dev: `...`
- Test: `...`
- Build: `...`

## Conventions
- Main language / framework
- File placement rules
- Naming and error-handling principles

## Definition of Done
- Tests and lint pass
- Human-readable summary exists
- Major changes include rollback notes

## Never do
- Do not read or print secrets
- Do not deploy production directly
- Do not run destructive database changes

Claude Code review prompt

review prompt
Please review only; do not edit files.
Check this diff for:
1. compliance with AGENTS.md / CLAUDE.md
2. security, data-loss, permission, or migration risk
3. unverified assumptions
4. missing tests
Classify findings as Critical / Major / Minor.

Hermes skill trigger

skill description
Use when preparing or reviewing a production release:
- inspect current git diff and deployment config
- verify tests, build, and smoke checks
- identify human-approval gates
- produce release handoff and rollback notes
Do not deploy unless explicitly approved.

Three-agent handoff

handoff
## Agent Handoff
- Goal:
- In scope / out of scope:
- Files changed:
- Commands actually run:
- Test / build result:
- Risks:
- Human approval needed:
- Next best action:

06 / Guardrails

The stronger the multi-agent workflow, the more you need hard guardrails

AGENTS.md, CLAUDE.md, and prompts are still text. Real safety comes from environment isolation, permission boundaries, and verification gates.

Recommended order: first master AGENTS.md + planning + test gates in one repository; then add Claude Code review; finally connect Hermes cron, gateway, and skills for long-running operations.

07 / Sources

Sources and further reading

This guide is based on public documentation and community practices. Use community posts for patterns and inspiration, but verify exact commands against each tool's current official documentation and --help.