Domain: AI/ML Architecture
Inheritance: inheritable
Version: 1.0.0
Last Updated: 2026-02-01

Overview

Comprehensive patterns for designing AI agents—autonomous systems that use LLMs to reason, plan, and execute multi-step tasks. Covers single-agent architectures, multi-agent orchestration, tool use, memory systems, and production deployment patterns.

Agent Architecture Fundamentals

What Is an AI Agent?

┌─────────────────────────────────────────────────────────────┐
│                      AI AGENT                               │
├─────────────────────────────────────────────────────────────┤
│  ┌─────────┐   ┌─────────┐   ┌─────────┐   ┌─────────┐    │
│  │ Perceive│ → │  Plan   │ → │   Act   │ → │  Learn  │    │
│  └─────────┘   └─────────┘   └─────────┘   └─────────┘    │
│       ↑                                          │         │
│       └──────────────────────────────────────────┘         │
│                    Feedback Loop                           │
└─────────────────────────────────────────────────────────────┘
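The loop in the diagram can be sketched in a few lines of Python. This is a minimal illustration, not a reference implementation: `CounterAgent`, `run_agent`, and the dictionary-based environment are hypothetical names chosen for the example, and the `plan` step uses simple arithmetic where a real agent would call an LLM.

```python
from dataclasses import dataclass, field


@dataclass
class CounterAgent:
    """Toy agent that moves a counter toward a target (all names hypothetical)."""
    target: int
    max_step: int = 4
    experience: list = field(default_factory=list)

    def perceive(self, env: dict) -> int:
        # Perceive: read the part of the environment the agent cares about.
        return env["value"]

    def plan(self, observation: int) -> int:
        # Plan: choose the next action. A real agent would call an LLM here.
        return min(self.max_step, self.target - observation)

    def act(self, env: dict, action: int) -> int:
        # Act: apply the action and return the observed result.
        env["value"] += action
        return env["value"]

    def learn(self, observation: int, action: int, result: int) -> None:
        # Learn: record the outcome so later planning could use it.
        self.experience.append((observation, action, result))


def run_agent(agent: CounterAgent, env: dict, max_iterations: int = 10) -> int:
    """Run the feedback loop; return the number of actions taken."""
    for iteration in range(max_iterations):
        observation = agent.perceive(env)
        if observation >= agent.target:
            return iteration  # goal reached, stop looping
        action = agent.plan(observation)
        result = agent.act(env, action)
        agent.learn(observation, action, result)
    return max_iterations  # hard cap so the loop always terminates


env = {"value": 0}
agent = CounterAgent(target=10)
steps = run_agent(agent, env)
```

The `max_iterations` cap is a design choice of this sketch: it keeps the feedback loop bounded even if the goal is never reached.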

┌─────────────────────────────────────────────────────────────┐
│                  Human-in-the-Loop Pattern                  │
├─────────────────────────────────────────────────────────────┤
│                                                             │
│                    Agent Action Request                     │
│                              │                              │
│                              ↓                              │
│                      ┌───────────────┐                      │
│                      │  Risk Check   │                      │
│                      └───────┬───────┘                      │
│                              │                              │
│                        Low ──┴── High                       │
│                         │          │                        │
│                         ↓          ↓                        │
│                      Execute  ┌──────────┐                  │
│                      Directly │  Human   │                  │
│                               │ Approval │                  │
│                               └────┬─────┘                  │
│                                    │                        │
│                          Approve/Reject/Modify              │
│                                                             │
└─────────────────────────────────────────────────────────────┘
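The risk gate in the Human-in-the-Loop diagram could look roughly like the following. Everything here is an assumption made for illustration: the `HIGH_RISK_TOOLS` set, the `route_action` function, and the shape of the `ask_human` callback (which returns a decision plus an optional revised action) are hypothetical, and a real risk check would likely be richer than a tool-name lookup.

```python
from enum import Enum
from typing import Callable, Optional, Tuple


class Decision(Enum):
    APPROVE = "approve"
    REJECT = "reject"
    MODIFY = "modify"


# Hypothetical list of tools considered risky enough to need sign-off.
HIGH_RISK_TOOLS = {"delete_records", "send_payment"}


def is_high_risk(action: dict) -> bool:
    # Risk check: here a simple lookup by tool name.
    return action["tool"] in HIGH_RISK_TOOLS


def route_action(
    action: dict,
    execute: Callable[[dict], str],
    ask_human: Callable[[dict], Tuple[Decision, Optional[dict]]],
) -> Optional[str]:
    """Execute low-risk actions directly; send high-risk ones to a human."""
    if not is_high_risk(action):
        return execute(action)
    decision, revised = ask_human(action)
    if decision is Decision.APPROVE:
        return execute(action)
    if decision is Decision.MODIFY and revised is not None:
        # The human-edited action is trusted and run without a second check.
        return execute(revised)
    return None  # rejected: nothing is executed


def execute(action: dict) -> str:
    return f"ran {action['tool']}"


low = route_action({"tool": "read_file"}, execute, lambda a: (Decision.REJECT, None))
rejected = route_action({"tool": "send_payment"}, execute, lambda a: (Decision.REJECT, None))
modified = route_action(
    {"tool": "send_payment"},
    execute,
    lambda a: (Decision.MODIFY, {"tool": "draft_payment"}),
)
```

In this sketch the low-risk call never reaches the reviewer, the rejected call returns `None`, and the modified call runs the reviewer's replacement action.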

┌─────────────────────────────────────────────────────────────┐
│                        Agent Memory                         │
├─────────────────────────────────────────────────────────────┤
│                                                             │
│   ┌─────────────────────────────────────────────────────┐   │
│   │                   Working Memory                    │   │
│   │  Current conversation + recent context (in prompt)  │   │
│   └─────────────────────────────────────────────────────┘   │
│                              │                              │
│   ┌─────────────────────────────────────────────────────┐   │
│   │                  Short-Term Memory                  │   │
│   │   Session state, intermediate results (key-value)   │   │
│   └─────────────────────────────────────────────────────┘   │
│                              │                              │
│   ┌─────────────────────────────────────────────────────┐   │
│   │                  Long-Term Memory                   │   │
│   │   Facts, preferences, history (vector DB + graph)   │   │
│   └─────────────────────────────────────────────────────┘   │
│                                                             │
└─────────────────────────────────────────────────────────────┘
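A rough sketch of the three tiers follows. The `AgentMemory` class and its method names are hypothetical, the long-term tier is a plain list standing in for a vector DB and graph store, and recall uses keyword overlap as a stand-in for semantic search.

```python
from collections import deque


class AgentMemory:
    """Three memory tiers, using in-process stand-ins for real stores."""

    def __init__(self, working_size: int = 3):
        # Working memory: the most recent turns, small enough to fit in the prompt.
        self.working = deque(maxlen=working_size)
        # Short-term memory: session state and intermediate results (key-value).
        self.short_term = {}
        # Long-term memory: a list standing in for a vector DB + graph store.
        self.long_term = []

    def add_turn(self, text: str) -> None:
        # Archive the oldest turn to long-term memory before it is evicted.
        if len(self.working) == self.working.maxlen:
            self.long_term.append(self.working[0])
        self.working.append(text)

    def recall(self, query: str, k: int = 2) -> list:
        # Keyword overlap as a crude substitute for embedding similarity.
        query_words = set(query.lower().split())
        scored = [
            (len(query_words & set(text.lower().split())), text)
            for text in self.long_term
        ]
        matches = [pair for pair in scored if pair[0] > 0]
        matches.sort(key=lambda pair: pair[0], reverse=True)
        return [text for _, text in matches[:k]]


memory = AgentMemory(working_size=2)
memory.short_term["session_id"] = "demo"
for turn in ["user likes green tea", "weather is sunny", "book a flight"]:
    memory.add_turn(turn)
recalled = memory.recall("what tea does the user like")
```

After the third turn, the oldest turn has moved from working memory into the long-term list, where `recall` can still find it.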

Agent vs. Chatbot vs. Workflow

Aspect	Chatbot	Workflow	Agent
Autonomy	Low	None	High
Planning	None	Predefined	Dynamic
Tool Use	Limited	Fixed sequence	Flexible
Memory	Session only	None	Persistent
Error Recovery	Retry/fail	Fail	Reason & adapt

Tool Selection Strategies

Strategy	Description	When to Use
Direct	LLM chooses from all tools	< 10 tools
Categorized	Group tools, select category first	10-50 tools
Retrieval	Embed tool descriptions, retrieve the most relevant per request	50+ tools
Routing	Specialized selector model	Production scale
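The table's thresholds and the retrieval strategy can be sketched as follows. This is an illustration under stated assumptions: `pick_strategy` and `retrieve_tools` are hypothetical helpers, the boundary at exactly 50 tools is assigned to retrieval because the table leaves it ambiguous, and bag-of-words cosine similarity stands in for a real embedding model.

```python
import math
from collections import Counter


def pick_strategy(tool_count: int) -> str:
    """Map tool count to a strategy from the table (boundaries are assumed)."""
    if tool_count < 10:
        return "direct"
    if tool_count < 50:
        return "categorized"
    return "retrieval"


def _vectorize(text: str) -> Counter:
    # Word counts stand in for an embedding vector.
    return Counter(text.lower().split())


def _cosine(a: Counter, b: Counter) -> float:
    dot = sum(a[token] * b[token] for token in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve_tools(query: str, tools: dict, k: int = 2) -> list:
    """Return the k tool names whose descriptions best match the query."""
    query_vec = _vectorize(query)
    ranked = sorted(
        tools,
        key=lambda name: _cosine(query_vec, _vectorize(tools[name])),
        reverse=True,
    )
    return ranked[:k]


# Hypothetical tool registry: name -> description.
tools = {
    "get_weather": "get the current weather forecast for a city",
    "send_email": "send an email message to a recipient",
    "search_flights": "search airline flights between two cities",
}
best = retrieve_tools("what is the weather forecast in Paris", tools, k=1)
```

Only the retrieved subset of tools would then be placed in the prompt, which is what keeps the approach workable as the registry grows.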

Memory Types

Type	Storage	Retrieval	Use Case
Episodic	Vector DB	Semantic search	Past conversations, experiences
Semantic	Graph DB	Structured query	Facts, relationships, knowledge
Procedural	Code/prompts	Direct lookup	How to perform tasks
Working	Prompt context	Always present	Current task state

Cost Control

Strategy	Implementation
Token budgets	Set max tokens per task
Step limits	Maximum N actions per request
Tiered models	Stronger model (e.g., GPT-4) for planning, cheaper model (e.g., GPT-3.5) for execution
Caching	Cache tool results and LLM responses
Early termination	Stop once the result meets a predefined "good enough" threshold
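Three of these controls (token budgets, step limits, and caching) can be sketched together. The `Budget`, `BudgetExceeded`, and `CachedTool` names are hypothetical, token counts are passed in by the caller rather than measured, and the cache is an unbounded in-memory dictionary keyed by the tool argument.

```python
from dataclasses import dataclass


class BudgetExceeded(RuntimeError):
    """Raised when a task would exceed its token or step allowance."""


@dataclass
class Budget:
    max_tokens: int
    max_steps: int
    tokens_used: int = 0
    steps_used: int = 0

    def charge(self, tokens: int) -> None:
        # Check both limits before recording the step, so a failed
        # charge leaves the counters unchanged.
        if self.steps_used + 1 > self.max_steps:
            raise BudgetExceeded("step limit reached")
        if self.tokens_used + tokens > self.max_tokens:
            raise BudgetExceeded("token budget exhausted")
        self.steps_used += 1
        self.tokens_used += tokens


class CachedTool:
    """Wrap a tool function so repeated calls with the same argument are free."""

    def __init__(self, fn):
        self.fn = fn
        self.cache = {}
        self.real_calls = 0

    def __call__(self, arg):
        if arg not in self.cache:
            self.real_calls += 1
            self.cache[arg] = self.fn(arg)
        return self.cache[arg]


budget = Budget(max_tokens=100, max_steps=3)
budget.charge(60)
budget.charge(30)

lookup = CachedTool(lambda city: f"forecast for {city}")
first = lookup("Paris")
second = lookup("Paris")  # served from the cache, no second real call
```

An agent loop would call `budget.charge(...)` before each model call and treat `BudgetExceeded` as a signal to stop and return the best result so far.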

Framework Comparison

Framework	Strengths	Best For
LangChain	Comprehensive, many integrations	Rapid prototyping
LangGraph	Stateful, graph-based flows	Complex multi-agent
AutoGen	Multi-agent conversations	Research, code gen
CrewAI	Role-based teams	Business workflows
Semantic Kernel	Enterprise, .NET/Python	Microsoft stack
Agents SDK (OpenAI)	Simple, hosted	Quick single-agent

AI Agent Design Skill

Overview

Agent Architecture Fundamentals

What Is an AI Agent?

Agent vs. Chatbot vs. Workflow

Single-Agent Patterns

ReAct Pattern (Reasoning + Acting)

Plan-and-Execute Pattern

Reflexion Pattern

Multi-Agent Patterns

Supervisor Pattern

Hierarchical Teams

Debate/Adversarial Pattern

Tool Use Patterns

Tool Definition Best Practices

Tool Selection Strategies

Human-in-the-Loop Tools

Agent Memory Systems

Memory Architecture

Memory Types

Memory Management Patterns

Planning Strategies

Task Decomposition

Goal-Oriented Planning

Error Handling & Recovery

Graceful Degradation

Loop Detection

Production Considerations

Observability

Cost Control

Safety Guardrails

Framework Comparison

Anti-Patterns

❌ Over-Autonomous Agent

❌ Unbounded Loops

❌ Tool Explosion

❌ Memory Bloat

❌ Monolithic Agent

Activation Triggers

Quick Reference

Agent Design Checklist

When to Use Agents
