Framework for analyzing control architecture in agentic AI systems, contrasting Cartesian agency with brain-like integrated control. Design patterns for autonomy-robustness-oversight tradeoffs.
Framework for analyzing control architecture in agentic AI systems, contrasting Cartesian agency (learned core + engineered runtime) with brain-like integrated control. Provides design patterns for trading off autonomy, robustness, and oversight.
LLM agents implement a Cartesian split: a learned core (the LLM) coupled to an engineered runtime via a symbolic interface. This contrasts with biological brains, which embed prediction within layered feedback controllers calibrated by action consequences.
Cartesian Agency (Current LLM Agents):
┌─────────────┐ Symbolic ┌─────────────────┐
│ Learned │◄────Interface────►│ Engineered │
│ Core │ (Text/JSON) │ Runtime │
│ (LLM) │ │ (Tools/Env) │
└─────────────┘ └─────────────────┘
▲ │
└────────── Prediction ────────────┘
Brain-like Integrated Agency:
┌─────────────────────────────────────────────────────┐
│ Layered Feedback Controllers │
│ ┌─────────┐ ┌─────────┐ ┌─────────┐ │
│ │ Layer 1 │─▶│ Layer 2 │─▶│ Layer 3 │─▶ Action │
│ │(Reflex) │ │(Motor) │ │(Planning)│ │
│ └────┬────┘ └────┬────┘ └────┬────┘ │
│ └─────────────┴─────────────┘ │
│ Prediction embedded in control │
└─────────────────────────────────────────────────────┘
The separation between:
| Aspect | Cartesian | Integrated (Brain-like) |
|---|---|---|
| Bootstrapping | ✅ Easy to build | ❌ Hard to initialize |
| Modularity | ✅ Clear separation | ❌ Tightly coupled |
| Governance | ✅ External oversight | ❌ Opaque control |
| Sensitivity | ❌ Brittle to perturbations | ✅ Robust adaptation |
| Bottlenecks | ❌ Interface limitations | ✅ Fluid control flow |
| Autonomy | ❌ Constrained by interface | ✅ Emergent behavior |
Concept: Constrain the Cartesian cut to well-defined, limited scope.
┌─────────────────────────────────────────┐
│ Bounded Service │
│ ┌─────────┐ ┌─────────────────┐ │
│ │ LLM │◄────►│ Fixed Runtime │ │
│ │ Core │ │ (Limited API) │ │
│ └─────────┘ └─────────────────┘ │
│ │
│ Scope: Specific task, limited tools │
│ Example: Code completion, SQL query gen │
└─────────────────────────────────────────┘
Characteristics:
Use When:
Concept: Expand the Cartesian cut with richer interfaces while maintaining separation.
┌──────────────────────────────────────────────┐
│ Cartesian Agent │
│ ┌─────────┐ Rich ┌─────────────┐ │
│ │ LLM │◄───Interface──►│ Extensible │ │
│ │ Core │ (Multi-modal)│ Runtime │ │
│ └─────────┘ └─────────────┘ │
│ ▲ │ │
│ └────── State Tracking ──────┘ │
│ │
│ Scope: General tasks, extensible tools │
│ Example: General assistants, coding agents │
└──────────────────────────────────────────────┘
Characteristics:
Use When:
Concept: Minimize the Cartesian cut by embedding control within the learned system.
┌──────────────────────────────────────────────┐
│ Integrated Agent │
│ ┌───────────────────────────────────────┐ │
│ │ Unified Control Architecture │ │
│ │ ┌─────────┐ ┌─────────┐ ┌───────┐ │ │
│ │ │Predictor│─▶│Controller│─▶│Actor │ │ │
│ │ │ (LLM) │ │(Learned)│ │(Env) │ │ │
│ │ └────┬────┘ └────┬────┘ └───┬───┘ │ │
│ │ └─────────────┴───────────┘ │ │
│ │ Feedback Loop │ │
│ └───────────────────────────────────────┘ │
│ │
│ Scope: Complex adaptive behavior │
│ Example: Embodied agents, robots │
└──────────────────────────────────────────────┘
Characteristics:
Use When:
Autonomy
▲
│
Integrated ◄──┼──► Cartesian
Agents │ Agents
│
Robustness ◄───────┼──────► Oversight
│
Bounded ◄─────┴──────► Cartesian
Services Agents (rich interface)
│
▼
Implementation
Complexity
Layered Feedback: Brains use hierarchical feedback controllers
Prediction as Control: Prediction is embedded within control loops
No Clear Interface: No symbolic boundary between "thinking" and "acting"
| Brain Feature | AI Implementation |
|---|---|
| Layered control | Hierarchical agent architecture |
| Prediction embedded | End-to-end learned policies |
| Feedback calibration | RL with environment feedback |
| Distributed representation | Multi-modal embeddings |
def select_pattern(requirements):
"""
requirements: {
'safety_critical': bool,
'task_scope': 'narrow' | 'broad' | 'open',
'environment': 'static' | 'dynamic',
'oversight_need': 'high' | 'medium' | 'low',
'autonomy_desired': 'low' | 'medium' | 'high'
}
"""
if requirements['safety_critical'] or requirements['oversight_need'] == 'high':
if requirements['task_scope'] == 'narrow':
return 'bounded_services'
else:
return 'cartesian_agents'
if requirements['autonomy_desired'] == 'high':
return 'integrated_agents'
return 'cartesian_agents' # default
For Cartesian Patterns:
# Rich but structured interface
class AgentInterface:
def observe(self, multi_modal_input) -> Perception:
"""Process sensory input"""
pass
def act(self, action_spec) -> ActionResult:
"""Execute action, return outcome"""
pass
def reflect(self, outcome) -> LearningUpdate:
"""Update from action consequences"""
pass
For Integrated Patterns:
# Minimal explicit interface
class IntegratedAgent:
def step(self, observation, internal_state) -> (action, new_state):
"""Unified perception-action loop"""
# Internal state includes "thoughts", "plans", "goals"
# No hard boundary between cognition and action
pass
| System | Pattern | Notes |
|---|---|---|
| ChatGPT Plugins | Bounded Services | Limited tool set, clear API |
| Claude Code | Cartesian Agent | Rich interface, extensible |
| AutoGPT | Cartesian Agent | Tool use loop, stateful |
| Devin | Integrated Agent | Tighter integration, emergent behavior |
read: Read research papers and documentationexec: Run Python code for agent architecture analysiswrite: Save design patterns and implementation resultsedit: Modify agent architecture specificationsWhen analyzing or designing agentic AI systems using the Cartesian cut framework:
Identify the control pattern:
Evaluate design tradeoffs:
Apply neuroscience insights:
Select implementation approach based on requirements:
User: "帮我设计一个智能客服系统的控制架构"
Agent:
1. 识别需求: 客服任务scope较窄,但需要extensible工具
2. 选择模式: Cartesian Agent - rich interface with extensible tools
3. 应用设计模式: 使用多模态接口(stateful)维持separation
4. 输出: 完整的agent specification with Cartesian split
User: "分析Devin的控制架构属于哪种模式"
Agent:
1. 识别Devin的特点: tight integration, emergent behavior
2. 分析模式: Integrated Agent - tight coupling of predictor/controller
3. 对比其他模式: prints design tradeoffs for comparison
4. 输出: 架构分析报告 with pattern classification