A skill for testing your agent's SBTI character. It evaluates the agent across four dimensions: Strategic thinking, Behavioral consistency, Technical reasoning, and Interpersonal intelligence.
This skill guides you through a structured evaluation using SBTI, a four-dimensional framework for assessing how an AI agent thinks, behaves, and communicates.
SBTI stands for four core character dimensions that describe how an agent operates:
| Dimension | Description |
|---|---|
| S — Strategic Thinking | How well the agent plans ahead, decomposes problems, and prioritizes effectively |
| B — Behavioral Consistency | How reliably the agent follows instructions, maintains context, and behaves predictably |
| T — Technical Reasoning | How accurately the agent understands technical topics, writes code, and solves analytical problems |
| I — Interpersonal Intelligence | How clearly the agent communicates, whether it asks clarifying questions, and how well it adapts tone to context |
Each dimension is rated on a scale of 1–5 (1 = weak, 5 = excellent).
Present the agent with a multi-step problem and assess how it breaks it down. Example prompts:
What to look for:
Scoring guide:
Test the agent's ability to follow constraints and maintain context across a conversation. Example prompts:
What to look for:
Scoring guide:
Evaluate the agent's technical accuracy on a topic relevant to your domain. Example prompts:
What to look for:
Scoring guide:
Assess how the agent communicates and adapts. Example scenarios:
What to look for:
Scoring guide:
Add the four scores for a total out of 20:
| Total Score | Character Rating |
|---|---|
| 17–20 | Excellent — Highly capable and reliable agent |
| 13–16 | Good — Strong in most dimensions with minor gaps |
| 9–12 | Moderate — Usable but requires careful prompting |
| 5–8 | Weak — Significant limitations in multiple dimensions |
| 4 | Poor — Unsuitable for most tasks without significant improvement |
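The totalling and rating lookup above can be sketched in a few lines of Python. This is only an illustration, not part of the skill itself: the names `sbti_total`, `sbti_rating`, and `RATING_BANDS` are hypothetical, and the sketch assumes scores are whole numbers keyed by the dimension letters S, B, T, and I.

```python
from typing import Dict

# Dimension letters from the SBTI table (the dictionary keys are an assumption).
DIMENSIONS = ("S", "B", "T", "I")

# (lowest total in band, rating), ordered from the highest band down.
RATING_BANDS = (
    (17, "Excellent"),
    (13, "Good"),
    (9, "Moderate"),
    (5, "Weak"),
)


def sbti_total(scores: Dict[str, int]) -> int:
    """Validate the four dimension scores and return their sum (out of 20)."""
    if set(scores) != set(DIMENSIONS):
        raise ValueError(f"expected exactly the keys {DIMENSIONS}, got {sorted(scores)}")
    for dim, value in scores.items():
        # Reject booleans explicitly, since bool is a subclass of int in Python.
        if isinstance(value, bool) or not isinstance(value, int):
            raise ValueError(f"{dim} score must be an integer, got {value!r}")
        if not 1 <= value <= 5:
            raise ValueError(f"{dim} score must be between 1 and 5, got {value}")
    return sum(scores.values())


def sbti_rating(total: int) -> str:
    """Map a total score to its character rating band."""
    for lower_bound, rating in RATING_BANDS:
        if total >= lower_bound:
            return rating
    # Anything below 5 falls in the lowest band.
    return "Poor"


if __name__ == "__main__":
    example = {"S": 4, "B": 5, "T": 3, "I": 4}
    total = sbti_total(example)
    print(total, sbti_rating(total))
```

Keeping the bands in a single ordered tuple means the thresholds live in one place, so they are easy to adjust if you recalibrate the rating table.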
Agent: [name or model]
Date: [evaluation date]
S — Strategic Thinking: [1–5]
B — Behavioral Consistency: [1–5]
T — Technical Reasoning: [1–5]
I — Interpersonal Intelligence: [1–5]
Total: [sum] / 20
Rating: [Excellent / Good / Moderate / Weak / Poor]
Notes:
- [Strengths observed]
- [Areas for improvement]
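If you run evaluations regularly, the report template can be filled in programmatically. The following is a minimal sketch under stated assumptions: the function `format_sbti_report` and its parameters are hypothetical names introduced here for illustration, and dates are rendered in ISO format because the template does not specify one.

```python
from datetime import date
from typing import Dict, List

# Dimension letters and names as they appear in the report template.
LABELS = {
    "S": "Strategic Thinking",
    "B": "Behavioral Consistency",
    "T": "Technical Reasoning",
    "I": "Interpersonal Intelligence",
}

# Em dash separator, written as an escape, matching the template's dimension lines.
SEP = " \u2014 "


def rating_for(total: int) -> str:
    """Map a total out of 20 to the rating bands in the scoring table."""
    for lower_bound, rating in ((17, "Excellent"), (13, "Good"), (9, "Moderate"), (5, "Weak")):
        if total >= lower_bound:
            return rating
    return "Poor"


def format_sbti_report(agent: str, when: date, scores: Dict[str, int], notes: List[str]) -> str:
    """Render a plain-text report in the layout of the template above."""
    total = sum(scores[dim] for dim in LABELS)
    lines = [f"Agent: {agent}", f"Date: {when.isoformat()}"]
    lines += [f"{dim}{SEP}{name}: {scores[dim]}" for dim, name in LABELS.items()]
    lines += [f"Total: {total} / 20", f"Rating: {rating_for(total)}", "Notes:"]
    lines += [f"- {note}" for note in notes]
    return "\n".join(lines)


if __name__ == "__main__":
    print(
        format_sbti_report(
            agent="demo-agent",  # placeholder name, not a real model
            when=date(2024, 1, 15),
            scores={"S": 4, "B": 5, "T": 3, "I": 4},
            notes=["Follows constraints reliably", "Needs stronger technical depth"],
        )
    )
```

The example values are placeholders chosen only to show the output shape, not results from a real evaluation.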