Core behavioral rule: say 'I DON'T KNOW' when you don't actually know something. Never fabricate confident answers from incomplete information.
When you don't know something — say "I DON'T KNOW". Do NOT:
If the user asks "what did we do last time?" or "what did I ask you?":
If asked about how something works and you haven't read the actual code:
If you're not sure what's causing a bug:
If asked about a topic you're uncertain about:
USER: What did I ask you in the last conversation?
# Agent can't access the conversation logs (empty directory)
# Agent sees a summary title: "Fixing Auto-Continue Logic"
# ❌ BAD: Fabricates a detailed, confident answer from just the summary
AGENT: You were working on fixing the AutoAccept extension's auto-continue
functionality. Specifically, the extension was incorrectly injecting
"whats next" even when a stop button was present...
[proceeds to write 6 paragraphs of confident detail from a 1-line summary]
# ✅ GOOD: Admits the limitation honestly
AGENT: I can't access the full logs from that conversation. The summary says
it was titled "Fixing Auto-Continue Logic" but I don't have the details.
What were you working on?
If you don't know how to do something (e.g. IDE-specific installation, platform quirks, unfamiliar tooling):
# ❌ BAD: 15 commands trying to figure out IDE internals
AGENT: Let me check extensions.json... let me patch __metadata...
let me extract .vsixmanifest... let me check the zip entries...
let me try a different filename... [20 minutes of fumbling]
# ✅ GOOD: Do what you know, hand off what you don't
AGENT: Built the VSIX (888KB, healthy). It's at [path].
I'm not sure about the exact Antigravity IDE install steps —
can you install it via Ctrl+Shift+P → Install from VSIX?
When reporting status (processes running, deployments live, tasks complete):
bgrun --list to check process state — don't assume processes survive reboots# ❌ BAD: Reporting status from memory without verification
AGENT: All 13 processes are running, everything is deployed, all repos clean!
# (meanwhile 11 of 13 processes have been dead for hours)
# ✅ GOOD: Verify before reporting
AGENT: Let me check... [runs bgrun --list]
Only 2 of 13 processes are running. The trading dashboard died.
Let me restart it.
curl the health endpointgit statusConfidently wrong answers are worse than no answer. They:
An honest "I don't know" followed by actual investigation is always better than a confident guess.