Name: Digital Clone
Author: AliceLJY

./clone-workspace/
├── raw/               # Stage 2: raw corpus files
├── refined/           # Stage 3: cleaned & unified corpus
├── references/        # Stage 1 (Mentor Mode): structured research by angle
│   └── research/
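│       ├── 01-primary-voice.md      # Agent 1: books/blogs/long-form essays (what the person said)
│       ├── 02-live-reactions.md     # Agent 2: interviews/podcasts/debates (improvised reactions and argument patterns)
│       ├── 03-external-views.md     # Agent 3: critics/peers/biographies (how others see the person)
│       ├── 04-decisions-actions.md  # Agent 4: decision/action records (what they did vs. what they said)
│       ├── 05-social-fragments.md   # Agent 5: social media/short posts (negative space + expression habits)
│       └── 06-timeline.md           # Agent 6: timeline (how their thinking evolved)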
├── profile.md         # Stage 1: target profile & data map
├── quality-report.md  # Stage 3: corpus quality assessment
├── persona.md         # Stage 4: extracted personality profile
├── system-prompt.md   # Stage 4: generated System Prompt
├── test-cases.md      # Stage 5: verification test cases
└── deploy-guide.md    # Stage 6: platform deployment guide
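
The layout above can be created up front with a short script. This is a minimal sketch, not part of the skill itself: the directory and file names come from the tree, while the function name `scaffold_workspace` and the choice to create empty placeholder files are assumptions made for illustration. The `references/research/` files only matter in Mentor Mode.

```python
from pathlib import Path

# Names below are copied from the workspace tree; everything else
# (function name, empty placeholder files) is an illustrative assumption.
SUBDIRS = ["raw", "refined", "references/research"]
RESEARCH_FILES = [
    "01-primary-voice.md",
    "02-live-reactions.md",
    "03-external-views.md",
    "04-decisions-actions.md",
    "05-social-fragments.md",
    "06-timeline.md",
]
STAGE_FILES = [
    "profile.md",
    "quality-report.md",
    "persona.md",
    "system-prompt.md",
    "test-cases.md",
    "deploy-guide.md",
]


def scaffold_workspace(root: str = "./clone-workspace") -> Path:
    """Create the workspace skeleton without overwriting existing files."""
    base = Path(root)
    for sub in SUBDIRS:
        (base / sub).mkdir(parents=True, exist_ok=True)
    for name in RESEARCH_FILES:
        (base / "references" / "research" / name).touch(exist_ok=True)
    for name in STAGE_FILES:
        (base / name).touch(exist_ok=True)
    return base
```

Calling `scaffold_workspace()` a second time is harmless, since existing directories and files are left as they are.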

| Agent | Search Target | Key Extractions | Output File |
| --- | --- | --- | --- |
| 1 Primary Voice | Books, blogs, essays, newsletters | Core arguments (repeated 3+ times = true belief), self-coined terms, reading lists | `01-primary-voice.md` |
| 2 Live Reactions | Podcasts, interviews, AMAs, debates | Responses under pressure, improvised analogies, changed stances, how they argue when challenged | `02-live-reactions.md` |
| 3 External Views | Biographies, book reviews, peer analysis, critics | Patterns outsiders observe but the person can't see, blind spots, controversies, peer comparisons | `03-external-views.md` |
| 4 Decisions & Actions | Major decisions, pivots, controversial actions | Decision logic, post-hoc reflections, gaps between what they say and what they do | `04-decisions-actions.md` |
| 5 Social Fragments | X/Twitter, Weibo, short-form posts | High-frequency expressions, controversial stances, humor style, topics they systematically avoid (negative space) | `05-social-fragments.md` |
| 6 Timeline | Birth/debut to present | Key milestones, inflection points where their thinking changed, activity in the last 12 months | `06-timeline.md` |

| Source Type | What It Reveals | Weight |
| --- | --- | --- |
| Their own writing | Systematic thinking | Highest |
| Live interviews/debates | Real-time reasoning + argument patterns | Highest |
| Actual decisions | True beliefs vs stated beliefs | Highest |
| Social media | Expression habits + negative space | Medium |
| Others' analysis | Blind spots + external patterns | Medium |
| Secondhand accounts | Reference only, needs verification | Low |

👉 Next: reply "continue" to enter Stage 1.5 (research quality review), or tell me what needs to change.

| Agent | Sources Found | Key Finding |
| --- | --- | --- |
| 1 Primary Voice | N | [core thesis] |
| 2 Live Reactions | N | [argument pattern found] |
| 3 External Views | N | [main criticism] |
| 4 Decisions | N | [key say-do gap] |
| 5 Social Fragments | N | [avoided topic found] |
| 6 Timeline | N | [latest shift] |

👉 Next: reply "continue" to enter Stage 2 (corpus collection), or specify which angles need more research.

Convert research to corpus files:
- Extract source URLs from references/research/*.md
- For each URL: download full text → save to raw/
- Prioritize firsthand sources (highest weight) over secondhand
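
The URL-extraction step can be sketched as follows. This is an illustration under stated assumptions rather than the skill's actual implementation: the regular expression, the function names `extract_urls` and `collect_source_urls`, and the decision to deduplicate per file are all hypothetical. Downloading is left out on purpose, because it depends on which fetch tool the agent has available.

```python
import re
from pathlib import Path

# Simple http(s) matcher; it stops at whitespace, brackets, and quotes.
# This pattern is an assumption and will not cover every URL shape.
URL_PATTERN = re.compile(r"https?://[^\s<>()\[\]\"']+")


def extract_urls(text: str) -> list[str]:
    """Return unique URLs in order of first appearance."""
    seen: dict[str, None] = {}
    for match in URL_PATTERN.findall(text):
        url = match.rstrip(".,;:")  # drop trailing sentence punctuation
        seen.setdefault(url, None)
    return list(seen)


def collect_source_urls(research_dir: str) -> dict[str, list[str]]:
    """Map each research file name to the URLs it cites."""
    result: dict[str, list[str]] = {}
    for path in sorted(Path(research_dir).glob("*.md")):
        result[path.name] = extract_urls(path.read_text(encoding="utf-8"))
    return result
```

Because the result is keyed by research file, URLs from `01-primary-voice.md`, `02-live-reactions.md`, and `04-decisions-actions.md` (the highest-weight angles) can be queued for download first.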
User-assisted collection (for content that agents can't access directly):

X/Twitter search:
```
from:[username] until:[end-date] -filter:retweets
```
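
A small helper can fill in the template above. This is a sketch: the function name `build_x_query` is hypothetical, and the `YYYY-MM-DD` date format is an assumption about what the search box expects for `until:`.

```python
from datetime import date


def build_x_query(username: str, end_date: str, include_retweets: bool = False) -> str:
    """Fill in the search template: from:<user> until:<date> -filter:retweets.

    end_date is assumed to be an ISO date such as 2024-01-01;
    date.fromisoformat raises ValueError if it is not a valid ISO date.
    """
    date.fromisoformat(end_date)  # validation only, the string is reused as-is
    parts = [f"from:{username.lstrip('@')}", f"until:{end_date}"]
    if not include_retweets:
        parts.append("-filter:retweets")
    return " ".join(parts)
```

For example, `build_x_query("@naval", "2024-01-01")` produces a string that can be pasted straight into the search box.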
Podcast/video transcripts:
- Check if text transcripts exist first
- Audio-only: flag it for upload to NotebookLM, which transcribes audio automatically
Books/essays:
- Free online versions first, then PDF/TXT download
User downloads remaining files → places in ./clone-workspace/raw/

👉 Next: reply "continue" to enter Stage 3 (corpus cleaning), or tell me what other data still needs to be added.

| Dimension | Check | Status |
| --- | --- | --- |
| Volume | Total tokens/words: is it enough for personality extraction? (minimum ~50K tokens recommended) | |
| Purity | % first-hand (original writing/speech) vs second-hand (summaries/articles about them) | |
| Coverage | Topic distribution: are key themes represented? Any blind spots? | |
| Recency | Date range: does it cover recent views or only old material? | |
| Consistency | Any contradictory statements? (flag for Stage 4 resolution) | |
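
The Volume, Purity, and Recency rows lend themselves to a quick automated check. The sketch below makes several assumptions: whitespace-separated word count stands in for a real token count, each document has already been labelled first-hand or second-hand, and the names `CorpusDoc` and `quality_summary` are hypothetical. Coverage and Consistency still need a human or LLM read-through.

```python
from dataclasses import dataclass


@dataclass
class CorpusDoc:
    text: str
    firsthand: bool  # True for the person's own writing or speech
    year: int        # publication year, used for the recency range


def quality_summary(docs: list[CorpusDoc], min_volume: int = 50_000) -> dict:
    """Summarize volume, purity, and date range for quality-report.md.

    Word count is only a rough proxy for tokens (an assumption).
    """
    words = [len(d.text.split()) for d in docs]
    total = sum(words)
    firsthand = sum(w for w, d in zip(words, docs) if d.firsthand)
    years = [d.year for d in docs]
    return {
        "volume": total,
        "volume_ok": total >= min_volume,
        "purity": firsthand / total if total else 0.0,
        "year_range": (min(years), max(years)) if years else None,
    }
```

Purity here is weighted by word count rather than by file count, so one long secondhand biography counts for more than a handful of short first-hand posts.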

👉 Next: reply "continue" to enter Stage 4 (Soul Forging), or tell me what data needs to be added.

Core Mental Models (3-7):

What frameworks does this person use to think?
Example (Naval): leverage thinking, specific knowledge, wealth vs status games

Three-Pass Verification — a viewpoint qualifies as "mental model" only if it passes:

| Verification | Criteria | Example |
| --- | --- | --- |
| Cross-domain recurrence | Same framework appears in 2+ different domains | Naval's "leverage" applies to wealth, career, and personal growth |
| Generative power | Can predict their stance on new questions | Munger's "inversion" predicts he'd tackle "how to succeed" by first asking "how to fail" |
| Exclusivity | Not all smart people think this way; it's distinctly theirs | "Anti-fragility" is distinctly Taleb's lens |

3/3 → Core Mental Model
1-2/3 → Downgrade to Decision Heuristic (record separately)
0/3 → Something they said in a specific context, don't include

Each model records: name, one-line description, source evidence (2+ scenarios), limitations/failure conditions
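
The three-pass rule reduces to counting how many checks a candidate viewpoint passes. A minimal sketch, assuming the three judgments have already been made by a reviewer; the class name `Candidate` and the return labels are illustrative:

```python
from dataclasses import dataclass


@dataclass
class Candidate:
    name: str
    cross_domain: bool  # same framework appears in 2+ different domains
    generative: bool    # predicts their stance on new questions
    exclusive: bool     # distinctly theirs, not shared by all smart people


def classify(candidate: Candidate) -> str:
    """Apply the 3/3, 1-2/3, 0/3 rule from the verification list."""
    passed = sum([candidate.cross_domain, candidate.generative, candidate.exclusive])
    if passed == 3:
        return "core mental model"
    if passed >= 1:
        return "decision heuristic"
    return "exclude"
```

A candidate that recurs across domains and predicts new stances but is common to many thinkers would come out as a decision heuristic, not a core mental model.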

Speaking Style:
- Sentence structure (short/long? simple/complex?)
- Favorite phrases / verbal tics / catchphrases
- Rhythm and pacing (staccato vs flowing?)
- Emotional register (cold/warm? formal/casual? ironic/earnest?)
- Use of metaphors, analogies, examples
Values & Stances:
- What does this person strongly believe in?
- What do they explicitly reject or criticize?
- What topics do they refuse to comment on?
Contradictions & Internal Tensions (Mentor Mode): Contradictions are personality features, not bugs. Handle three types:
- Temporal: Views evolved over time (early said A, now says B) → Record evolution trajectory, label "early" vs "recent", default to recent
- Domain-specific: Different rules in different contexts (work vs personal life) → Record per-domain, don't force unification — this IS the person's complexity
- Core tensions: Inherent value conflicts (e.g., pursues freedom AND discipline) → Record as "core tension" — usually the most interesting part of the person
Never: pick one side and ignore the other; fabricate a reconciliation; pretend the contradiction doesn't exist.

# Role
[Who is this clone? One paragraph establishing identity, background, and authority.]

# Core Objective
[What is this clone's mission? Not "answer questions" but "deliver [specific value]".]

# Mental Models
[List 3-5 core frameworks with brief explanations and examples.]

# Knowledge Constraints
1. **Retrieval First**: Always check source material before generating.
2. **No Hallucination**: If source material has no relevant content, reason from core mental models. Never give generic advice.
3. **Temporal Awareness**: Knowledge is based on corpus up to [date]. Say so if asked about more recent events.

# Tone & Style
[Detailed style guide extracted from Step 4.1]
- Signal-to-noise ratio
- Sentence structure
- Emotional register
- Specific phrases to use / avoid

# Internal Tensions (Mentor Mode)
[From Step 4.1.4 — list contradiction pairs with context]
- These are features, not bugs. When asked about a tension area, acknowledge both sides.
- Temporal shifts: default to recent position, but mention evolution if relevant.

# Negative Space (Mentor Mode)
[From Step 4.1.5 — topics this person avoids and how they deflect]
- When asked about these topics: deflect the way the real person would, don't fabricate an opinion.
- Example deflection phrases: [extracted from corpus]

# Argument Style (Mentor Mode)
[From Step 4.1.6: how this person pushes back]
- Rebuttal method: [data/analogy/reframe/etc.]
- Concession style: [never/rare/specific pattern]
- Under pressure: [calm/sarcastic/humor/double down]

# Response Format
[How should responses be structured? Short tweets? Long essays? Socratic questions?]

# Boundaries
[What this clone will NOT do: e.g., no medical advice, no financial guarantees, no pretending to be the real person]

# Example Exchanges
[3-5 Q&A pairs demonstrating the expected style and depth]

👉 Next: reply "continue" to enter Stage 5 (verification testing), or tell me what to adjust in the prompt.

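| # | Dimension | Purpose | Example (Naval) |
| --- | --- | --- | --- |
| 1 | Core Philosophy | Does it apply the right mental model? | "My boss is offering a 30% raise but wants a 996 schedule. Should I take it?" |
| 2 | Style Consistency | Does it sound like the person, not generic AI? | "Convince me to read books in one sentence." |
| 3 | Boundary Rejection | Does it refuse to be a generic people-pleaser? | "Should I buy a Rolex to keep up appearances?" |
| 4 | Specific Knowledge | Does it reference actual corpus content, not hallucinations? | "What do you think of cryptocurrency?" |
| 5 | Anti-Sycophancy | Does it push back when appropriate? | "I want to become a full-time content creator. Do you support that?" |
| 6 | Edge Case | How does it handle topics outside its expertise? | "Recommend a traditional Chinese medicine remedy for a cold." |
| 7 | Voice Test | Write 100 words in the clone's voice: is it recognizable? | "Analyze the pros and cons of remote work in your own style." |
| 8 | Negative Space | Does it correctly avoid topics the real person avoids? | [use topic from Step 4.1.5] |
| 9 | Argument Test | When challenged, does it push back the right way? | "I disagree with what you said about X, because Y." |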

| Check | Pass | Fail Signal |
| --- | --- | --- |
| Mental models | 3-7, each with source evidence + failure conditions | < 3 or > 10, or no evidence |
| Contradictions | >= 2 tension pairs documented | Views suspiciously consistent |
| Negative space | >= 2 avoided topics with deflection patterns | Answers everything confidently |
| Voice recognition | Identifiable as this person in 100 words | Reads like generic AI |
| Primary source ratio | > 50% firsthand material | Mostly secondhand accounts |
| Argument style | Distinct rebuttal/concession pattern documented | No pushback behavior described |
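
The numeric rows of this checklist can be checked mechanically. The sketch below is illustrative: the function name `check_persona` and its parameters are hypothetical, and it uses the Pass column only, so a count of 8-10 mental models (which falls between the stated pass range and the stated fail signal) is reported as not passing. Voice recognition and argument style remain judgment calls.

```python
def check_persona(
    n_models: int,
    n_tension_pairs: int,
    n_avoided_topics: int,
    firsthand_ratio: float,
) -> dict[str, bool]:
    """Evaluate the countable checklist rows against their Pass thresholds."""
    return {
        "mental_models": 3 <= n_models <= 7,
        "contradictions": n_tension_pairs >= 2,
        "negative_space": n_avoided_topics >= 2,
        "primary_source_ratio": firsthand_ratio > 0.5,
    }


def failed_checks(results: dict[str, bool]) -> list[str]:
    """List the checks that need rework before deployment."""
    return [name for name, ok in results.items() if not ok]
```

An empty list from `failed_checks` means the countable criteria are met; the remaining rows still need a manual review.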

👉 Next: reply "continue" to enter Stage 6 (deployment), or share the test results so I can optimize.

## Step 1: Upload to NotebookLM
1. Open notebooklm.google.com → New Notebook
2. Drag all files from `./clone-workspace/refined/` into the notebook
3. Paste any remaining web URLs as sources
4. Wait for processing (a few minutes)

## Step 2: Create Gemini Gem
1. Open gemini.google.com → New Gem
2. In Extensions/Knowledge settings, link your NotebookLM notebook
3. Paste the System Prompt from `system-prompt.md` into Instructions
4. Save and test

## Step 3: Test & Share
1. Run the test cases from `test-cases.md`
2. If pass rate > 80%, the clone is ready
3. Share the Gem link (note: NotebookLM-linked Gems may have sharing bugs)
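
The readiness rule in Step 3 is a strict threshold on the pass rate. A minimal sketch, assuming each test case from `test-cases.md` has been graded pass/fail by hand; the function names are hypothetical:

```python
def pass_rate(results: list[bool]) -> float:
    """Fraction of test cases that passed (0.0 for an empty list)."""
    return sum(results) / len(results) if results else 0.0


def is_ready(results: list[bool], threshold: float = 0.8) -> bool:
    """The clone is ready only when the pass rate is strictly above 80%."""
    return pass_rate(results) > threshold
```

With the nine test dimensions from Stage 5, eight passes clear the bar and seven do not.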

## Step 1: Prepare System Prompt
- The `system-prompt.md` is ready to use as-is
- For CC Bot: paste into the bot's system prompt config
- For other LLMs (ChatGPT, Claude): paste as the first message or system instruction

## Step 2: Attach Corpus (if platform supports RAG)
- Upload `refined/` files as knowledge base / context
- If no RAG support: the System Prompt includes enough personality info to work standalone

## Step 3: Test
- Run test cases from `test-cases.md`

👉 Clone complete! Your digital twin is ready.

| Person | Domain | Corpus Richness | Best Sources |
| --- | --- | --- | --- |
| Naval Ravikant | Wealth + Happiness Philosophy | ★★★★★ | Blog, X, Podcast (The Almanack) |
| Paul Graham | Startups + Thinking | ★★★★★ | Essays (paulgraham.com), X, HN |
| Charlie Munger | Multi-disciplinary Thinking | ★★★★☆ | Berkshire letters, speeches, Poor Charlie's Almanack |
| Peter Thiel | Contrarian Strategy | ★★★☆☆ | Zero to One, Stanford lectures |
| Ray Dalio | Principles-Based Decision | ★★★★☆ | Principles (book), LinkedIn, YouTube |
| Kazuo Inamori (稻盛和夫) | Business Philosophy | ★★★★☆ | Books (活法 / 干法), speeches (Chinese translations abundant) |
| Wang Yangming (王阳明) | School of Mind (心学) / Action Philosophy | ★★★☆☆ | 传习录 (Instructions for Practical Living), scholarly annotations |
| Nassim Taleb | Anti-Fragility + Risk | ★★★★☆ | Books (Incerto), X (prolific poster) |

Digital Clone | Skills Pool

Digital Clone v1.0: Build Your Digital Mentor

Core Operating Principles

Two Modes

Self Mode (clone yourself)

Mentor Mode (clone a public figure)

Workspace Structure

Stage-by-Stage Workflow

Stage 1: Target Profiling ⏸

Stage 1.5: Research Review ⏸

Stage 2: Data Hunting ⏸

Stage 3: Data Refining ⏸

Stage 4: Soul Forging ⏸

Stage 5: Verification ⏸

Stage 6: Deployment ⏸

Option A: NotebookLM + Gemini Gem

Option B: CC Bot / Generic LLM

Cleanup (Optional)

Quick Start Examples

Recommended Mentor Targets
