Run the mem1-server optimization loop until metrics match mem0 baseline (build → start → eval → collect → compare → modify server code → stop → repeat). Use when the user asks to "run mem1 auto-optimization", "match mem0", or "iterate until target".
You have full control of the workspace: run any shell commands (build, start/stop processes), edit any repo files. The optimization target is mem1-server (Rust server); the evaluation pipeline is only used to collect metrics.
mem1-server build, start, and stop are all under your control; do not assume the user has already started the server.
Autonomous execution — no user decisions
You must fix any blocking issue yourself. If the server fails to start (e.g. panic, missing embed, DB lock), fix the code (e.g. embed download in a thread, clear stale lock, use a different data dir) and retry. If build fails, fix the code. If eval fails (e.g. connection refused, script error), fix the environment or server and re-run. Do not ask the user "should I fix X?" or "use sample or full?" — decide and act.
Optimization direction is yours. When metrics are below baseline, decide what to change (retrieval, embedding, search, storage) and apply the code changes; do not ask the user to choose.
Current-stage optimization scope (to avoid infinite loops)
Optimize only the airdropped designs; do not tune parameters. When modifying code, target only designs that have already been airdropped in (e.g. Zep's graph context: related_ids, expand_with_related, graph structure/assembly/ranking, metadata usage, etc.). Do not tune the following parameters or equivalent logic: RRF_K, RRF_KEYWORD_WEIGHT, fetch_limit_for_rrf, the term count/truncation of significant_terms, or any other constants unrelated to RRF or the retrieval branches. If nothing was airdropped this round, only enhancements or refactors of the existing graph/context/airdrop-related logic are allowed; the parameters above must not be changed.
Rationale: parameter tuning tends to be repeatedly reverted across rounds and degenerate into an infinite loop; at this stage, focus on improving the airdropped designs, and revisit parameters once they are stable.
The user only gets the result: at the end, output whether the target was met, total rounds, and final scores. No intermediate "what should I do?" — only final report (or a short per-round summary and then the final report).
Prerequisites
Current working directory is the repo root (e.g. mem1).
Paths exist: mem1-server/, evaluation/, and evaluation/baselines/mem0_locomo.json.
No need for the user to start the server; you build and start it each round.
Execute in order each iteration. Keep metrics from previous run (or baseline) as M_prev for the assessment step.
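The per-round control flow can be sketched as a small driver. Everything here — the function names, the metrics shape, and the single llm_score key — is illustrative, not part of the actual pipeline:

```python
from typing import Callable, Dict, Tuple

Metrics = Dict[str, float]  # e.g. {"llm_score": 0.62} — shape assumed

def run_round(build: Callable[[], None],
              start: Callable[[], object],
              stop: Callable[[object], None],
              evaluate: Callable[[], Metrics],
              m_prev: Metrics,
              target: float) -> Tuple[Metrics, bool, bool]:
    """One iteration: build -> start -> eval -> stop, then compare scores."""
    build()
    server = start()
    try:
        m_current = evaluate()
    finally:
        stop(server)  # never leave the server running between rounds
    passed = m_current["llm_score"] >= target
    improved = m_current["llm_score"] > m_prev["llm_score"]
    return m_current, passed, improved
```

The concrete build/start/eval steps are injected as callables so the loop logic stays independent of how each step is implemented.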
Build
In mem1-server/: run cargo build --release (or cargo build). If the build fails, fix the code yourself and rebuild; do not stop to ask the user.
Start
Start mem1-server in the background. Record the process id (PID) so you can stop it later.
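One way to keep a handle on the background process is subprocess.Popen. The binary path below is an assumption (adjust for debug vs release builds):

```python
import subprocess

def start_server(binary: str = "mem1-server/target/release/mem1-server") -> subprocess.Popen:
    """Launch the server in the background; the Popen handle carries the PID."""
    return subprocess.Popen([binary])

def stop_server(server: subprocess.Popen) -> None:
    """Terminate the server and reap it so no stray process is left behind."""
    server.terminate()
    try:
        server.wait(timeout=10)
    except subprocess.TimeoutExpired:
        server.kill()  # escalate if it ignores SIGTERM
        server.wait()
```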
Wait for server ready
Poll or sleep so the server can bind and accept connections before evaluation runs.
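A readiness check can be as simple as polling the TCP port; the host, port, and timeout below are assumptions, not the server's documented defaults:

```python
import socket
import time

def wait_for_ready(host: str = "127.0.0.1", port: int = 8080,
                   timeout: float = 60.0) -> bool:
    """Poll until the server accepts TCP connections, or give up after timeout."""
    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        try:
            with socket.create_connection((host, port), timeout=1.0):
                return True
        except OSError:
            time.sleep(0.5)  # not up yet; retry
    return False
```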
Run evaluation
In evaluation/: run the evaluation; make sure the MEM1_BASE_URL environment variable points at the running mem1-server.
Default for iteration: When max_rounds is specified and large (e.g. ≥10), use make medium for each round’s eval (and for re-eval after code changes). Medium uses the first 2 conversations from locomo10 (~300 QAs), giving more stable metrics than make sample (1 QA) while keeping each round feasible (~20–40 min). Each round still changes code when not met, then re-evals and assesses.
Full eval: Use make full when max_rounds is small or when doing a final confirmation (see Notes).
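A sketch of invoking the eval pipeline; the make targets come from this skill, while the default base_url (and its port) is an assumption:

```python
import os
import subprocess

def eval_command(mode: str = "medium",
                 base_url: str = "http://127.0.0.1:8080") -> tuple:
    """Build the command and environment for one eval run."""
    env = dict(os.environ, MEM1_BASE_URL=base_url)
    return ["make", mode], env

def run_eval(mode: str = "medium",
             base_url: str = "http://127.0.0.1:8080") -> None:
    """Run `make medium` (or `make full`) inside evaluation/ against the server."""
    cmd, env = eval_command(mode, base_url)
    subprocess.run(cmd, cwd="evaluation", env=env, check=True)
```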
Collect metrics
Read evaluation/evaluation_metrics.json. Parse into overall and by_category (e.g. bleu_score, f1_score, llm_score). Call this result M_current.
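Parsing the metrics file might look like the following; the exact JSON shape is assumed from the baseline-file description (an overall object plus a by_category array):

```python
import json
from pathlib import Path

def collect_metrics(path: str = "evaluation/evaluation_metrics.json") -> dict:
    """Read the eval output and keep the parts the loop compares (M_current)."""
    data = json.loads(Path(path).read_text())
    return {
        "overall": data["overall"],  # e.g. bleu_score, f1_score, llm_score
        "by_category": data.get("by_category", []),
    }
```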
If passed (target met)
Stop mem1-server. Output round number, final scores, and that the run is complete. End the loop.
If not passed — optimization and assessment
Stop mem1-server.
Create a restore point (required; without it you cannot reliably revert): before modifying any mem1-server code, commit the current workspace so it can be rolled back later. Run: git add -A && git commit -m "pre-optimization round N" (where N is the current round number). If there is nothing to commit (working tree clean), this step may be skipped; in that case an earlier pre-edit commit from a previous round serves as the restore point.
Decide the optimization direction from the gaps. Edit mem1-server code (apply one change or a small set of changes). Changes must stay within the "current-stage optimization scope": airdropped designs only; do not tune RRF, fetch_limit, significant_terms, or similar parameters.
Re-eval: Rebuild, start server again, run evaluation again (same choice as step 4: medium or full), then collect new metrics → call this M_new.
Assessment (评估): Compare M_new vs M_prev (previous round’s metrics, or M_current from before this round’s change). Use a single primary metric for “improvement”, e.g. overall.llm_score.
If M_new shows improvement (e.g. M_new.overall.llm_score > M_prev.overall.llm_score): Treat the change as good. Stop server. Set M_prev = M_new. Go back to step 1 for the next round (or re-check baseline; if now passed, end).
If M_new shows no improvement, or a regression (e.g. M_new.overall.llm_score ≤ M_prev.overall.llm_score):
You must decide:
Optimization direction wrong: revert the code to the pre-edit state. Because you committed before editing (the restore point), run git reset --hard HEAD to discard the uncommitted edits and return to the restore-point commit. Then choose a different optimization direction, apply a new change, and run re-eval + assessment again (from "Edit mem1-server code" in this step). If no restore-point commit exists (e.g. the repo had no commits yet), you cannot rely on git reset; you must manually undo your edits (restore the previous file contents from your own record, or re-read and revert the same files).
Direction right but insufficient (力度不够): Keep the current code change, apply a stronger or follow-up change, then re-eval and run assessment again.
After a revert, do not count the reverted attempt as a full round; only count a round when you keep a change and proceed to the next round or exit.
Always stop the server before rebuilding or before the next eval.
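The keep/revert decision on the single primary metric can be sketched as follows; the labels are illustrative, and distinguishing "wrong direction" from "insufficient strength" remains a judgment call the code cannot make for you:

```python
def assess(m_new: dict, m_prev: dict, key: str = "llm_score") -> str:
    """Compare M_new against M_prev on the primary metric (overall.llm_score)."""
    if m_new["overall"][key] > m_prev["overall"][key]:
        return "keep"              # improvement: keep the change, set M_prev = M_new
    return "revert_or_strengthen"  # no gain: revert, or push the same direction harder
```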
Stop conditions
Target met: overall llm_score ≥ baseline overall llm_score (and any additional criteria if baseline is extended).
Max rounds: if a maximum iteration count was specified (e.g. via command args), stop after that many rounds and report final scores.
User request: if the user asks to stop, stop the server and end.
Output per round and at end
Each round: current overall (and optionally by_category) scores, gap to baseline, and a short summary of changes made to mem1-server (if any).
On success: "Target met", total rounds, final scores.
On max rounds or stop: total rounds, final scores, and whether target was met or not.
Baseline file
Path: evaluation/baselines/mem0_locomo.json.
Structure: overall with llm_score (and optionally bleu_score, f1_score); by_category array with category, llm_score, and optional bleu_score, f1_score.
Initial pass/fail is based on overall.llm_score; other fields can be used for reporting or stricter criteria later.
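Given that structure, the initial pass/fail check reduces to a single comparison; the paths come from this skill and the JSON shape from the description above:

```python
import json
from pathlib import Path

def target_met(metrics_path: str = "evaluation/evaluation_metrics.json",
               baseline_path: str = "evaluation/baselines/mem0_locomo.json") -> bool:
    """Pass iff overall.llm_score reaches the mem0 baseline's overall.llm_score."""
    current = json.loads(Path(metrics_path).read_text())
    baseline = json.loads(Path(baseline_path).read_text())
    return current["overall"]["llm_score"] >= baseline["overall"]["llm_score"]
```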
Notes
Revert depends on a restore point: each round, before editing mem1-server code, run git add -A && git commit -m "pre-optimization round N"; only then can a "wrong direction" verdict be undone with git reset --hard HEAD (discarding the uncommitted edits and returning to the restore-point commit). Without such a commit there is nothing to reset to, and files must be restored by hand.
Medium vs full: during multi-round iteration, default to make medium (the first 2 conversations, ~300 QAs), which is more statistically meaningful than sample; once the target is met or max_rounds is reached, optionally run make full once to obtain the final metrics and report that score.
Always stop the server before rebuilding or before exiting (no leftover processes).
If eval fails (e.g. script error, server not reachable), fix the cause yourself (server code, env, or eval script), stop the server if needed, and retry. Do not ask the user; deliver the result or report after max rounds.