Browse, search, and summarize experiment artifacts stored in S3. Use when the user wants to see what checkpoints, datasets, or results are available from past experiments, filter by date or keywords, or manage storage.
You are helping the user explore and understand what experiment artifacts are stored in their S3 bucket. This is a lightweight, local-only skill — no EC2 instances are launched.
rl-experiments-sujaius-east-2s3://rl-experiments-sujai/
experiments/
<experiment-name>/
checkpoints/ ← model weights, optimizer state
logs/ ← raw training logs
data/ ← datasets, preprocessed data
config.yaml ← experiment config snapshot
metadata.json ← auto-generated metadata (if available)
The user's query can be:
muon, reward, 7b)Parse what the user is asking for. Map to one of these actions:
# List all top-level experiments with dates and sizes
aws s3 ls s3://rl-experiments-sujai/experiments/ --recursive --summarize
# Or for just top-level experiment names:
aws s3 ls s3://rl-experiments-sujai/experiments/
# List all files in an experiment with sizes
aws s3 ls s3://rl-experiments-sujai/experiments/<name>/ --recursive --human-readable --summarize
# Get total bucket size
aws s3 ls s3://rl-experiments-sujai/ --recursive --summarize
# Get per-experiment sizes using a script
aws s3api list-objects-v2 --bucket rl-experiments-sujai --prefix experiments/ --query 'Contents[].{Key:Key,Size:Size,Modified:LastModified}' --output json
══════════════════════════════════════════════════════════════
S3 Experiments: rl-experiments-sujai
══════════════════════════════════════════════════════════════
Experiment Size Files Last Modified
─────────────────────────────────────────────────────────────
adam-vs-muon-sweep-v1 2.3 GB 24 2026-04-01
reward-shaping-7b 48.1 GB 156 2026-03-28
grpo-baseline 12.7 GB 89 2026-03-25
─────────────────────────────────────────────────────────────
Total: 63.1 GB across 3 experiments ($1.45/month)
══════════════════════════════════════════════════════════════
══════════════════════════════════════════════════════════════
Experiment: reward-shaping-7b
Last modified: 2026-03-28
Total size: 48.1 GB (156 files)
══════════════════════════════════════════════════════════════
Contents:
checkpoints/ 42.0 GB 12 files
├── checkpoint-epoch-10.pt 3.5 GB 2026-03-27
├── checkpoint-epoch-20.pt 3.5 GB 2026-03-27
├── ...
└── checkpoint-final.pt 3.5 GB 2026-03-28
logs/ 1.2 GB 89 files
data/ 4.9 GB 3 files
├── train.jsonl 3.2 GB
├── val.jsonl 1.5 GB
└── test.jsonl 0.2 GB
config.yaml 0.1 KB 1 file
══════════════════════════════════════════════════════════════
Estimated cost: $1.11/month (Standard tier)
To use these checkpoints, run:
/run-experiment resume-v1 --from-checkpoint s3://rl-experiments-sujai/experiments/reward-shaping-7b/checkpoints/checkpoint-final.pt
══════════════════════════════════════════════════════════════
══════════════════════════════════════════════════════════════
S3 Storage Report: rl-experiments-sujai
══════════════════════════════════════════════════════════════
Total: 63.1 GB ($1.45/month at Standard, auto-tiering enabled)
By experiment:
reward-shaping-7b 48.1 GB (76%) 28 days old → moving to IA soon
grpo-baseline 12.7 GB (20%) 7 days old
adam-vs-muon-sweep-v1 2.3 GB (4%) 1 day old
By type:
Checkpoints 52.3 GB (83%)
Datasets 8.1 GB (13%)
Logs 2.7 GB (4%)
Suggestions:
• reward-shaping-7b has 12 checkpoints — consider keeping only top 3
• Deleting intermediate checkpoints would save 29.4 GB ($0.68/month)
══════════════════════════════════════════════════════════════
If the user is searching for experiments by characteristics (e.g., "which experiments used Muon optimizer?" or "experiments with R² > 0.8"), cross-reference with W&B:
import wandb
api = wandb.Api()
# Search across projects for matching runs
runs = api.runs("<entity>/<project>", filters={...})
This helps connect "this S3 checkpoint came from that W&B run" — the link is the experiment name which appears in both places.
If the user asks to delete or clean up:
aws s3 rm --recursive for directories# Show what would be deleted
aws s3 ls s3://rl-experiments-sujai/experiments/<name>/ --recursive --human-readable --summarize
# After user confirms:
aws s3 rm s3://rl-experiments-sujai/experiments/<name>/ --recursive
For selective cleanup (e.g., keep only best checkpoint):
# List checkpoints
aws s3 ls s3://rl-experiments-sujai/experiments/<name>/checkpoints/ --human-readable
# Delete all except the one the user wants to keep
/run-sweep or run names from /run-experimentpython (conda) not python3 if you need to query W&B API