Skill ファイル

Runpod

Name: Runpod
Author: digitalsamba

Cloud GPU processing via RunPod serverless. Use when setting up RunPod endpoints, deploying Docker images, managing GPU resources, troubleshooting endpoint issues, or understanding costs. Covers all 5 toolkit images (qwen-edit, realesrgan, propainter, sadtalker, qwen3-tts).

digitalsamba896 スター2026/03/22

職業
カテゴリ: クラウド

スキル内容

RunPod Cloud GPU

Run open-source AI models on cloud GPUs via RunPod serverless. Pay-per-second, no minimums.

Setup

# 1. Create account at https://runpod.io
# 2. Add API key to .env
echo "RUNPOD_API_KEY=your_key_here" >> .env

# 3. Deploy any tool with --setup
python tools/image_edit.py --setup
python tools/upscale.py --setup
python tools/dewatermark.py --setup
python tools/sadtalker.py --setup
python tools/qwen3_tts.py --setup

Each --setup command:

Creates a RunPod template from the Docker image
Creates a serverless endpoint with appropriate GPU
Saves the endpoint ID to .env (e.g. RUNPOD_QWEN_EDIT_ENDPOINT_ID)

Available Images

All images are public on GHCR — no authentication needed.

関連 Skill

Runpod | Skills Pool

Local CLI → Upload input to cloud storage → RunPod API → Poll for result → Download output

workersMin: 0    — Scale to zero when idle (no cost)
workersMax: 1    — Max concurrent jobs (increase for throughput)
idleTimeout: 5   — Seconds before worker scales down

query { myself { endpoints { id name gpuIds templateId workersMax workersMin } } }

query { myself { currentSpendPerHr spendDetails { localStoragePerHour networkStoragePerHour gpuComputePerHour } } }

query { myself { pods { id name runtime { uptimeInSeconds } machine { gpuDisplayName } desiredStatus } } }

mutation { saveEndpoint(input: {
  id: "endpoint_id",
  name: "endpoint-name",
  templateId: "template_id",
  gpuIds: "AMPERE_24",
  workersMin: 0,
  workersMax: 1
}) { id gpuIds } }

{
  "jobs": { "completed": 16, "failed": 1, "inProgress": 0, "inQueue": 2, "retried": 0 },
  "workers": { "idle": 0, "initializing": 1, "ready": 0, "running": 0, "throttled": 0 }
}

gpuIds: "AMPERE_24,ADA_24"   # Try 3090 first, fall back to 4090

AWS_ACCESS_KEY_ID="$R2_ACCESS_KEY_ID" \
AWS_SECRET_ACCESS_KEY="$R2_SECRET_ACCESS_KEY" \
aws s3api list-objects-v2 \
  --bucket "$R2_BUCKET_NAME" \
  --endpoint-url "https://${R2_ACCOUNT_ID}.r2.cloudflarestorage.com" \
  --region auto

docker buildx build --platform linux/amd64 -t ghcr.io/conalmullan/video-toolkit-<name>:latest docker/runpod-<name>/
docker push ghcr.io/conalmullan/video-toolkit-<name>:latest

image_edit	`ghcr.io/conalmullan/video-toolkit-qwen-edit:latest`	A6000/L40S	48GB+	~$0.05-0.15/job
upscale	`ghcr.io/conalmullan/video-toolkit-realesrgan:latest`	RTX 3090/4090	24GB	~$0.01-0.05/job
dewatermark	`ghcr.io/conalmullan/video-toolkit-propainter:latest`	RTX 3090/4090	24GB	~$0.05-0.30/job
sadtalker	`ghcr.io/conalmullan/video-toolkit-sadtalker:latest`	RTX 4090	24GB	~$0.05-0.15/job
qwen3_tts	`ghcr.io/conalmullan/video-toolkit-qwen3-tts:latest`	ADA 24GB	24GB	~$0.01-0.05/job

Action	Method	URL
Submit job	POST	`/v2/{id}/run`
Check status	GET	`/v2/{id}/status/{job_id}`
Cancel job	POST	`/v2/{id}/cancel/{job_id}`
List pending	GET	`/v2/{id}/requests`
Health/stats	GET	`/v2/{id}/health`

ID	GPU	VRAM	Typical Cost
`AMPERE_24`	RTX 3090	24GB	~$0.34/hr
`ADA_24`	RTX 4090	24GB	~$0.69/hr
`AMPERE_48`	A6000	48GB	~$0.76/hr
`AMPERE_80`	A100	80GB	~$1.99/hr

Runpod

RunPod Cloud GPU

Setup

Available Images

Runpod

RunPod Cloud GPU

Setup

Available Images

How It Works

Endpoint Management

Workers

Checking Endpoint Status

Disabling an Endpoint

RunPod API Reference

Authentication

GraphQL Queries

GraphQL Mutations

REST API (Serverless)

GPU Type IDs

Cloudflare R2 via AWS CLI

Troubleshooting

Force Image Pull

Cold Start Too Slow

Job Fails with OOM

"No workers available"

Docker Images

Cost Optimization

Feishu Drive

Nanoclaw Repl

Crosspost

Cloudflare

Mcp Integration

Setup Deploy

Tool	Env Var
image_edit	`RUNPOD_QWEN_EDIT_ENDPOINT_ID`
upscale	`RUNPOD_UPSCALE_ENDPOINT_ID`
dewatermark	`RUNPOD_DEWATERMARK_ENDPOINT_ID`
sadtalker	`RUNPOD_SADTALKER_ENDPOINT_ID`
qwen3_tts	`RUNPOD_QWEN3_TTS_ENDPOINT_ID`