RunPod: Deploy and manage GPU/CPU pods, network volumes, and templates on RunPod. Use when the user asks to launch a GPU server, manage cloud compute, run ML training or inference on remote GPUs, or interact with the RunPod platform in any way. Triggers: runpod, gpu cloud, launch pod, spin up server, deploy gpu, remote training, cloud gpu, inference server.
RunPod — Pod & Infrastructure Management
Updated Mar 18, 2026
Auth
Set the RUNPOD_API_KEY environment variable, or prompt the user for their API key. Every request sends:
Authorization: Bearer <RUNPOD_API_KEY>
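As a minimal sketch, the header above can be assembled in Python (standard library only; `runpod_headers` is a hypothetical helper name, not part of any RunPod SDK):

```python
import os

def runpod_headers(api_key=None):
    """Headers for every RunPod API call (REST and GraphQL alike)."""
    key = api_key or os.environ.get("RUNPOD_API_KEY")
    if not key:
        raise RuntimeError("Set RUNPOD_API_KEY or pass api_key explicitly")
    return {
        "Authorization": f"Bearer {key}",
        "Content-Type": "application/json",
    }
```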
API Surface
Two APIs exist. Use REST for everything unless noted otherwise.
| API | Base URL | Use For |
|---|---|---|
| REST (primary) | https://rest.runpod.io/v1 | All CRUD operations on pods, volumes, templates |
| GraphQL | https://api.runpod.io/graphql | GPU availability queries, runtime metrics, spot instance deployment |
CAUTION: api.runpod.io is GraphQL only. REST calls to api.runpod.io will fail silently. Always use rest.runpod.io for REST.
1. Check GPU Availability
Use GraphQL — REST has no availability filter.
POST https://api.runpod.io/graphql
query {
gpuTypes {
id
displayName
memoryInGb
communityPrice
securePrice
stockStatus # LOW, MEDIUM, HIGH, or null (unavailable)
communityCloud
secureCloud
}
}
Common GPU IDs

| GPU | VRAM | ID | Notes |
|---|---|---|---|
| RTX 4090 | 24 GB | NVIDIA GeForce RTX 4090 | Dev/testing, cheap |
| RTX 4000 Ada | 20 GB | NVIDIA RTX 4000 Ada Generation | Light inference |
| L40S | 48 GB | NVIDIA L40S | Best value for training |
| A40 | 48 GB | NVIDIA A40 | Inference workhorse |
| RTX 6000 Ada | 48 GB | NVIDIA RTX 6000 Ada Generation | Alternative 48 GB |
| A100 80 GB | 80 GB | NVIDIA A100 80GB PCIe | Large models |
| A100 SXM | 80 GB | NVIDIA A100-SXM4-80GB | Higher bandwidth A100 |
| H100 SXM | 80 GB | NVIDIA H100 80GB HBM3 | Fastest training |
| H100 NVL | 94 GB | NVIDIA H100 NVL | Max VRAM H100 |
GPU availability is seconds-level volatile for premium cards (H100, A100, L40S). Always wrap deploys in retry logic with a fallback GPU list.
Suggested Fallback Chains

| Use Case | Try In Order |
|---|---|
| Training (48 GB+) | L40S → A40 → RTX 6000 Ada → A100 80 GB |
| Inference (24 GB+) | RTX 4090 → RTX 4000 Ada → L40S |
| Frontier models | H100 SXM → H100 NVL → A100 SXM → A100 80 GB |
| Budget dev | RTX 4090 → RTX 4000 Ada |
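The retry-with-fallback advice above can be sketched as follows. Assumptions: `deploy_with_fallback` and `FALLBACKS` are hypothetical names, and the deploy callable signals "no instances available" by raising `RuntimeError`; a real implementation would map the API's error response to that signal.

```python
import time

# Fallback chains from the table above (values are RunPod gpuTypeIds).
FALLBACKS = {
    "training-48gb": ["NVIDIA L40S", "NVIDIA A40",
                      "NVIDIA RTX 6000 Ada Generation", "NVIDIA A100 80GB PCIe"],
    "inference-24gb": ["NVIDIA GeForce RTX 4090",
                       "NVIDIA RTX 4000 Ada Generation", "NVIDIA L40S"],
}

def deploy_with_fallback(deploy_fn, gpu_chain, attempts_per_gpu=2, delay_s=0.0):
    """Try each GPU in order, retrying a few times per GPU before
    falling through to the next one in the chain."""
    last_err = None
    for gpu_id in gpu_chain:
        for _ in range(attempts_per_gpu):
            try:
                return deploy_fn(gpu_id)  # should return the created pod object
            except RuntimeError as err:   # assumed "no instances" signal
                last_err = err
                time.sleep(delay_s)
    raise RuntimeError(f"every GPU in the chain was unavailable: {last_err}")
```

Because premium-card stock changes in seconds, keep `attempts_per_gpu` small and the chain long rather than hammering a single GPU type.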
2. Create a Pod
REST (preferred) POST https://rest.runpod.io/v1/pods
Content-Type: application/json
Authorization: Bearer <API_KEY>
{
"name": "my-pod",
"imageName": "runpod/pytorch:2.4.0-py3.11-cuda12.4.1-devel-ubuntu22.04",
"gpuTypeIds": ["NVIDIA L40S"],
"gpuCount": 1,
"cloudType": "ALL",
"containerDiskInGb": 50,
"volumeInGb": 100,
"volumeMountPath": "/workspace",
"ports": ["8888/http", "22/tcp"],
"env": {
"JUPYTER_PASSWORD": "mypassword",
"HF_TOKEN": "hf_..."
}
}
Returns 201 with full pod object including id.
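A sketch of issuing this call with only the Python standard library. The `build_create_pod_request` helper name is ours; it builds the request object without sending it, so error handling around `urlopen` stays with the caller.

```python
import json
import os
import urllib.request

def build_create_pod_request(payload):
    """Build (but do not send) the POST /v1/pods request; pass the
    result to urllib.request.urlopen() to actually deploy."""
    return urllib.request.Request(
        "https://rest.runpod.io/v1/pods",
        data=json.dumps(payload).encode(),
        headers={
            "Authorization": f"Bearer {os.environ['RUNPOD_API_KEY']}",
            "Content-Type": "application/json",
        },
        method="POST",
    )
```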
Full Field Reference (REST POST /pods)

| Field | Type | Default | Notes |
|---|---|---|---|
| name | string | "my pod" | Max 191 chars |
| imageName | string | required | Container image tag |
| gpuTypeIds | string[] | — | Array. GPU pods only |
| gpuCount | int | 1 | Multi-GPU: 2, 4, 8 |
| gpuTypePriority | string | "availability" | availability or custom (use ordering in array) |
| computeType | string | "GPU" | GPU or CPU |
| cpuFlavorIds | string[] | — | CPU pods only: cpu3c, cpu3g, cpu3m, cpu5c, cpu5g, cpu5m |
| cloudType | string | "SECURE" | SECURE, COMMUNITY, or omit for secure only |
| containerDiskInGb | int | 50 | Ephemeral — wiped on every restart |
| volumeInGb | int | 20 | Persists across restarts, mounted at volumeMountPath |
| volumeMountPath | string | "/workspace" | Where volume mounts |
| networkVolumeId | string | — | Attach a network volume (replaces local volume) |
| ports | string[] | ["8888/http","22/tcp"] | Format: port/protocol |
| env | object | {} | {"KEY": "value"} format |
GraphQL (use for spot instances) Spot instances (podRentInterruptable) are only available via GraphQL.
POST https://api.runpod.io/graphql
mutation {
podRentInterruptable(input: {
name: "spot-training"
imageName: "runpod/pytorch:2.4.0-py3.11-cuda12.4.1-devel-ubuntu22.04"
gpuTypeId: "NVIDIA L40S"
gpuCount: 1
cloudType: SECURE
bidPerGpu: 0.40
containerDiskInGb: 50
volumeInGb: 100
volumeMountPath: "/workspace"
ports: "8888/http,22/tcp"
env: [{ key: "HF_TOKEN", value: "hf_..." }]
}) {
id
machineId
machine { podHostId }
}
}
CRITICAL DIFFERENCE: GraphQL env is [{key, value}] array. REST env is {"key": "value"} object. GraphQL gpuTypeId is singular string. REST gpuTypeIds is plural array. GraphQL ports is a single comma-separated string. REST ports is an array.
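The shape differences can be translated mechanically; a sketch (`rest_to_graphql_input` is a hypothetical helper covering only the three differences listed above, with every other field passing through unchanged):

```python
def rest_to_graphql_input(rest):
    """Translate a REST create-pod body into the GraphQL
    podRentInterruptable shapes."""
    gql = dict(rest)
    # REST gpuTypeIds is a string array; GraphQL gpuTypeId is one string.
    if "gpuTypeIds" in gql:
        gql["gpuTypeId"] = gql.pop("gpuTypeIds")[0]
    # REST ports is an array; GraphQL ports is one comma-separated string.
    if isinstance(gql.get("ports"), list):
        gql["ports"] = ",".join(gql["ports"])
    # REST env is an object; GraphQL env is a [{key, value}] array.
    if isinstance(gql.get("env"), dict):
        gql["env"] = [{"key": k, "value": v} for k, v in gql["env"].items()]
    return gql
```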
SPOT WARNING: Interruptible pods can be terminated at any time with zero notice. Always checkpoint work to network volumes or external storage.
3. List Pods
REST GET https://rest.runpod.io/v1/pods
Returns array of pod objects with fields: id, name, desiredStatus (RUNNING/EXITED/TERMINATED), costPerHr, gpu, image, ports, publicIp, portMappings, env, volumeInGb, containerDiskInGb, machine, lastStatusChange, interruptible, locked.
Get single pod GET https://rest.runpod.io/v1/pods/{podId}
Runtime metrics (GraphQL only) REST doesn't return live GPU utilization or container metrics. Use GraphQL:
query {
myself {
pods {
id
name
desiredStatus
costPerHr
runtime {
uptimeInSeconds
ports { ip isIpPublic privatePort publicPort type }
gpus { id gpuUtilPercent memoryUtilPercent }
container { cpuPercent memoryPercent }
}
}
}
}
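One practical use of these metrics is flagging pods that are running (and billing) but idle. A sketch operating on the `myself` object returned by the query above; the helper name and the 5% threshold are our assumptions:

```python
def idle_pods(myself, gpu_util_threshold=5.0):
    """Return names of RUNNING pods whose every GPU is below the
    utilization threshold: candidates for stopping to save cost."""
    idle = []
    for pod in myself.get("pods", []):
        if pod.get("desiredStatus") != "RUNNING":
            continue
        runtime = pod.get("runtime") or {}  # runtime is null for stopped pods
        gpus = runtime.get("gpus") or []
        if gpus and all(g["gpuUtilPercent"] < gpu_util_threshold for g in gpus):
            idle.append(pod["name"])
    return idle
```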
4. Pod Lifecycle
Stop (releases GPU, preserves volume) POST https://rest.runpod.io/v1/pods/{podId}/stop
Start / Resume POST https://rest.runpod.io/v1/pods/{podId}/start
Restart (soft restart, keeps GPU) POST https://rest.runpod.io/v1/pods/{podId}/restart
Reset (wipes container disk, keeps volume) POST https://rest.runpod.io/v1/pods/{podId}/reset
Terminate (permanent deletion) DELETE https://rest.runpod.io/v1/pods/{podId}
TERMINATE IS IRREVERSIBLE. All data not on a network volume is permanently destroyed.
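A small dispatch helper makes the stop/terminate distinction harder to get wrong. A sketch (`lifecycle_request` is a hypothetical name; it only builds the HTTP method and URL from the endpoints listed above and sends nothing):

```python
def lifecycle_request(pod_id, action):
    """Map a lifecycle action to an (HTTP method, URL) pair.
    'terminate' is the only DELETE; the rest are POST sub-resources."""
    base = f"https://rest.runpod.io/v1/pods/{pod_id}"
    if action == "terminate":
        return ("DELETE", base)
    if action in ("stop", "start", "restart", "reset"):
        return ("POST", f"{base}/{action}")
    raise ValueError(f"unknown lifecycle action: {action}")
```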
Resume spot instance (GraphQL only) mutation {
podBidResume(input: {
podId: "abc123"
bidPerGpu: 0.40
gpuCount: 1
}) {
id
desiredStatus
}
}
5. Update a Pod PATCH https://rest.runpod.io/v1/pods/{podId}
Content-Type: application/json
{
"env": {"NEW_VAR": "value"},
"ports": ["8888/http", "22/tcp", "5000/http"],
"volumeInGb": 200,
"imageName": "runpod/pytorch:2.8.0-py3.11-cuda12.6-devel-ubuntu22.04"
}
Updatable fields: containerDiskInGb, containerRegistryAuthId, dockerEntrypoint, dockerStartCmd, env, globalNetworking, imageName, locked, name, ports, volumeInGb, volumeMountPath.
Updates may trigger a pod reset. Warn user that running processes will be interrupted.
6. Network Volumes
Network volumes are persistent storage independent of any pod. They survive pod termination and can be attached to any pod in the same datacenter. Critical for ML workflows.
Create POST https://rest.runpod.io/v1/networkvolumes
{
"name": "training-data",
"size": 100,
"dataCenterId": "US-TX-3"
}
List GET https://rest.runpod.io/v1/networkvolumes
Get GET https://rest.runpod.io/v1/networkvolumes/{networkVolumeId}
Update PATCH https://rest.runpod.io/v1/networkvolumes/{networkVolumeId}
Delete DELETE https://rest.runpod.io/v1/networkvolumes/{networkVolumeId}
Attach to pod at creation Set networkVolumeId in the pod create call. This replaces the local volume — the network volume mounts at volumeMountPath (default /workspace).
{
"name": "my-pod",
"imageName": "runpod/pytorch:2.4.0-py3.11-cuda12.4.1-devel-ubuntu22.04",
"gpuTypeIds": ["NVIDIA L40S"],
"networkVolumeId": "vol_abc123",
"volumeMountPath": "/workspace"
}
Gotchas
Cannot attach/detach after pod creation. Must terminate and recreate.
Size cannot be decreased, only increased. Over 4 TB requires support contact.
Concurrent writes from multiple pods corrupt data. One writer at a time.
Pod and volume must be in the same datacenter. Specify dataCenterIds on pod create to match.
$0.07/GB/month for the first TB, $0.05/GB/month after. Volumes with no payment method risk deletion.
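The tiered pricing above works out as follows; a sketch (whether RunPod counts a TB as 1000 or 1024 GB is our assumption, defaulting to 1000):

```python
def monthly_volume_cost(size_gb, tb_gb=1000):
    """Monthly network-volume cost: $0.07/GB for the first TB,
    $0.05/GB beyond (tb_gb=1000 is an assumed TB definition)."""
    first = min(size_gb, tb_gb)
    rest = max(size_gb - tb_gb, 0)
    return round(first * 0.07 + rest * 0.05, 2)
```

For example, a 100 GB volume comes to $7.00/month under this pricing.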
7. Templates
List GET https://rest.runpod.io/v1/templates
Create POST https://rest.runpod.io/v1/templates
{
"name": "my-training-template",
"imageName": "runpod/pytorch:2.4.0-py3.11-cuda12.4.1-devel-ubuntu22.04",
"containerDiskInGb": 50,
"volumeInGb": 100,
"volumeMountPath": "/workspace",
"ports": ["8888/http", "22/tcp"],
"env": {"JUPYTER_PASSWORD": "default"},
"dockerStartCmd": []
}
Update PATCH https://rest.runpod.io/v1/templates/{templateId}
Delete DELETE https://rest.runpod.io/v1/templates/{templateId}
Template must not be in use by any pod or serverless endpoint.
8. Connecting to a Pod
Additional fields for POST /pods (continuation of the Full Field Reference in section 2):

| Field | Type | Default | Notes |
|---|---|---|---|
| dockerEntrypoint | string[] | — | Override image ENTRYPOINT |
| dockerStartCmd | string[] | [] | Override image CMD |
| templateId | string | — | UUID from template list |
| interruptible | bool | false | Spot pricing (can be preempted at any time) |
| locked | bool | false | Prevents accidental stop/reset |
| dataCenterIds | string[] | all | e.g. ["US-TX-3","US-KS-2","EU-RO-1"] |
| dataCenterPriority | string | "availability" | availability or custom |
| countryCodes | string[] | — | Filter by country |
| allowedCudaVersions | string[] | — | e.g. ["12.4","12.3"] |
| minRAMPerGPU | int | 8 | GPU pods only |
| minVCPUPerGPU | int | 2 | GPU pods only |
| vcpuCount | int | 2 | CPU pods only |
| supportPublicIp | bool | — | Community Cloud only |
| globalNetworking | bool | false | Low-latency private networking (limited availability) |

8. Connecting to a Pod
HTTP Proxy (easiest, has timeout)