Controls the SunFounder PiDog robot via natural language using the Command PiDog REST API. Use when asked to make PiDog move, perform actions, react emotionally, look at the camera, read sensors, change LED colors, play sounds, navigate, greet people, do tricks, describe what it sees, or respond to touch and voice. Supports all 30 actions, vision/camera AI analysis, real-time sensor data, RGB LEDs, and WebSocket streaming.
Control a SunFounder PiDog robotic dog using natural language. This skill bridges voice/text commands to the full Command PiDog REST API running on a Raspberry Pi.
Use this skill when a user asks to:

- Make PiDog move, do tricks, or perform any of its 30 actions
- React emotionally, greet people, or respond to touch and voice
- Look through the camera and describe what it sees
- Read sensors, change LED colors, play sounds, or navigate

Prerequisites:

- The API server is running on the Pi: `uvicorn app.main:app --host 0.0.0.0 --port 8000`
- Base URL: `http://<pi-hostname>:8000/api/v1`
- Vision features need the camera started (`POST /camera/start`) and a vision-capable model configured
- Environment variables (`api/.env`):
```
PIDOG_OLLAMA_URL=http://localhost:11434
PIDOG_OLLAMA_MODEL=llama3.2:3b
PIDOG_OLLAMA_VISION_MODEL=llava:7b
PIDOG_OPENROUTER_API_KEY=sk-or-...
PIDOG_OPENROUTER_VISION_MODEL=meta-llama/llama-3.2-11b-vision-instruct
PIDOG_CAMERA_ENABLED=true
```
```
POST /api/v1/actions/execute
Content-Type: application/json

{"actions": ["sit", "handshake"], "speed": 80}
```
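For scripting, the same request can be sent from Python's standard library. A minimal sketch: the hostname and helper names are illustrative; only the endpoint path and payload shape come from this document.

```python
import json
import urllib.request

BASE = "http://raspberrypi.local:8000/api/v1"  # hypothetical Pi hostname

def build_action_request(actions, speed=80):
    """JSON body for POST /actions/execute (shape from the API doc)."""
    return {"actions": list(actions), "speed": speed}

def execute(actions, speed=80):
    """POST an action list to the PiDog API; returns the parsed JSON reply."""
    body = json.dumps(build_action_request(actions, speed)).encode()
    req = urllib.request.Request(
        f"{BASE}/actions/execute", data=body,
        headers={"Content-Type": "application/json"}, method="POST")
    with urllib.request.urlopen(req, timeout=10) as resp:
        return json.load(resp)

# execute(["sit", "handshake"])  # requires a live PiDog API on the network
```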
All 30 actions (speed 0–100):
| Category | Actions |
|---|---|
| Movement | forward, backward, turn left, turn right, stop |
| Postures | stand, sit, lie |
| Expressions | bark, bark harder, pant, howling, wag tail, shake head, nod, think, recall, fluster, surprise |
| Social | handshake, high five, lick hand, scratch |
| Physical | stretch, push up, twist body, relax neck |
| Idle | doze off, waiting, feet shake |
Posture dependencies (handled automatically):
- `doze off` requires lying first → send `["lie", "doze off"]`
- `handshake`, `high five`, `lick hand`, `scratch`, `nod`, `relax neck`, `feet shake` → need sitting
- `push up`, `twist body`, `forward`, `backward`, `turn left`, `turn right` → need standing

`GET /api/v1/sensors/all`
Returns: distance (cm), imu (pitch/roll °), touch (N/L/R/LS/RS), sound (direction °, detected bool)
Individual endpoints: /sensors/distance, /sensors/imu, /sensors/touch, /sensors/sound
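The posture dependencies above are resolved by the API automatically, but previewing the final action queue client-side can help with debugging. A sketch: the dependency sets are copied from the table above; the helper itself is not part of the API.

```python
# Actions that only work from a given posture (from the dependency table).
NEEDS_SITTING = {"handshake", "high five", "lick hand", "scratch",
                 "nod", "relax neck", "feet shake"}
NEEDS_STANDING = {"push up", "twist body", "forward", "backward",
                  "turn left", "turn right"}
NEEDS_LYING = {"doze off"}

def with_posture(actions):
    """Prepend the required posture before any action that needs one."""
    out, posture = [], None
    for a in actions:
        if a in NEEDS_SITTING and posture != "sit":
            out.append("sit"); posture = "sit"
        elif a in NEEDS_STANDING and posture != "stand":
            out.append("stand"); posture = "stand"
        elif a in NEEDS_LYING and posture != "lie":
            out.append("lie"); posture = "lie"
        out.append(a)
        if a in ("sit", "stand", "lie"):  # explicit posture changes
            posture = a
    return out
```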
Touch sensor states:
- `N` — no touch
- `R` — front touched (PiDog likes this → wag tail, pant)
- `L` — rear touched
- `RS` — slide front→rear (PiDog loves this → ecstatic reaction)
- `LS` — slide rear→front (PiDog dislikes this → shake head, back away)

Start the camera first (idempotent):
POST /api/v1/camera/start
Then ask a vision question:
```
POST /api/v1/agent/vision
Content-Type: application/json

{
  "question": "Who is in front of me? Are they waving?",
  "provider": "openrouter",
  "model": "meta-llama/llama-3.2-11b-vision-instruct"
}
```
Returns: `description`, `answer`, `actions[]`

The vision endpoint captures a frame from the camera, sends it to the configured vision model together with your question, and returns a scene description, a direct answer, and a list of suggested actions. Typical uses: recognizing and greeting people, checking whether someone is waving, simple security sweeps, and narrating the scene.
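A sketch of building this request body in Python. Whether `provider` and `model` may be omitted to fall back on the configured defaults is an assumption, not stated in this document.

```python
def build_vision_request(question, provider=None, model=None):
    """JSON body for POST /agent/vision.

    Call POST /camera/start first; the vision endpoint needs a live camera.
    """
    body = {"question": question}
    if provider is not None:
        body["provider"] = provider  # e.g. "openrouter" or "ollama"
    if model is not None:
        body["model"] = model  # e.g. "llava:7b" (assumed override field)
    return body
```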
```
POST /api/v1/agent/chat
Content-Type: application/json

{"message": "Do a trick to impress me!", "provider": "ollama"}
```
The LLM receives the PiDog skill document + live sensor context and responds with a JSON action plan that is automatically executed.
```
POST /api/v1/rgb/mode
Content-Type: application/json

{"style": "breath", "color": "cyan", "bps": 1.0, "brightness": 0.8}
```
| Style | Effect |
|---|---|
| monochromatic | Solid color |
| breath | Slowly pulses in and out |
| boom | Explodes from center outward |
| bark | Radiates from center (alarm) |
| speak | Oscillates center↔edges (talking) |
| listen | Sweeps left→right (listening) |
Colors: white, black, red, yellow, green, blue, cyan, magenta, pink, or hex #rrggbb
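A client-side sanity check for these values can catch typos before a request goes out; purely illustrative, since the API is assumed to validate them as well.

```python
import re

NAMED_COLORS = {"white", "black", "red", "yellow", "green",
                "blue", "cyan", "magenta", "pink"}
STYLES = {"monochromatic", "breath", "boom", "bark", "speak", "listen"}
HEX_RE = re.compile(r"^#[0-9a-fA-F]{6}$")

def valid_color(color):
    """True if color is one of the named colors or a #rrggbb hex string."""
    return color in NAMED_COLORS or bool(HEX_RE.match(color))

def build_rgb_request(style, color, bps=1.0, brightness=0.8):
    """JSON body for POST /rgb/mode, with client-side sanity checks."""
    if style not in STYLES:
        raise ValueError(f"unknown style: {style}")
    if not valid_color(color):
        raise ValueError(f"unknown color: {color}")
    return {"style": style, "color": color, "bps": bps, "brightness": brightness}
```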
```
POST /api/v1/servos/head

{"yaw": 45, "roll": 0, "pitch": -20, "speed": 60}
```
Ranges: yaw ±90°, roll ±70°, pitch -45° to +30°
```
POST /api/v1/servos/tail

{"angle": 60, "speed": 50}
```
Tail range: ±90°
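The ranges above, plus the sound-tracking yaw formula used later in this document (`(direction - 180) / 2`), lend themselves to small client-side helpers. A sketch: the helper names are mine, and applying the 0–100 speed range to servos (as for actions) is an assumption.

```python
def clamp(v, lo, hi):
    """Constrain v to the inclusive range [lo, hi]."""
    return max(lo, min(hi, v))

def head_request(yaw=0, roll=0, pitch=0, speed=60):
    """JSON body for POST /servos/head, clamped to the documented ranges."""
    return {"yaw": clamp(yaw, -90, 90),
            "roll": clamp(roll, -70, 70),
            "pitch": clamp(pitch, -45, 30),
            "speed": clamp(speed, 0, 100)}  # assumed 0-100 like actions

def yaw_for_sound(direction_deg):
    """Head yaw to face a sound source, per this document's sound-tracking
    recipe: (direction - 180) / 2, clamped to the ±90° yaw range."""
    return clamp((direction_deg - 180) / 2, -90, 90)
```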
```
GET /api/v1/sound/list     # List all available sounds
POST /api/v1/sound/play

{"name": "single_bark_1", "volume": 80}
```
```javascript
const ws = new WebSocket("ws://<pi-hostname>:8000/api/v1/ws");
// Wait for the connection to open before subscribing.
ws.onopen = () => ws.send(JSON.stringify({
  "type": "subscribe",
  "channels": ["sensors", "action_status", "status", "logs"]
}));
```
| Channel | Rate | Data |
|---|---|---|
| sensors | 5 Hz | distance, IMU, touch, sound |
| status | 0.2 Hz | battery, posture, uptime |
| action_status | on change | current action queue state |
| logs | as emitted | server log stream |
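A small Python counterpart to the JavaScript subscribe call above; the channel names mirror the table, the helper name and validation are my own.

```python
import json

CHANNELS = {"sensors", "action_status", "status", "logs"}

def subscribe_message(channels):
    """Build the subscribe frame sent after connecting to /api/v1/ws."""
    bad = set(channels) - CHANNELS
    if bad:
        raise ValueError(f"unknown channels: {sorted(bad)}")
    return json.dumps({"type": "subscribe", "channels": list(channels)})
```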
Obstacle avoidance: poll `GET /sensors/distance` in a loop. When distance < 20 cm:

- `POST /actions/execute` → `["stop", "backward"]`
- `POST /rgb/mode` → `{"style": "bark", "color": "red", "bps": 3}`

Sound tracking: poll `GET /sensors/sound`. When sound is detected:

- `POST /servos/head` with yaw set to `(direction - 180) / 2` to face the sound source
- `bark` if the direction persists

Fall detection: poll `GET /sensors/imu`. When pitch or roll exceeds ±30°:

- `["surprise"]` action + yellow `boom` LEDs

Touch reactions: poll `GET /sensors/touch` and react in real time:

- `R` or `RS` → `["wag tail", "pant"]` + pink `breath` LEDs
- `LS` → `["shake head", "backward"]` + red `monochromatic`
- `L` → `["scratch"]`

Security patrol:

- `POST /camera/start`
- Walk a square (`forward` × N, `turn right` × 4)
- `POST /agent/vision` with the question "Is anyone here who shouldn't be?"

Chain actions for a full crowd-pleasing routine:
```
{"actions": ["stand", "stretch", "sit", "handshake"]}
```
Then follow with: high five → push up → howling → wag tail
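The whole routine can also go out as one request body, which keeps the posture handling in a single queue; the speed value here is illustrative.

```python
routine = {
    # Order from the showcase recipe above: setup, then tricks in sequence.
    "actions": ["stand", "stretch", "sit", "handshake",
                "high five", "push up", "howling", "wag tail"],
    "speed": 80,  # illustrative; any 0-100 value works
}
```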
Read sensor state and pick an emotion:
- `doze off` + magenta `breath`
- `surprise` + red `boom`
- `pant` + pink `breath`
- `waiting` → random one of [`scratch`, `relax neck`, `feet shake`]

Start the MJPEG stream at `/camera/stream` and every 30 seconds call `/agent/vision` with:
"Briefly describe this scene in one sentence, then suggest a fun PiDog reaction."
Use the answer as a caption overlay or TTS narration.
| Problem | Solution |
|---|---|
| 503 on /agent/vision | Start camera first: POST /camera/start |
| 502 on /agent/vision | Vision model not running — use provider=openrouter or run ollama pull llava:7b |
| 422 on action execute | Invalid action name — check /actions for valid list |
| 422 "battery low" | Battery below 6.5V — charge before heavy movement |
| 429 rate limit | Max 10 actions/second — add delays between rapid commands |
| Actions not chaining | Include posture setup in same request: ["sit", "handshake"] |
| WebSocket disconnects | Reconnect and re-send subscribe message |
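For the WebSocket-disconnect row, reconnect pacing is worth sketching: on each reconnect, re-send the subscribe message, and back off between attempts so a down server is not hammered. The exponential-backoff policy below is my suggestion, not part of the API.

```python
import itertools

def backoff_delays(base=0.5, cap=10.0):
    """Exponential reconnect delays: 0.5, 1, 2, ... seconds, capped at `cap`.

    Usage sketch: on disconnect, sleep for the next delay, reopen the
    WebSocket, re-send the subscribe message, and reset the generator
    once a connection succeeds.
    """
    for n in itertools.count():
        yield min(cap, base * (2 ** n))
```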