Deploy and optimize applications on Jetson Orin Nano with TensorRT. Use when setting up Jetson environments, converting models to TensorRT, managing power modes, and containerizing edge AI applications.
"The future of AI is at the edge. Every robot, every camera, every sensor will have AI processing locally." — Dustin Franklin, NVIDIA Jetson AI Developer
This skill orchestrates the full lifecycle of deploying AI applications to NVIDIA Jetson Orin Nano devices. Every decision is constrained by thermal limits, power budgets, and memory ceilings that do not exist in cloud or desktop environments.
Non-Negotiable Constraints:
Use jetson-containers from dusty-nv to build reproducible deployment environments; bare-metal installs create fragile, unreproducible setups.

| Principle | Description | Priority |
|---|---|---|
| Power Mode Awareness | Select and validate power mode before benchmarking or deploying; results are meaningless without a fixed power profile | Critical |
| TensorRT First | Convert all inference models to TensorRT engines before deployment; never ship raw ONNX or PyTorch models to production | Critical |
| JetPack Compatibility | Verify JetPack version, L4T version, and CUDA version before installing any package or building any container | Critical |
| Container Reproducibility | Use jetson-containers for all deployments; pin base images to specific L4T versions; never rely on bare-metal installs | High |
| Thermal Management | Profile thermal behavior under sustained load; set power mode and fan policy before benchmarking; monitor with tegrastats | High |
| Memory Budget Discipline | The Orin Nano has 8GB unified memory shared between CPU and GPU; always account for OS overhead (~1.5GB), display server, and framework footprint | High |
| On-Device Validation | Never trust desktop or cloud benchmarks; always validate latency, throughput, and accuracy on the target Jetson device | High |
| Precision-Accuracy Tradeoff | FP16 is the default for Orin Nano; INT8 requires calibration data and accuracy validation; never assume precision reduction is lossless | Medium |
| Incremental Deployment | Deploy one component at a time; validate each stage before adding the next pipeline element | Medium |
| Telemetry from Day One | Instrument with tegrastats and jtop from the first deployment; do not wait for production to add monitoring | Medium |
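The memory-budget principle above can be turned into a quick pre-deployment gate. A minimal sketch, assuming illustrative overhead figures (OS ~1.5GB, framework ~1GB, per the table) rather than measured values:

```shell
#!/usr/bin/env bash
# Hypothetical memory-budget gate for the 8GB Orin Nano.
# The overhead figures are illustrative assumptions, not measurements.
check_mem_budget() {
  local total_mb=$1 model_mb=$2
  local os_mb=1536 runtime_mb=1024          # assumed OS + framework overhead
  local free_mb=$(( total_mb - os_mb - runtime_mb ))
  if [ "$model_mb" -le "$free_mb" ]; then
    echo "OK: ${model_mb}MiB fits in ${free_mb}MiB headroom"
  else
    echo "FAIL: ${model_mb}MiB exceeds ${free_mb}MiB headroom"
  fi
}

check_mem_budget 8192 2048   # prints: OK: 2048MiB fits in 5632MiB headroom
```

Measure the real footprint of your model and runtime on-device and replace the assumed constants before relying on this check.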
Use search_knowledge (grounded-code-mcp) to ground decisions in authoritative references.
| Query | When to Call |
|---|---|
| `search_knowledge("TensorRT FP16 INT8 quantization Jetson")` | During CONVERT/OPTIMIZE — selecting precision and quantization strategy |
| `search_knowledge("Jetson JetPack CUDA cuDNN compatibility")` | During SETUP — verifying version compatibility before any installation |
| `search_knowledge("Docker container NVIDIA GPU runtime")` | During CONTAINERIZE — configuring the nvidia-docker runtime |
| `search_knowledge("TensorRT ONNX model conversion trtexec")` | During CONVERT — converting ONNX models to TensorRT engines |
| `search_knowledge("Jetson power mode thermal monitoring tegrastats")` | During BENCHMARK — measuring thermal behavior and power draw |
| `search_code_examples("TensorRT Python inference engine")` | Before writing inference code — find TensorRT Python API patterns |
| `search_code_examples("Docker Compose systemd service autostart")` | During DEPLOY — configuring auto-start and restart policies |
Protocol: Search edge_ai and robotics collections for Jetson and TensorRT guidance. Search automation for containerization and fleet deployment context. Always cite the source path from KB results.
┌──────────────────────────────────────────────────────────────────────┐
│                                                                      │
│  ┌───────┐    ┌─────────────┐    ┌─────────┐    ┌──────────┐         │
│  │ SETUP │───>│ CONTAINERIZE│───>│ CONVERT │───>│ OPTIMIZE │<──┐     │
│  └───────┘    └─────────────┘    └─────────┘    └──────────┘   │     │
│                                                       │        │     │
│                                                       v        │     │
│                              ┌────────┐         ┌───────────┐  │     │
│                              │ DEPLOY │<────────│ BENCHMARK │  │     │
│                              └────────┘         └───────────┘  │     │
│                                   │                            │     │
│                                   └─── (iterate if needed) ────┘     │
│                                                                      │
└──────────────────────────────────────────────────────────────────────┘
Before beginning any deployment workflow, verify:
PRE-FLIGHT VERIFICATION
┌──────────────────────────────────────────────────────────────────┐
│ □ JetPack version confirmed (cat /etc/nv_tegra_release) │
│ □ L4T version matches expected (dpkg -l nvidia-l4t-core) │
│ □ CUDA version confirmed (nvcc --version) │
│ □ TensorRT version confirmed (dpkg -l tensorrt) │
│ □ Available disk space > 10GB (df -h) │
│ □ Docker runtime is nvidia (docker info | grep -i runtime) │
│ □ Power mode is set (sudo nvpmodel -q) │
│ □ Fan mode is set (sudo jetson_clocks --show) │
│ □ Network access for container pulls (if needed) │
│ □ Model files are accessible on device │
└──────────────────────────────────────────────────────────────────┘
If ANY checkbox is unchecked → STOP. Resolve before proceeding.
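The first checkbox can be scripted. The sketch below assumes the usual `/etc/nv_tegra_release` line format (`# R<major> (release), REVISION: <minor>, ...`); verify it against your own device before depending on the parse:

```shell
#!/usr/bin/env bash
# Sketch: extract the L4T version from an /etc/nv_tegra_release line.
# The sample line mirrors the typical file format; treat it as illustrative.
parse_l4t() {
  echo "$1" | sed -n 's/^# R\([0-9]*\) (release), REVISION: \([0-9.]*\),.*/\1.\2/p'
}

sample='# R36 (release), REVISION: 4.0, GCID: 12345678, BOARD: generic'
parse_l4t "$sample"   # prints: 36.4.0
# On device: parse_l4t "$(head -n1 /etc/nv_tegra_release)"
```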
Objective: Confirm the Jetson device is properly configured for deployment.
Actions:
- `cat /etc/nv_tegra_release` to confirm the L4T version
- `sudo nvpmodel -q` to check the current power mode
- `sudo nvpmodel -m <MODE>` to set the target power mode (0=MAXN, 1=15W, 2=7W for Orin Nano)
- `sudo jetson_clocks` to lock clock frequencies for consistent benchmarking
- `sudo pip3 install jetson-stats` to install jtop for monitoring
- `docker run --rm --runtime nvidia --gpus all nvidia/cuda:12.2.0-base-ubuntu22.04 nvidia-smi` to verify GPU access from inside a container

Exit Criteria: Power mode is set and confirmed, clocks are locked, and the GPU is visible from inside a container.
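The mode IDs above can be wrapped in a tiny helper so deployment scripts log a human-readable power profile. The 0/1/2 mapping follows the Orin Nano list above; other Jetson modules use different tables:

```shell
#!/usr/bin/env bash
# Map Orin Nano nvpmodel mode IDs to names (per the list above).
mode_name() {
  case "$1" in
    0) echo "MAXN" ;;
    1) echo "15W" ;;
    2) echo "7W" ;;
    *) echo "unknown" ;;
  esac
}

mode_name 1   # prints: 15W
# On device: sudo nvpmodel -m 1 && sudo jetson_clocks && echo "mode: $(mode_name 1)"
```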
Objective: Build a reproducible container environment using jetson-containers.
Actions:
- `git clone https://github.com/dusty-nv/jetson-containers`
- Build the image with `jetson-containers build` or plain `docker build`

Exit Criteria:
- Container starts with `--runtime nvidia`
- TensorRT imports inside the container (`python3 -c "import tensorrt; print(tensorrt.__version__)"`)

Objective: Convert the model from its training format to a TensorRT engine.
Actions:
- Validate the ONNX model first: `python3 -c "import onnx; model = onnx.load('model.onnx'); onnx.checker.check_model(model)"`
- Convert with `trtexec` or the TensorRT Python API

Exit Criteria:
- A serialized engine file is produced (`.engine` or `.trt`)

Objective: Tune the TensorRT engine and pipeline for target performance.
Actions:
- `trtexec --loadEngine=model.engine --iterations=100 --avgRuns=50` to measure engine latency
- Tune workspace memory with `--memPoolSize=workspace:1024MiB`

Exit Criteria: Latency and throughput targets are met at the chosen precision, within the memory budget.
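The tuning loop above can be scripted end to end. The trtexec invocations are shown as comments because they need the device, and the `Latency:` summary line is illustrative of trtexec's report format, so confirm it against your TensorRT version's output:

```shell
#!/usr/bin/env bash
# Sketch: extract mean latency from a trtexec benchmark log.
# On device you would first run (assumes model.onnx exists):
#   trtexec --onnx=model.onnx --saveEngine=model.engine --fp16 \
#           --memPoolSize=workspace:1024MiB
#   trtexec --loadEngine=model.engine --iterations=100 --avgRuns=50 > bench.log
mean_latency_ms() {
  sed -n 's/.*mean = \([0-9.]*\) ms.*/\1/p' | head -n1
}

sample='[I] Latency: min = 4.1 ms, max = 6.3 ms, mean = 4.8 ms, median = 4.7 ms'
echo "$sample" | mean_latency_ms   # prints: 4.8
```

Comparing the extracted number across workspace sizes and precisions makes the tuning loop reproducible instead of eyeball-driven.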
Objective: Produce reliable, reproducible performance measurements.
Actions:
- Fix the power profile first: `sudo nvpmodel -m <MODE>` followed by `sudo jetson_clocks`
- Monitor during runs with `tegrastats --interval 1000`

Exit Criteria: Results are reproducible across runs under the fixed power mode, with thermal and memory behavior recorded.
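A small parser over tegrastats output makes benchmark telemetry loggable. The field layout varies across JetPack releases, so the sample line below is illustrative; check it against your device's actual output:

```shell
#!/usr/bin/env bash
# Sketch: pull RAM usage and GPU temperature from a tegrastats line.
parse_tegrastats() {
  local line=$1
  local ram gpu
  ram=$(echo "$line" | sed -n 's/.*RAM \([0-9]*\/[0-9]*\)MB.*/\1/p')
  gpu=$(echo "$line" | sed -n 's/.*GPU@\([0-9.]*\)C.*/\1/p')
  echo "ram=${ram}MB gpu=${gpu}C"
}

sample='RAM 3261/7620MB (lfb 4x4MB) CPU [12%@1510] GR3D_FREQ 0% CPU@46.5C GPU@45C'
parse_tegrastats "$sample"   # prints: ram=3261/7620MB gpu=45C
# On device: tegrastats --interval 1000 | while read -r l; do parse_tegrastats "$l"; done
```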
Objective: Finalize the deployment for production operation.
Actions:

- Configure auto-start and restart policy (e.g., a systemd service or a Docker Compose restart policy)
- Enable telemetry (tegrastats/jtop logging) before going live

Exit Criteria: The application starts on boot, restarts on failure, and monitoring is active.
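One common auto-start pattern is a systemd unit that supervises the container. A minimal sketch: the unit name, container name `jetson-app`, and `ExecStart` command here are hypothetical placeholders, and the right `ExecStart` depends on how the container was created:

```shell
#!/usr/bin/env bash
# Sketch: generate a systemd unit that restarts the app container on failure.
# Unit name, container name, and paths are hypothetical placeholders.
cat > /tmp/jetson-app.service <<'EOF'
[Unit]
Description=Edge AI inference container
After=docker.service
Requires=docker.service

[Service]
Restart=always
RestartSec=5
ExecStart=/usr/bin/docker start -a jetson-app
ExecStop=/usr/bin/docker stop jetson-app

[Install]
WantedBy=multi-user.target
EOF

grep -c '^Restart=always' /tmp/jetson-app.service   # prints: 1
# On device: sudo cp /tmp/jetson-app.service /etc/systemd/system/ &&
#            sudo systemctl daemon-reload && sudo systemctl enable --now jetson-app
```

`Restart=always` with a short `RestartSec` gives crash recovery without waiting for a fleet-management layer.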
Maintain state across conversation turns using this block:
<jetson-deploy-state>