Name: Using Dali Dynamic Mode
Author: NVIDIA

__cuda_array_interface__

__array__

b = ndd.batch([arr1, arr2])    # copy
b = ndd.as_batch(data)         # wrap, no copy if possible

xy = ndd.random.uniform(batch_size=16, range=[0, 1], shape=2)
crop_x = xy.slice[0]       # Batch of 16 scalars, first element from each sample
crop_y = xy.slice[1]       # Batch of 16 scalars, second element from each sample
sample_0 = xy.select(0)    # Tensor, the entire first sample [x, y]

reader = ndd.readers.File(file_root=image_dir, random_shuffle=True)

for epoch in range(num_epochs):
    for jpegs, labels in reader.next_epoch(batch_size=64):
        # jpegs, labels are Batch objects
        ...

reader = ndd.readers.File(
    file_root=image_dir,
    shard_id=rank, num_shards=world_size,
    stick_to_shard=True,
    pad_last_batch=True,
)
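To make the `shard_id` / `num_shards` arguments more concrete, the following plain-Python sketch shows one common way a dataset is divided among shards. It assumes the floor-based boundary rule described in DALI's sharding documentation; the helper `shard_range` is hypothetical and not part of DALI, and how dynamic mode applies `stick_to_shard` and `pad_last_batch` should be checked against the library itself.

```python
def shard_range(dataset_size: int, shard_id: int, num_shards: int) -> range:
    """Hypothetical helper (not a DALI API): sample indices covered by one shard."""
    if not 0 <= shard_id < num_shards:
        raise ValueError("shard_id must be in [0, num_shards)")
    # Assumed boundary rule: floor(shard_id * size / num_shards)
    start = dataset_size * shard_id // num_shards
    end = dataset_size * (shard_id + 1) // num_shards
    return range(start, end)


# Ten samples split across four ranks
shards = [shard_range(10, rank, 4) for rank in range(4)]
sizes = [len(s) for s in shards]
covered = sorted(i for s in shards for i in s)
```

Under this rule the shards do not overlap and differ in size by at most one sample. Uneven shard sizes are one reason a padding option such as `pad_last_batch=True` is commonly used in distributed training.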

with ndd.EvalMode.sync_full:
    images = ndd.decoders.image(jpegs, device="gpu")
    images = ndd.resize(images, size=[224, 224])
    # Any error surfaces here, at the exact op that failed

ndd.set_num_threads(4)  # Call once at startup

# Approach 1: set the thread-local default seed (simple, good enough for most cases)
ndd.random.set_seed(42)
angles = ndd.random.uniform(batch_size=64, range=(-30, 30))

# Approach 2: explicit RNG object (finer control, pass rng= to each op)
rng = ndd.random.RNG(seed=42)
values = ndd.random.uniform(batch_size=64, range=[0, 1], shape=2, rng=rng)

import nvidia.dali.experimental.dynamic as ndd

ndd.set_num_threads(4)
reader = ndd.readers.File(file_root="/data/imagenet/train", random_shuffle=True)

# num_epochs and train_step are assumed to be defined by the surrounding training script
for epoch in range(num_epochs):
    for jpegs, labels in reader.next_epoch(batch_size=64):
        images = ndd.decoders.image(jpegs, device="gpu")
        images = ndd.resize(images, size=[224, 224])
        images = ndd.crop_mirror_normalize(
            images,
            mean=[0.485 * 255, 0.456 * 255, 0.406 * 255],
            std=[0.229 * 255, 0.224 * 255, 0.225 * 255],
        )
        train_step(images.torch(), labels.torch())
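The `mean` and `std` values in the example above are the usual ImageNet channel statistics multiplied by 255, presumably because decoded images hold 8-bit values in [0, 255] rather than [0, 1]. The NumPy check below is a small sketch of that arithmetic only (it does not use DALI): normalizing raw pixel values with the scaled constants should give the same result as first dividing by 255 and then normalizing with the unscaled constants.

```python
import numpy as np

# Unscaled ImageNet channel statistics (for images in the [0, 1] range)
mean = np.array([0.485, 0.456, 0.406], dtype=np.float64)
std = np.array([0.229, 0.224, 0.225], dtype=np.float64)

# A small random HWC "image" with integer values in [0, 255]
rng = np.random.default_rng(0)
pixels = rng.integers(0, 256, size=(4, 4, 3)).astype(np.float64)

# Variant A: keep pixels in [0, 255] and scale the constants by 255
scaled_constants = (pixels - mean * 255) / (std * 255)

# Variant B: bring pixels to [0, 1] first, then use the unscaled constants
unit_range_first = (pixels / 255 - mean) / std

max_abs_diff = float(np.abs(scaled_constants - unit_range_first).max())
```

The two variants agree up to floating-point rounding, so scaling the constants avoids a separate division of the whole image by 255.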

| Wrong | Right | Why |
| --- | --- | --- |
| `device="mixed"` | `device="gpu"` | `"mixed"` is pipeline mode only |
| `batch[i]` | `batch.select(i)` | `Batch` has no `__getitem__` |
| `batch.select(0)` for per-sample slicing | `batch.slice[0]` | `.select()` picks samples; `.slice` slices within each sample |
| `.evaluate()` after every op | Let consumption trigger eval | `.torch()`, `.shape`, etc. trigger it automatically |
| `.cpu()` before GPU model | `.torch()` directly | Avoids wasteful D2H + H2D round-trip |
| Recreate reader each epoch | `reader.next_epoch()` | Readers are stateful -- create once, reuse |
| `ndd.readers.file(...)` | `ndd.readers.File(...)` | Reader classes are PascalCase |
| `break` from `next_epoch()` loop | Exhaust iterator or create new reader | Iterator must be fully consumed before next `next_epoch()` |
| No `batch_size` to random ops | `ndd.random.uniform(batch_size=N, ...)` | No pipeline-level batch size to inherit |

| Pipeline Mode | Dynamic Mode |
| --- | --- |
| `@pipeline_def` / `pipe.build()` / `pipe.run()` | Direct function calls in a loop |
| `fn.readers.file(...)` | `ndd.readers.File(...)` (PascalCase, stateful) |
| `fn.decoders.image(jpegs, device="mixed")` | `ndd.decoders.image(jpegs, device="gpu")` |
| `fn.op_name(...)` | `ndd.op_name(...)` |
| Pipeline-level `batch_size=64` | `reader.next_epoch(batch_size=64)` + random ops `batch_size=64` |
| Pipeline-level `seed=42` | `ndd.random.set_seed(42)` or `ndd.random.RNG(seed=42)` |
| Pipeline-level `num_threads=4` | `ndd.set_num_threads(4)` at startup |
| `output.at(i)` | `batch.select(i)` |
| `output.as_cpu()` | `batch.cpu()` |
| `pipe.run()` returns tuple of `TensorList` | `reader.next_epoch(batch_size=N)` yields tuples of `Batch` |

| Intent | Method | Returns |
| --- | --- | --- |
| Get sample i | `batch.select(i)` | `Tensor` |
| Get subset of samples | `batch.select(slice_or_list)` | `Batch` |
| Slice within each sample | `batch.slice[...]` | `Batch` (same batch_size) |
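The difference between selecting samples and slicing within samples can be illustrated with a toy model. The `ToyBatch` class below is a hypothetical stand-in built on NumPy arrays; it is not the DALI `Batch` class and only mimics the semantics listed in the table, under the assumption that an integer selects one sample, a slice or list selects a sub-batch, and `.slice[...]` applies the same index to every sample.

```python
import numpy as np


class ToyBatch:
    """Illustrative stand-in for a batch of samples; NOT the DALI Batch class."""

    def __init__(self, samples):
        self.samples = [np.asarray(s) for s in samples]

    def select(self, index):
        # An integer picks one sample; a slice or list picks a sub-batch.
        if isinstance(index, int):
            return self.samples[index]
        if isinstance(index, slice):
            return ToyBatch(self.samples[index])
        return ToyBatch([self.samples[i] for i in index])

    @property
    def slice(self):
        # Returns a helper so that batch.slice[...] indexes inside each sample.
        return _PerSampleSlicer(self)


class _PerSampleSlicer:
    def __init__(self, batch):
        self._batch = batch

    def __getitem__(self, key):
        # Apply the same index to every sample; the batch size is unchanged.
        return ToyBatch([s[key] for s in self._batch.samples])


xy = ToyBatch([[0.1, 0.9], [0.4, 0.6], [0.7, 0.3]])
sample_0 = xy.select(0)        # one whole sample: [x, y]
first_two = xy.select([0, 1])  # sub-batch with two samples
crop_x = xy.slice[0]           # first element of every sample
```

In this toy model `select` changes how many samples there are, while `slice` keeps the number of samples and changes the shape of each one.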

Using Dali Dynamic Mode

DALI Dynamic Mode

Core Data Types

Tensor -- single sample

Batch -- collection of samples (variable shapes OK)

Readers

Device Handling

Execution Model

Thread Configuration

RNG

Example: Image Classification Pipeline

Common Mistakes

Pipeline Mode Migration
