Comfyui Node Datatypes | Skills Pool
ComfyUI data types - IMAGE, LATENT, MASK, CONDITIONING, MODEL, CLIP, VAE, AUDIO, VIDEO, 3D types, widget types, and custom types. Use when working with ComfyUI tensors, model types, or defining input/output data types.
MusfiqurRahma · Updated March 5, 2026
ComfyUI Data Types
ComfyUI uses specific data types for node inputs and outputs. Understanding tensor shapes and data formats is essential.
Complete Type Reference
Tensor/Data Types
| Type | V3 Class | Format | Description |
|---|---|---|---|
| IMAGE | io.Image | torch.Tensor [B,H,W,C] float32 0-1 | Batch of RGB images |
| MASK | io.Mask | torch.Tensor [H,W] or [B,H,W] float32 0-1 | Grayscale masks |
| LATENT | io.Latent | {"samples": Tensor[B,C,H,W], "noise_mask"?: Tensor, "batch_index"?: list[int], "type"?: str} | Latent space |
| CONDITIONING | io.Conditioning | list[tuple[Tensor, PooledDict]] | Text conditioning with pooled outputs |
| AUDIO | io.Audio | {"waveform": Tensor[B,C,T], "sample_rate": int} | Audio data |
| VIDEO | io.Video | VideoInput ABC | Video data (abstract base class) |
| SIGMAS | io.Sigmas | torch.Tensor, 1D, length steps+1 | Noise schedule |
| NOISE | io.Noise | Object with generate_noise() | Noise generator |
| LORA_MODEL | io.LoraModel | dict[str, torch.Tensor] | LoRA weight deltas |
| LOSS_MAP | io.LossMap | {"loss": list[torch.Tensor]} | Loss map |
| TRACKS | io.Tracks | {"track_path": Tensor, "track_visibility": Tensor} | Motion tracking data |
| WAN_CAMERA_EMBEDDING | io.WanCameraEmbedding | torch.Tensor | WAN camera embeddings |
| LATENT_OPERATION | io.LatentOperation | Callable[[Tensor], Tensor] | Latent transform function |
| TIMESTEPS_RANGE | io.TimestepsRange | tuple[int, int] | Range 0.0-1.0 |

Quick install: npx skills add MusfiqurRahma/comfyui-custom-node-skills
Model Types (opaque, typically pass-through)

| Type | V3 Class | Python Type |
|---|---|---|
| MODEL | io.Model | ModelPatcher |
| CLIP | io.Clip | CLIP |
| VAE | io.Vae | VAE |
| CONTROL_NET | io.ControlNet | ControlNet |
| CLIP_VISION | io.ClipVision | ClipVisionModel |
| CLIP_VISION_OUTPUT | io.ClipVisionOutput | ClipVisionOutput |
| STYLE_MODEL | io.StyleModel | StyleModel |
| GLIGEN | io.Gligen | ModelPatcher (wrapping Gligen) |
| UPSCALE_MODEL | io.UpscaleModel | ImageModelDescriptor |
| LATENT_UPSCALE_MODEL | io.LatentUpscaleModel | Any |
| SAMPLER | io.Sampler | Sampler |
| GUIDER | io.Guider | CFGGuider |
| HOOKS | io.Hooks | HookGroup |
| HOOK_KEYFRAMES | io.HookKeyframes | HookKeyframeGroup |
| MODEL_PATCH | io.ModelPatch | Any |
| AUDIO_ENCODER | io.AudioEncoder | Any |
| AUDIO_ENCODER_OUTPUT | io.AudioEncoderOutput | Any |
| PHOTOMAKER | io.Photomaker | Any |
| POINT | io.Point | Any |
| FACE_ANALYSIS | io.FaceAnalysis | Any |
| BBOX | io.BBOX | Any |
| SEGS | io.SEGS | Any |
3D Types

| Type | V3 Class | Python Type | Description |
|---|---|---|---|
| MESH | io.Mesh | MESH(vertices, faces) | 3D mesh with vertices + faces tensors |
| VOXEL | io.Voxel | VOXEL(data) | Voxel data tensor |
| FILE_3D | io.File3DAny | File3D | Any supported 3D format |
| FILE_3D_GLB | io.File3DGLB | File3D | Binary glTF |
| FILE_3D_GLTF | io.File3DGLTF | File3D | JSON-based glTF |
| FILE_3D_FBX | io.File3DFBX | File3D | FBX format |
| FILE_3D_OBJ | io.File3DOBJ | File3D | OBJ format |
| FILE_3D_STL | io.File3DSTL | File3D | STL format (3D printing) |
| FILE_3D_USDZ | io.File3DUSDZ | File3D | Apple AR format |
| SVG | io.SVG | SVG | Scalable vector graphics |
| LOAD_3D | io.Load3D | {"image": str, "mask": str, "normal": str, "camera_info": CameraInfo} | 3D model with renders |
| LOAD_3D_ANIMATION | io.Load3DAnimation | Same as Load3D | Animated 3D model |
| LOAD3D_CAMERA | io.Load3DCamera | {"position": dict, "target": dict, "zoom": int, "cameraType": str} | 3D camera info |
Widget Types

| Type | V3 Class | Python Type | Description |
|---|---|---|---|
| INT | io.Int | int | Integer with min/max/step |
| FLOAT | io.Float | float | Float with min/max/step/round |
| STRING | io.String | str | Text (single/multi-line) |
| BOOLEAN | io.Boolean | bool | Toggle with labels |
| COMBO | io.Combo | str | Dropdown selection |
| COMBO (multi) | io.MultiCombo | list[str] | Multi-select dropdown |
| COLOR | io.Color | str (hex) | Color picker, default #ffffff |
| BOUNDING_BOX | io.BoundingBox | {"x": int, "y": int, "width": int, "height": int} | Rectangle region |
| CURVE | io.Curve | list[tuple[float, float]] | Spline curve points |
| IMAGECOMPARE | io.ImageCompare | dict | Image comparison widget |
| WEBCAM | io.Webcam | str | Webcam capture widget |
Special Types

| Type | V3 Class | Description |
|---|---|---|
| * (ANY) | io.AnyType | Matches any type |
| COMFY_MULTITYPED_V3 | io.MultiType | Accept multiple specific types on one input |
| COMFY_MATCHTYPE_V3 | io.MatchType | Generic type matching across inputs/outputs |
| COMFY_AUTOGROW_V3 | io.Autogrow | Dynamic growing inputs |
| COMFY_DYNAMICCOMBO_V3 | io.DynamicCombo | Combo that reveals sub-inputs per option |
| COMFY_DYNAMICSLOT_V3 | io.DynamicSlot | Slot that reveals sub-inputs when connected |
| FLOW_CONTROL | io.FlowControl | Internal testing only |
| ACCUMULATION | io.Accumulation | Internal testing only |
IMAGE Type
Images are torch.Tensor with shape [B, H, W, C]:
B = batch size (1 for single image)
H = height in pixels
W = width in pixels
C = channels (3 for RGB, values 0.0-1.0)
import torch
import numpy as np
from PIL import Image as PILImage
from comfy_api.latest import io
class ImageProcessor(io.ComfyNode):
@classmethod
def define_schema(cls):
return io.Schema(
node_id="ImageProcessor",
display_name="Image Processor",
category="image",
inputs=[io.Image.Input("image")],
outputs=[io.Image.Output("IMAGE")],
)
@classmethod
def execute(cls, image):
b, h, w, c = image.shape
result = torch.clamp(image * 1.5, 0.0, 1.0)
return io.NodeOutput(result)
Loading / Saving Images
from PIL import ImageOps
# Load from file → tensor
def load_image(path):
img = PILImage.open(path)
img = ImageOps.exif_transpose(img) # fix rotation from camera EXIF
if img.mode == "I": # handle 16-bit images
img = img.point(lambda i: i * (1 / 255))
img = img.convert("RGB")
return torch.from_numpy(np.array(img).astype(np.float32) / 255.0).unsqueeze(0)
# Tensor → save to file
def save_image(tensor, path):
if tensor.dim() == 4:
tensor = tensor[0]
PILImage.fromarray(np.clip(255.0 * tensor.cpu().numpy(), 0, 255).astype(np.uint8)).save(path)
# Batch operations
batch = torch.cat([img1, img2], dim=0) # stack into batch
single = image[i] # extract from batch [H,W,C]
single_batch = image.unsqueeze(0) # add batch dim [1,H,W,C]
MASK Type
A mask is a torch.Tensor with shape [H, W] or [B, H, W], values 0.0-1.0.
# Invert mask
inverted = 1.0 - mask
# Mask ↔ Image conversion
alpha = mask.unsqueeze(0).unsqueeze(-1) # [1,H,W,1]
gray_mask = 0.299*img[:,:,:,0] + 0.587*img[:,:,:,1] + 0.114*img[:,:,:,2]
image_from_mask = mask.unsqueeze(-1).repeat(1, 1, 1, 3) # [B,H,W,3]
# Ensure batch dim
if mask.dim() == 2:
mask = mask.unsqueeze(0) # [1, H, W]
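Putting the mask patterns above together, a minimal runnable sketch (the shapes are illustrative):

```python
import torch

# Illustrative 2-D mask: a 64x64 canvas with a lit 32x32 square.
mask = torch.zeros(64, 64)
mask[16:48, 16:48] = 1.0

# Ensure batch dim: [H, W] -> [1, H, W]
if mask.dim() == 2:
    mask = mask.unsqueeze(0)

inverted = 1.0 - mask                                # values stay in 0-1
image_like = mask.unsqueeze(-1).repeat(1, 1, 1, 3)   # [1, H, W, 3] IMAGE view
```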
LATENT Type
from typing import TypedDict, NotRequired  # NotRequired: Python 3.11+, else typing_extensions
import torch

class LatentDict(TypedDict):
samples: torch.Tensor # [B, C, H, W] - required
noise_mask: NotRequired[torch.Tensor]
batch_index: NotRequired[list[int]]
type: NotRequired[str] # only for "audio", "hunyuan3dv2"
Latent spatial dimensions are 1/8 of the pixel dimensions. SD1.5/SDXL latents have 4 channels; SD3/Flux have 16.
samples = latent["samples"] # [B, C, H, W]
# Always preserve extra keys when modifying:
result = latent.copy()
result["samples"] = modified_samples
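A runnable sketch of the copy-preserving pattern. The shapes are illustrative: a 1024x1024 SDXL image maps to a [1, 4, 128, 128] latent (1/8 spatial size, 4 channels).

```python
import torch

latent = {
    "samples": torch.zeros(1, 4, 128, 128),
    "batch_index": [0],   # extra key that must survive modification
}

# Shallow-copy the dict, then swap in the new samples tensor; noise_mask,
# batch_index, and type (when present) are carried over untouched.
result = latent.copy()
result["samples"] = latent["samples"] + 0.1
```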
CONDITIONING Type
list[tuple[Tensor, PooledDict]]: a list of (cond_tensor, metadata_dict) pairs.
The PooledDict contains many optional keys for different models:
class PooledDict(TypedDict):
pooled_output: torch.Tensor
control: NotRequired[ControlNet]
area: NotRequired[tuple[int, ...]]
strength: NotRequired[float] # default 1.0
mask: NotRequired[torch.Tensor]
start_percent: NotRequired[float] # 0.0-1.0
end_percent: NotRequired[float] # 0.0-1.0
guidance: NotRequired[float] # Flux-like models
hooks: NotRequired[HookGroup]
# ... many more model-specific keys (SDXL, SVD, WAN, etc.)
Combine conditioning: result = cond_a + cond_b (list concatenation).
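A minimal sketch of both patterns, using dummy tensors in place of real encoder output (the 77x768 shape is just a CLIP-like placeholder):

```python
import torch

# Dummy (cond_tensor, metadata_dict) entries standing in for encoded prompts.
cond_a = [(torch.zeros(1, 77, 768), {"pooled_output": torch.zeros(1, 768)})]
cond_b = [(torch.ones(1, 77, 768), {"pooled_output": torch.ones(1, 768)})]

combined = cond_a + cond_b   # plain list concatenation: two entries

# When editing metadata, copy each dict so upstream conditioning is untouched.
weakened = [(t, {**d, "strength": 0.25}) for t, d in cond_a]
```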
VIDEO Type
VideoInput is an abstract base class with methods:
class VideoInput(ABC):
def get_components(self) -> VideoComponents # images tensor + audio + frame_rate
def save_to(self, path, format, codec, metadata)
def as_trimmed(self, start_time, duration) -> VideoInput | None
def get_stream_source(self) -> str | BytesIO
def get_dimensions(self) -> tuple[int, int] # (width, height)
def get_duration(self) -> float # seconds
def get_frame_count(self) -> int
def get_frame_rate(self) -> Fraction
def get_container_format(self) -> str
Concrete implementations: VideoFromFile, VideoFromComponents (available via from comfy_api.latest import InputImpl).
3D Types
File3D
from comfy_api.latest import Types
# File3D wraps a 3D file (disk path or BytesIO stream)
file_3d = Types.File3D(source="/path/to/model.glb", file_format="glb")
file_3d.format # "glb"
file_3d.is_disk_backed # True
file_3d.get_data() # BytesIO
file_3d.get_bytes() # raw bytes
file_3d.save_to("/output/model.glb")
MESH and VOXEL
from comfy_api.latest import Types
mesh = Types.MESH(vertices=torch.tensor(...), faces=torch.tensor(...))
voxel = Types.VOXEL(data=torch.tensor(...))
Color
io.Color.Input("color", default="#ff0000", socketless=True)
# Value is a hex string like "#ff0000"
BoundingBox
io.BoundingBox.Input("bbox",
default={"x": 0, "y": 0, "width": 512, "height": 512},
socketless=True,
component="my_component", # optional custom UI component
)
# Value is {"x": int, "y": int, "width": int, "height": int}
Curve
io.Curve.Input("curve",
default=[(0.0, 0.0), (1.0, 1.0)], # linear
socketless=True,
)
# Value is list of (x, y) tuples
MultiCombo
io.MultiCombo.Input("tags",
options=["tag1", "tag2", "tag3"],
default=["tag1"],
placeholder="Select tags...",
chip=True, # show as chips
)
# Value is list[str]
Webcam
io.Webcam.Input("webcam_capture")
# Value is str (captured image data)
ImageCompare
io.ImageCompare.Input("comparison", socketless=True)
# Value is dict
Custom Types
# Simple: create inline custom type
MyData = io.Custom("MY_DATA_TYPE")
# Use in inputs/outputs
io.Schema(
inputs=[MyData.Input("data")],
outputs=[MyData.Output("MY_DATA")],
)
Advanced: @comfytype decorator
For custom types with type hints or custom Input/Output classes:
from typing import Any
from comfy_api.latest._io import comfytype, ComfyTypeIO
@comfytype(io_type="MY_DATA_TYPE")
class MyData(ComfyTypeIO):
Type = dict[str, Any] # type hint for the data
AnyType / Wildcard
# Accept any single type (always a connection input, no widget)
io.AnyType.Input("anything")
# Accept specific multiple types
io.MultiType.Input("data", types=[io.Image, io.Mask, io.Latent])
# MultiType with widget override (shows widget for first type)
io.MultiType.Input(
io.Float.Input("value", default=1.0),
types=[io.Float, io.Int],
)
Imports from comfy_api.latest
from comfy_api.latest import (
ComfyExtension, # extension registration
ComfyAPI, # runtime API (progress, node replacement)
io, # all io types (io.Image, io.Schema, io.ComfyNode, etc.)
ui, # UI output helpers (ui.PreviewImage, ui.SavedImages, etc.)
Input, # Input.Image (ImageInput), Input.Audio, Input.Mask, Input.Latent, Input.Video
InputImpl, # InputImpl.VideoFromFile, InputImpl.VideoFromComponents
Types, # Types.MESH, Types.VOXEL, Types.File3D, Types.VideoCodec, etc.
)
Tensor Safety
When checking whether a tensor input exists, always use is not None instead of truthiness:
# CORRECT
if image is not None:
process(image)
# WRONG — multi-element tensors don't support bool()
if image: # raises RuntimeError
process(image)
# For boolean conditions on tensors, use .all() or .any()
if (mask > 0.5).all():
...
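A runnable demonstration of why the identity check matters: torch refuses to coerce a multi-element tensor to bool.

```python
import torch

mask = torch.rand(4, 4)

# bool() on a multi-element tensor is ambiguous and raises RuntimeError.
try:
    bool(mask)
    raised = False
except RuntimeError:
    raised = True

exists = mask is not None            # safe "was an input provided" check
any_lit = bool((mask > 0.5).any())   # reductions give a single boolean
```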
Type Conversion Patterns
# IMAGE [B,H,W,C] → MASK [B,H,W]
mask = 0.299 * image[:,:,:,0] + 0.587 * image[:,:,:,1] + 0.114 * image[:,:,:,2]
# MASK [B,H,W] → IMAGE [B,H,W,C]
image = mask.unsqueeze(-1).repeat(1, 1, 1, 3)
# Resize image tensor
import torch.nn.functional as F
resized = F.interpolate(
image.permute(0, 3, 1, 2), # [B,C,H,W] for interpolate
size=(new_h, new_w), mode='bilinear', align_corners=False
).permute(0, 2, 3, 1) # back to [B,H,W,C]
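The permute round-trip above can be checked with concrete (illustrative) shapes; bilinear interpolation also keeps values inside the original 0-1 range:

```python
import torch
import torch.nn.functional as F

image = torch.rand(2, 32, 48, 3)      # IMAGE layout [B, H, W, C]

resized = F.interpolate(
    image.permute(0, 3, 1, 2),        # to [B, C, H, W] for interpolate
    size=(64, 96), mode="bilinear", align_corners=False,
).permute(0, 2, 3, 1)                 # back to [B, H, W, C]
```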
See Also
comfyui-node-basics - Node class structure and registration
comfyui-node-inputs - Input configuration details (widget options)
comfyui-node-outputs - Output types and UI outputs
comfyui-node-advanced - MatchType, MultiType, Autogrow, DynamicCombo