Object Detection
Overview
Object Detection node detects objects in an image using a custom-loaded model and a configurable post-processing pipeline.
This node gives full control over the detection stack: model selection, confidence filtering, NMS, label mapping, frame tiling, and overlay styling. Use this node when General Object Detection does not offer enough control, or when you need to run a domain-specific model.
Input
Input Image
image requiredThe image frame to analyze. Connect this to a camera or upstream image output.
Model Directory Path
string requiredPath to the model directory to load for inference. See Model Directory Path for details.
Confidence Threshold
number required advancedMinimum confidence score to keep a detection. See Confidence Threshold for tuning guidance.
Range: 0.0 to 1.0
Default: 0.3
Overlap threshold
number required advancedBoxes overlapping Filter (higher allows more overlap). See Overlap threshold for tuning guidance.
Range: 0.0 to 1.0
Default: 0.5
Use TensorRT If Available
boolean advancedWhen enabled, try to use TensorRT backend for inference (Nvidia GPU only). This may improve inference speed. For the first run, it will take some time to convert the model, subsequent runs will be much faster. If the model is not compatible, it will fallback to non-TensorRT inference with a warning.
Divide Frame
boolean advancedSplits a large frame into overlapping regions before inference, then merges results. Useful for improving recall on small objects in high-resolution images.
Transform Contains
array advancedRule-based relabeling by text containment on detected labels.
Transform Labels
array advancedDirect mapping from source labels to target labels.
Keep Labels
array advancedAllow-list of labels to keep in final output.
Ignore Labels
array advancedDeny-list of labels to remove from final output.
Overlay Results
boolean required advancedWhether to draw detection overlays on output frame. See Overlay Results.
Default: true
Label Colors
object advancedCustom per-label color mapping for overlay rendering.
Custom Model Configs
boolean advancedReveals extra framework and architecture fields for non-.nam models. See Custom Model Configs.
Model Color Format
Color format of the image data sent to the model. If left blank, the default for the chosen architecture is used. Adjust if your model was trained with a different color format.
RGB- Red, Green, Blue channel order.BGR- Blue, Green, Red channel order.
Scale
Multiplicative factor for the R, G, and B channels of the image before sending to the neural network. If left blank, the default for the chosen architecture is used. Used to normalize pixel values to the range expected by the model.
Mean
Mean values for the R, G, and B channels that are subtracted from the image before sending to the neural network. If left blank, the default for the chosen architecture is used. Used to normalize the image to match the data distribution the model was trained on.
Input Format
Tensor format for the image data sent to the neural network. If left blank, the default for the chosen architecture is used.
NHWC- Batch x Height x Width x Channel (channels last).NCHW- Batch x Channel x Height x Width (channels first).
Input Type
Data type of the image input sent to the neural network.
Uint8- Pixel values are integers in the range 0-255.Float32- Pixel values are 32-bit floating point numbers, often in the range 0.0-1.0 or normalized according to the model's training.
Post Process Type
Method for processing the raw output from the neural network before producing final detections. If left blank, the default for the chosen architecture is used.
objdet_ontf2: ONNX TensorFlow 2"detection_classes": [N], [1_class_id, ..., n_class_id], "detection_scores": [N], [1_score, ..., n_score], "detection_boxes": [N*4], [1_y1, 1_x1, 1_y2, 1_x2, ..., n_y1, n_x1, n_y2, n_x2]objdet_onmmyolo: ONNX YOLO"labels": [N], [1_class_id, ..., n_class_id], "scores": [N], [1_score, ..., n_score], "boxes": [N*4], [1_x1, 1_y1, 1_x2, 1_y2, ..., n_x1, n_y1, n_x2, n_y2]objdet_onyoloworldv2: ONNX YOLO WorldV2"scores": [N*C], [1_class_a_score, 1_class_b_score, ..., n_class_a_score, n_class_b_score, ...], "boxes": [N*4], [1_x1, 1_y1, 1_x2, 1_y2, ..., n_x1, n_y1, n_x2, n_y2]objdet_onmmdet: ONNX MMDetection"dets": [N*5] e.g. [1_x1, 1_y1, 1_x2, 1_y2, 1_score, ..., n_x1, n_y1, n_x2, n_y2, n_score], "labels": [N] e.g. [1_class_id, ..., n_class_id]
Note
N: number of detected objectCis the number of classes
Output
Overlay Image
imageFrame with detection overlays.
Detected Count
integerNumber of detected objects.
Detected Objects
arrayArray of detected objects, each containing:
bboxarray:[x, y, width, height]labelstringconfidencenumber
Tips
- Use a higher confidence threshold to reduce false positives.
- Tune Overlap threshold based on how close objects appear in your scene.
- When objects are small in high-resolution frames, frame division can improve detection recall.