DetectionSet¶

DetectionSet represents a set of object detections.

Example JSON payload:

{
  "detections":[
    {
      "video_id":"05497d88-488e-4731-a96e-7aaffa9d63e7",
      "frame_time":1750683362080,
      "track_id":127,
      "frame_width":1920,
      "frame_height":1080,
      "obj_type":"person",
      "box":{
        "x":1738,
        "y":389,
        "width":51,
        "height":74
      },
      "confidence":0.6845703
    }
  ]
}

Field Descriptions¶

video_id: string
A unique identifier for the video from which the frame was extracted. Useful for correlating detections with specific video streams.
frame_time: integer (timestamp in milliseconds)
Epoch timestamp indicating when the frame was captured, in milliseconds since Unix epoch. This allows alignment with other time-based data streams.
track_id: integer
An identifier used to associate detections across frames, i.e., to track the same object over time. Unique per video per object instance.
frame_width: integer
Width of the video frame in pixels. Useful for normalizing coordinates or calculating relative positions.
frame_height: integer
Height of the video frame in pixels. Like frame_width, this helps interpret bounding box dimensions in context.
obj_type: string
The semantic class of the detected object, e.g., "person", "car", etc. Derived from the object detection model.
box: object
Describes the bounding box around the detected object within the frame:
x: integer — X-coordinate of the top-left corner of the box.
y: integer — Y-coordinate of the top-left corner of the box.
width: integer — Width of the bounding box in pixels.
height: integer — Height of the bounding box in pixels.
confidence: float
A score between 0 and 1 indicating the model’s confidence in the detection. Higher values imply greater certainty.