DetectionSet

DetectionSet represents a set of object detections.

Example JSON payload:

{
  "detections":[
    {
      "video_id":"05497d88-488e-4731-a96e-7aaffa9d63e7",
      "frame_time":1750683362080,
      "track_id":127,
      "frame_width":1920,
      "frame_height":1080,
      "obj_type":"person",
      "box":{
        "x":1738,
        "y":389,
        "width":51,
        "height":74
      },
      "confidence":0.6845703
    }
  ]
}
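
As a rough illustration of consuming this payload, the sketch below parses it into typed records using only the Python standard library. The names Box, Detection, and parse_detection_set are hypothetical and not part of any published client. The sketch also assumes every detection carries exactly the fields shown above.

```python
import json
from dataclasses import dataclass


@dataclass(frozen=True)
class Box:
    x: int
    y: int
    width: int
    height: int


@dataclass(frozen=True)
class Detection:
    video_id: str
    frame_time: int  # milliseconds since the Unix epoch
    track_id: int
    frame_width: int
    frame_height: int
    obj_type: str
    box: Box
    confidence: float


def parse_detection_set(payload: str) -> list[Detection]:
    """Parse a DetectionSet JSON string into a list of Detection records.

    Assumption: each item has exactly the documented fields; a missing or
    unexpected key raises TypeError when the dataclass is constructed.
    """
    data = json.loads(payload)
    detections = []
    for item in data["detections"]:
        fields = dict(item)  # copy so the parsed JSON is not mutated
        fields["box"] = Box(**fields["box"])
        detections.append(Detection(**fields))
    return detections


# The example payload from this page, as a string.
example = """
{
  "detections": [
    {
      "video_id": "05497d88-488e-4731-a96e-7aaffa9d63e7",
      "frame_time": 1750683362080,
      "track_id": 127,
      "frame_width": 1920,
      "frame_height": 1080,
      "obj_type": "person",
      "box": {"x": 1738, "y": 389, "width": 51, "height": 74},
      "confidence": 0.6845703
    }
  ]
}
"""

detections = parse_detection_set(example)
```

Frozen dataclasses are used here only so that parsed records are immutable and comparable by value. A plain dict works equally well if stricter typing is not needed.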

Field Descriptions

  • video_id: string
    A unique identifier for the video from which the frame was extracted. Useful for correlating detections with specific video streams.

  • frame_time: integer (timestamp in milliseconds)
    The time at which the frame was captured, in milliseconds since the Unix epoch. This allows alignment with other time-based data streams.

  • track_id: integer
    An identifier used to associate detections across frames, i.e., to track the same object over time. Each object instance has a unique track_id within a given video.

  • frame_width: integer
    Width of the video frame in pixels. Useful for normalizing coordinates or calculating relative positions.

  • frame_height: integer
    Height of the video frame in pixels. Like frame_width, this helps interpret bounding box dimensions in context.

  • obj_type: string
    The semantic class of the detected object, e.g., "person" or "car", as assigned by the object detection model.

  • box: object
    Describes the bounding box around the detected object within the frame:

      • x: integer. X-coordinate of the top-left corner of the box.
      • y: integer. Y-coordinate of the top-left corner of the box.
      • width: integer. Width of the bounding box in pixels.
      • height: integer. Height of the bounding box in pixels.

  • confidence: float
    A score between 0 and 1 indicating the model’s confidence in the detection. Higher values imply greater certainty.
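
The fields described above lend themselves to a few common operations: converting frame_time to a datetime, scaling box by the frame size, and grouping detections by track. The following is a minimal sketch of those three, working on plain dicts as returned by json.loads. The helper names are hypothetical. The normalization assumes that x and y are in pixels measured from the frame's top-left corner, which this page does not state explicitly.

```python
from collections import defaultdict
from datetime import datetime, timedelta, timezone

EPOCH = datetime(1970, 1, 1, tzinfo=timezone.utc)


def frame_time_to_utc(frame_time_ms: int) -> datetime:
    """Convert milliseconds since the Unix epoch to a timezone-aware UTC datetime."""
    # Integer timedelta arithmetic avoids floating-point rounding.
    return EPOCH + timedelta(milliseconds=frame_time_ms)


def normalized_box(det: dict) -> tuple[float, float, float, float]:
    """Return (x, y, width, height) scaled to the 0..1 range.

    Assumption: box coordinates are pixels relative to the frame's
    top-left corner.
    """
    box = det["box"]
    fw, fh = det["frame_width"], det["frame_height"]
    return (box["x"] / fw, box["y"] / fh, box["width"] / fw, box["height"] / fh)


def group_by_track(detections: list[dict]) -> dict[tuple[str, int], list[dict]]:
    """Group detections by (video_id, track_id), each group sorted by frame_time."""
    groups = defaultdict(list)
    for det in detections:
        # track_id is only unique within a video, so the key includes video_id.
        groups[(det["video_id"], det["track_id"])].append(det)
    for track in groups.values():
        track.sort(key=lambda d: d["frame_time"])
    return dict(groups)


# The detection from the example payload on this page.
det = {
    "video_id": "05497d88-488e-4731-a96e-7aaffa9d63e7",
    "frame_time": 1750683362080,
    "track_id": 127,
    "frame_width": 1920,
    "frame_height": 1080,
    "obj_type": "person",
    "box": {"x": 1738, "y": 389, "width": 51, "height": 74},
    "confidence": 0.6845703,
}

print(frame_time_to_utc(det["frame_time"]).isoformat())
# prints 2025-06-23T12:56:02.080000+00:00
```

Keying tracks on the pair (video_id, track_id) rather than track_id alone follows from the uniqueness scope given in the track_id description.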