DetectionSet
DetectionSet represents a set of object detections.
Example JSON payload:
{
  "detections": [
    {
      "video_id": "05497d88-488e-4731-a96e-7aaffa9d63e7",
      "frame_time": 1750683362080,
      "track_id": 127,
      "frame_width": 1920,
      "frame_height": 1080,
      "obj_type": "person",
      "box": {
        "x": 1738,
        "y": 389,
        "width": 51,
        "height": 74
      },
      "confidence": 0.6845703
    }
  ]
}
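As an illustration of how a consumer might read this payload, the sketch below parses it into typed Python objects. The names `Box`, `Detection`, and `parse_detection_set` are hypothetical helpers introduced here, not part of any published client library, and the sketch assumes every field shown in the example is always present.

```python
import json
from dataclasses import dataclass


@dataclass(frozen=True)
class Box:
    # Top-left corner and size of the bounding box.
    x: int
    y: int
    width: int
    height: int


@dataclass(frozen=True)
class Detection:
    video_id: str
    frame_time: int  # milliseconds since the Unix epoch
    track_id: int
    frame_width: int
    frame_height: int
    obj_type: str
    box: Box
    confidence: float


def parse_detection_set(payload: str) -> list[Detection]:
    """Parse a DetectionSet JSON string into Detection objects.

    Hypothetical helper: raises KeyError or TypeError if the payload
    does not match the field layout shown in the example.
    """
    data = json.loads(payload)
    detections = []
    for item in data["detections"]:
        fields = dict(item)
        fields["box"] = Box(**fields["box"])  # nested object becomes a Box
        detections.append(Detection(**fields))
    return detections


EXAMPLE = """
{"detections": [{
  "video_id": "05497d88-488e-4731-a96e-7aaffa9d63e7",
  "frame_time": 1750683362080,
  "track_id": 127,
  "frame_width": 1920,
  "frame_height": 1080,
  "obj_type": "person",
  "box": {"x": 1738, "y": 389, "width": 51, "height": 74},
  "confidence": 0.6845703
}]}
"""

if __name__ == "__main__":
    for det in parse_detection_set(EXAMPLE):
        print(det.obj_type, det.track_id, det.box)
```

Frozen dataclasses are used so a parsed detection cannot be modified by accident. A stricter consumer might validate types and value ranges instead of relying on keyword unpacking.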
Field Descriptions
- video_id: string
  A unique identifier for the video from which the frame was extracted. Useful for correlating detections with specific video streams.
- frame_time: integer (timestamp in milliseconds)
  The time at which the frame was captured, in milliseconds since the Unix epoch. This allows alignment with other time-based data streams.
- track_id: integer
  An identifier used to associate detections across frames, i.e., to track the same object over time. Unique per video per object instance.
- frame_width: integer
  Width of the video frame in pixels. Useful for normalizing coordinates or calculating relative positions.
- frame_height: integer
  Height of the video frame in pixels. Like frame_width, this helps interpret bounding box dimensions in context.
- obj_type: string
  The semantic class of the detected object, e.g., "person", "car", etc. Derived from the object detection model.
- box: object
  Describes the bounding box around the detected object within the frame:
  - x: integer. X-coordinate of the top-left corner of the box.
  - y: integer. Y-coordinate of the top-left corner of the box.
  - width: integer. Width of the bounding box in pixels.
  - height: integer. Height of the bounding box in pixels.
- confidence: float
  A score between 0 and 1 indicating the model's confidence in the detection. Higher values imply greater certainty.
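The field descriptions above suggest a few common operations: converting frame_time to a calendar time, normalizing box by the frame dimensions, and grouping detections by track_id. The following is a minimal sketch of those operations on plain dictionaries as returned by `json.loads`. The function names and the `min_confidence` parameter are assumptions made for illustration, and the sketch assumes the box coordinates are expressed in pixels of the same frame that frame_width and frame_height describe.

```python
from collections import defaultdict
from datetime import datetime, timedelta, timezone

EPOCH = datetime(1970, 1, 1, tzinfo=timezone.utc)


def frame_time_to_datetime(frame_time_ms: int) -> datetime:
    """Convert milliseconds since the Unix epoch to an aware UTC datetime."""
    # Integer timedelta arithmetic avoids floating-point rounding.
    return EPOCH + timedelta(milliseconds=frame_time_ms)


def normalized_box(det: dict) -> dict:
    """Scale the box to the 0..1 range using the frame dimensions."""
    box = det["box"]
    return {
        "x": box["x"] / det["frame_width"],
        "y": box["y"] / det["frame_height"],
        "width": box["width"] / det["frame_width"],
        "height": box["height"] / det["frame_height"],
    }


def group_by_track(detections: list, min_confidence: float = 0.0) -> dict:
    """Group detections by (video_id, track_id), sorted by frame_time.

    The key includes video_id because track_id is described as unique
    per video, not globally. Detections below min_confidence are dropped.
    """
    tracks = defaultdict(list)
    for det in detections:
        if det["confidence"] >= min_confidence:
            tracks[(det["video_id"], det["track_id"])].append(det)
    for dets in tracks.values():
        dets.sort(key=lambda d: d["frame_time"])
    return dict(tracks)
```

For example, `group_by_track(payload["detections"], min_confidence=0.5)` would return one chronologically ordered list per tracked object, skipping low-confidence detections. The 0.5 threshold here is arbitrary and would need tuning for a given model.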