HyperAI

Action Triplet Detection

Action Triplet Detection is a subtask in the field of computer vision that focuses on detecting and localizing the bounding boxes of instruments and anatomical structures in images, and predicting the action relationships between them to form <instrument, verb, target> triplets. This task aims to enhance the accuracy and efficiency of understanding surgical scenes, providing key technical support for applications such as medical robot navigation and surgical video analysis.