WCSE 2019 SUMMER ISBN: 978-981-14-1684-2
DOI: 10.18178/wcse.2019.06.064

FSNet: Pose Estimation of Endoscopic Surgical Tools Using Feature Stacked Network

Yakui Chu, Xilin Yang, Yuan Ding, Danni Ai, Jingfan Fan, Xu Li, Yongtian Wang, Jian Yang

Abstract— Identification of surgical instruments is important to understand surgical scenarios and provide assistant processing in endoscopic image-guided surgery. In this paper, we propose a novel feature stacked network (FSNet) for the recognition of surgical tools in endoscopic images. With a lateral connection and concatenation operation on the different layers of the feature pyramid network, high-level semantic information is fused to low-level features, and the bounding boxes are regressed for the tool instance proposals. Then, low-level semantic information is propagated to a high-level network through the bottom-up feature concatenating path. The keypoints of tools are detected in each proposed boundary box. Two state-ofthe- art end-to-end tool keypoint recognition networks and three backbones are implemented for comparison. The AP and AR of the our FSNet based on ResNeXt101 are 46.1% and 36.5%, respectively, which surpass the results of other methods.

Index Terms— pose estimation, endoscopic image, convolutional neural networks, image-guided surgery

Yakui Chu, Xilin Yang, Yuan Ding, Danni Ai, Jingfan Fan, Xu Li, Jian Yang
Beijing Engineering Research Center of Mixed Reality and Advanced Display, School of Optics and Photonics, Beijing Institute of Technology, CHINA
Yongtian Wang
AICFVE of Beijing Film Academy, 4, Xitucheng Rd, Haidian, CHINA

[Download]


Cite: Yakui Chu, Xilin Yang, Yuan Ding, Danni Ai, Jingfan Fan, Xu Li, Yongtian Wang, Jian Yang, "FSNet: Pose Estimation of Endoscopic Surgical Tools Using Feature Stacked Network," Proceedings of 2019 the 9th International Workshop on Computer Science and Engineering, pp. 427-431, Hong Kong, 15-17 June, 2019.