A Real-Time Multi-Stage Architecture for Pose Estimation of Zebrafish Head with Convolutional Neural Networks

Zhang-Jin Huang; Xiang-Xiang He; Fang-Jun Wang; Qing Shen

doi:10.1007/s11390-021-9599-5

Regular Paper

Zhang-Jin Huang¹,²,³, Xiang-Xiang He¹, Fang-Jun Wang¹,², Qing Shen¹

¹ School of Computer Science and Technology, University of Science and Technology of China, Hefei 230027, China

² School of Data Science, University of Science and Technology of China, Hefei 230027, China

³ Anhui Province Key Laboratory of Software in Computing and Communication, Hefei 230027, China

Recommended by CAD/Graphics 2019

Abstract

Optical neurophysiology experiments on a freely swimming zebrafish require the pose of the zebrafish head to be quantified so that exact lighting positions can be determined. To quantify the head's behavior efficiently with limited resources, we propose a real-time multi-stage architecture based on convolutional neural networks for pose estimation of the zebrafish head on CPUs. Each stage is implemented with a small neural network. In the first stage, a light-weight object detector named Micro-YOLO detects a coarse region of the zebrafish head. In the second stage, a tiny bounding box refinement network, RegressNet, produces a high-quality bounding box around the head. Finally, a small pose estimation network named tiny-hourglass detects keypoints in the zebrafish head. The experimental results show that predicting the zebrafish head region with Micro-YOLO combined with RegressNet is both more accurate and much faster than with Faster R-CNN, a representative two-stage detector. Compared with DeepLabCut, a state-of-the-art method for estimating the poses of user-defined body parts, our multi-stage architecture achieves higher accuracy and runs 19x faster on CPUs.
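The coarse-to-fine flow described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the three stages are passed in as plain callables, and every name here (`estimate_head_pose`, `clamp_box`, `crop`, and the `stub_*` functions) is hypothetical. The stubs stand in for Micro-YOLO, the bounding box refinement network, and tiny-hourglass, and the box format `(x_min, y_min, x_max, y_max)` is an assumption made for illustration.

```python
from typing import Callable, List, Optional, Sequence, Tuple

Image = Sequence[Sequence[float]]   # rows of pixel values (assumed layout)
Box = Tuple[int, int, int, int]     # (x_min, y_min, x_max, y_max), assumed format
Point = Tuple[float, float]         # (x, y)


def clamp_box(box: Box, width: int, height: int) -> Box:
    """Clip a box to the image and keep it at least one pixel wide and tall."""
    x0, y0, x1, y1 = box
    x0 = max(0, min(x0, width - 1))
    y0 = max(0, min(y0, height - 1))
    x1 = max(x0 + 1, min(x1, width))
    y1 = max(y0 + 1, min(y1, height))
    return (x0, y0, x1, y1)


def crop(image: Image, box: Box) -> List[List[float]]:
    """Cut the region covered by the box out of the image."""
    x0, y0, x1, y1 = box
    return [list(row[x0:x1]) for row in image[y0:y1]]


def estimate_head_pose(
    image: Image,
    detect: Callable[[Image], Optional[Box]],
    refine: Callable[[Image, Box], Box],
    locate: Callable[[Image], List[Point]],
) -> Optional[List[Point]]:
    """Run the three stages in order and return keypoints in image coordinates."""
    height, width = len(image), len(image[0])
    coarse = detect(image)                      # stage 1: coarse head region
    if coarse is None:
        return None                             # no head found in this frame
    box = clamp_box(refine(image, coarse), width, height)  # stage 2: tighter box
    patch = crop(image, box)
    x0, y0 = box[0], box[1]
    # Stage 3 works on the crop, so shift its keypoints back to the full image.
    return [(x + x0, y + y0) for x, y in locate(patch)]


# Hypothetical stand-ins for the three networks, used only to exercise the flow.
def stub_detect(image: Image) -> Optional[Box]:
    return (2, 2, 8, 8)


def stub_refine(image: Image, box: Box) -> Box:
    x0, y0, x1, y1 = box
    return (x0 + 1, y0 + 1, x1 - 1, y1 - 1)


def stub_locate(patch: Image) -> List[Point]:
    return [(len(patch[0]) / 2, len(patch) / 2)]  # centre of the crop


frame = [[0.0] * 10 for _ in range(10)]
keypoints = estimate_head_pose(frame, stub_detect, stub_refine, stub_locate)
```

Passing the stages as callables keeps the control flow separate from any particular network, so each stub could be swapped for a real model without touching `estimate_head_pose`.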

Keywords

convolutional neural network; pose estimation; real-time; zebrafish

Electronic Supplementary Material

jcst-36-2-434-Highlights.pdf (560.2 KB)

References

[1]

Cong L, Wang Z, Chai Y, Han W, Shang C, Yang W, Bai L, Du J, Wang K, Wen Q. Rapid whole brain imaging of neural activity in freely behaving larval zebrafish (Danio rerio). Elife, 2017, 6: Article No. e28158. https://doi.org/10.7554/elife.28158.

[2]

Xu Z P, Cheng X E. Zebrafish tracking using convolutional neural networks. Scientific Reports, 2017, 7: Article No. 42815. https://doi.org/10.1038/srep42815.

[3]

Mathis A, Mamidanna P, Cury K M, Abe T, Murthy V N, Mathis M W, Bethge M. DeepLabCut: Markerless pose estimation of user-defined body parts with deep learning. Nature Neuroscience, 2018, 21: 1281-1289. https://doi.org/10.1038/s41593-018-0209-y.

[4]

Girshick R, Donahue J, Darrell T, Malik J. Rich feature hierarchies for accurate object detection and semantic segmentation. In Proc. the 2014 IEEE Conference on Computer Vision and Pattern Recognition, June 2014, pp.580-587. https://doi.org/10.1109/CVPR.2014.81.