刘斌, 王凯歌, 李晓蒙, 胡春海. 基于语义部位分割的条纹斑竹鲨鱼体运动姿态解析[J]. 农业工程学报, 2021, 37(3): 179-187. DOI: 10.11975/j.issn.1002-6819.2021.03.022
    引用本文: 刘斌, 王凯歌, 李晓蒙, 胡春海. 基于语义部位分割的条纹斑竹鲨鱼体运动姿态解析[J]. 农业工程学报, 2021, 37(3): 179-187. DOI: 10.11975/j.issn.1002-6819.2021.03.022
    Liu Bin, Wang Kaige, Li Xiaomeng, Hu Chunhai. Motion posture parsing of Chiloscyllium plagiosum fish body based on semantic part segmentation[J]. Transactions of the Chinese Society of Agricultural Engineering (Transactions of the CSAE), 2021, 37(3): 179-187. DOI: 10.11975/j.issn.1002-6819.2021.03.022
    Citation: Liu Bin, Wang Kaige, Li Xiaomeng, Hu Chunhai. Motion posture parsing of Chiloscyllium plagiosum fish body based on semantic part segmentation[J]. Transactions of the Chinese Society of Agricultural Engineering (Transactions of the CSAE), 2021, 37(3): 179-187. DOI: 10.11975/j.issn.1002-6819.2021.03.022

    基于语义部位分割的条纹斑竹鲨鱼体运动姿态解析

    Motion posture parsing of Chiloscyllium plagiosum fish body based on semantic part segmentation

    • 摘要: 条纹斑竹鲨具有较高的经济价值和医用研究价值。人工驯养对环境和温度等因素要求较高,时常出现大规模病死现象。利用视频图像量化分析鱼体运动行为,有助于进行异常识别和早期预警,将有效提高养殖养护水平。该研究针对人工驯养的条纹斑竹鲨鱼,提出一种基于深度神经网络的语义部位分割方法,并将分割结果应用于剖析条斑鲨鱼体运动姿态。首先,依据条斑鲨形态特征将其划分为7个可视的身体组成构件(头部、右胸鳍、左胸鳍、右腹鳍、左腹鳍、躯干、尾巴);再对全景养殖监控视频中抽取的476幅条斑鲨子图进行各部位的像素级标记,通过数据增强到1 944幅建立鱼体语义部位数据集,其中训练集为1 166幅图像,测试集为778幅图像;然后,在语义分割网络模型基础上进行深度学习训练,使用深度学习框架对网络参数进行微调使得网络训练结果达到最优。最后,利用语义部位分割结果定位躯干和鱼头质心建立随体坐标,通过随体坐标的方向变化判明鱼体动作姿态。基于FCN-8s和Segnet两种深度网络模型进行了鱼体部位分割的对比试验,测试结果表明基于Segnet网络的分割方法在头部、右胸鳍、左胸鳍、右腹鳍、左腹鳍、躯干、尾巴部位的准确度分别高出FCN-8s深度网络1.50,4.70,6.95,6.56,6.01,0.85,0.84个百分点。语义部位分割结果能够有效判别条斑鲨鱼体目标的动作姿态,可为鱼体异常行为识别和进一步开展面向条斑鲨的动物行为学试验提供技术参考。

       

      Abstract: The Chiloscyllium plagiosum has high economic and medical value. However, the real artificial breeding conditions cannot meet the high requirements for the breeding environment of marine fish, such as water quality and temperature, often leading to large-scale illness even death. Since video imaging has been widely used to quantitatively analyze the movement behavior of farmed fish, the technique can contribute to identifying abnormal behavior for the early warning, and thereby effectively improving the level of breeding and conservation. In this study, an imaging algorithm was proposed for the semantic part segmentation of Chiloscyllium plagiosum using encoder-decoder architecture, thereby analyzing the body movement and posture of the Chiloscyllium plagiosum. Three steps were as follows: 1) The images of Chiloscyllium plagiosum were divided into 7 visible body components, according to the morphological characteristics, including the head, left pectoral fin, right pectoral fin, left ventral fin, right ventral fin, trunk, and tail. Then, the sub-images of Chiloscyllium plagiosum were extracted from the video images in the panoramic breeding surveillance under a breeding circumstance, where a total of 476 candidate patterns were obtained, while all the images in the dataset were manually marked. After that, data augmentation was used to increase the number of images, and thus a total of 1 944 images were obtained, of which 1 166 images were selected as training images, and 778 images were selected as test images. 2) The pre-processed training dataset was fed into the network model of semantic segmentation by fine-tuning network parameters, where a deep learning framework was used to optimize the network training for the best. Then, the test dataset was put into the trained model for the segmentation. 3) Post-processing was performed to fill the holes within objects or remove small objects, where a disk structure of mathematical morphology was used to calculate the areas of connected regions. Simple and effective post-processing was utilized to obtain the optimal segmentation of fish body images under complex backgrounds or interference environments. Then, the semantic part segmentations in different colors were used to locate the centroid of the fish head and trunk for the body coordinates. The posture of the target was analyzed to calculate in a single frame image, and thereby identify the movement changes of the fish body in the frame sequence. The main steps of this work included: 1) To draw the body coordinates; 2) to analyze and calculate the direction of the fish body; 3) to identify the direction of movement. Compared with the Segnet and FCN-8s network architecture for semantic part segmentation, the test dataset showed that the segmentation using the Segnet network improved the accuracy of FCN-18s network by 1.5, 4.7, 6.95, 6.56, 6.01, 0.85, and 0.84 percentage points, respectively. Semantic part segmentation can be used to effectively distinguish the action posture of Chiloscyllium plagiosum body. The finding can lay a foundation for the recognition of abnormal fish behavior and further development of animal behavior experiments for the Chiloscyllium plagiosum.

       

    /

    返回文章
    返回