Segments-Based 3D ConvNet for Action Recognition

Wei Li; Ning Xu; Ge Liu; Linglan Zhao; Xiangzhong Fang

doi:10.1088/1742-6596/1621/1/012042

Journal of Physics: Conference Series

Paper • The following article is Open access

Segments-Based 3D ConvNet for Action Recognition

Wei Li¹, Ning Xu¹, Ge Liu¹, Linglan Zhao¹ and Xiangzhong Fang¹

Published under licence by IOP Publishing Ltd
Journal of Physics: Conference Series, Volume 1621, 2020 International Conference on Computer Science and Communication Technology (ICCSCT) 2020 25-26 July 2020, Hangzhou, China Citation Wei Li et al 2020 J. Phys.: Conf. Ser. 1621 012042 DOI 10.1088/1742-6596/1621/1/012042

Download Article PDF

Article metrics

98 Total downloads

Author e-mails

liweihfyz@sjtu.edu.cn

Author affiliations

¹ Department of Electronic Engineering, Shanghai Jiao Tong University, Shanghai, China

Buy this article in print

Journal RSS

Sign up for new issue notifications

Abstract

Learning to capture both long-range and short-range temporal information is crucial for action recognition task. Previous works utilize 3D ConvNets to capture short-range temporal dynamics in replacement of optical-flow which needs time-consuming extraction. However, dramatically incresed parameters limit the capacity for modeling long-term interactions. In this paper, we propose Segments-based 3D ConvNet (S3D) to integrate both long-term and short-term temporal dynamics. Firstly, we utilize 3D ResNet without temporal downsampling to capture short-range video contents. Secondly, we integrate a sparse sampling strategy to model long-range temporal structure. Finally, experiments on UCF-101 and HMDB-51 datasets show the effectiveness of our S3D compared with corresponding 3D ConvNet.

Export citation and abstract BibTeX RIS

Previous article in issue

Next article in issue

Content from this work may be used under the terms of the Creative Commons Attribution 3.0 licence. Any further distribution of this work must maintain attribution to the author(s) and the title of the work, journal citation and DOI.

Please wait… references are loading.

Segments-Based 3D ConvNet for Action Recognition

Article metrics

Share this article

Author e-mails

Author affiliations

Abstract