ABSTRACT
Advanced data collection technologies, computational tools, and sophisticated algorithms have a revolutionary impact on sports analytics on various aspects of sports, from athletes performance to fan engagement. Computer Vision (CV) and Deep Learning (DL) technologies play a crucial role in predicting players and game states from videos, but their effectiveness depends on the quantity and quality of training data, especially in sports with unique dynamics and camera angles. Each sport comes with its own set of challenges.
This paper introduces DeepSportradar-v2, a multi-sport suite of CV tasks that address the need for high-quality datasets for different sports. Supporting multi-sport allows academic researchers to better understand the dynamics of each sport and their specific challenges. In this paper, we first report the results from the 2022 competition, and provide all resources to replicate each result. Then, we present a newly released Cricket dataset and task, given the global popularity and relevance of this sport for the automated analysis and video understanding.
Similarly to the first edition, a competition has been organized as part of the MMSports workshop, where participants are invited to develop state-of-the-art methods for solving the proposed tasks using the publicly available datasets, development kits, and baselines.
Supplemental Material
- Antony Anuraj, Gurtej S Boparai, Carson K Leung, Evan WR Madill, Darshan A Pandhi, Ayush Dilipkumar Patel, and Ronak K Vyas. 2023. Sports data mining for cricket match prediction. In International Conference on Advanced Information Networking and Applications. Springer, 668--680.Google ScholarCross Ref
- Ben Athiwaratkun, Marc Finzi, Pavel Izmailov, and Andrew Gordon Wilson. 2018. There are many consistent explanations of unlabeled data: Why you should average. arXiv preprint arXiv:1806.05594 (2018).Google Scholar
- Kai Chen, Jiaqi Wang, Jiangmiao Pang, Yuhang Cao, Yu Xiong, Xiaoxiao Li, Shuyang Sun, Wansen Feng, Ziwei Liu, Jiarui Xu, Zheng Zhang, Dazhi Cheng, Chenchen Zhu, Tianheng Cheng, Qijie Zhao, Buyu Li, Xin Lu, Rui Zhu, Yue Wu, Jifeng Dai, Jingdong Wang, Jianping Shi, Wanli Ouyang, Chen Change Loy, and Dahua Lin. 2019. MMDetection: Open MMLab Detection Toolbox and Benchmark. arXiv preprint arXiv:1906.07155 (2019).Google Scholar
- Anthony Cioppa, Adrien Deliège, Silvio Giancola, Bernard Ghanem, and Marc Van Droogenbroeck. 2022. Scaling up SoccerNet with multi-view spatial localization and re-identification. Scientific Data 9, 1 (2022), 1--9.Google ScholarCross Ref
- Anthony Cioppa, Silvio Giancola, Adrien Deliege, Le Kang, Xin Zhou, Zhiyu Cheng, Bernard Ghanem, and Marc Van Droogenbroeck. 2022. SoccerNetTracking: Multiple Object Tracking Dataset and Benchmark in Soccer Videos. In Proceedings of the IEEE/CVF Conference on CVPR. 3491--3502.Google Scholar
- Adrien Deliege, Anthony Cioppa, Silvio Giancola, Meisam J Seikavandi, Jacob V Dueholm, Kamal Nasrollahi, Bernard Ghanem, Thomas B Moeslund, and Marc Van Droogenbroeck. 2021. Soccernet-v2: A dataset and benchmarks for holistic understanding of broadcast soccer videos. In Proceedings of the IEEE/CVF Conference on CVPR. 4508--4519.Google ScholarCross Ref
- Jia Deng, Wei Dong, Richard Socher, Li-Jia Li, Kai Li, and Li Fei-Fei. 2009. Imagenet: A large-scale hierarchical image database. In 2009 IEEE conference on computer vision and pattern recognition. Ieee, 248--255.Google ScholarCross Ref
- Martin A Fischler and Robert C Bolles. 1981. Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography. Commun. ACM 24, 6 (1981), 381--395.Google ScholarDigital Library
- Golnaz Ghiasi, Yin Cui, Aravind Srinivas, Rui Qian, Tsung-Yi Lin, Ekin D Cubuk, Quoc V Le, and Barret Zoph. 2021. Simple copy-paste is a strong data augmentation method for instance segmentation. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 2918--2928.Google ScholarCross Ref
- Silvio Giancola, Mohieddine Amine, Tarek Dghaily, and Bernard Ghanem. 2018. Soccernet: A scalable dataset for action spotting in soccer videos. In Proceedings of CVPR workshops. 1711--1721.Google ScholarCross Ref
- Silvio Giancola, Anthony Cioppa, Adrien Deliège, Floriane Magera, Vladimir Somers, Le Kang, Xinxing Zhou, Olivier Barnich, Christophe De Vleeschouwer, Alexandre Alahi, Bernard Ghanem, Marc Van Droogenbroeck, Abdulrahman Darwish, Adrien Maglo, Albert Clapés, Andreas Luyts, Andrei A. Boiarov, Artur Xarles, Astrid Orcesi, Avijit Shah, Baoyu Fan, Bharath Comandur, Chen Chen, Chenle Zhang, Chen Zhao, Che-Hsien Lin, Cheuk-Yiu Chan, Chun Chuen Hui, Dengjie Li, Fan Yang, Fan Liang, Fang Da, F. L. Yan, Fufu Yu, Guanshuo Wang, H. Anthony Chan, He Zhu, Hongwei Kan, Jiaming Chu, Jianming Hu, Jianyang Gu, Jin Chen, João Victor Bentes Soares, Jonas Theiner, Jorge De Corte, José Henrique Brito, Jun Zhang, Junjie Li, Junwei Liang, Leqi Shen, Lin Ma, Lin Chen, M L Santos Marqués, Mike Azatov, N. I. Kasatkin, Ning Wang, Qi Jia, Quoc Cuong Pham, Ralph Ewerth, Ran Song, Rengang Li, Rikke Gade, Ruben Debien, Runze Zhang, Sangrok Lee, Sergio Escalera, Shan Jiang, Shigeyuki Odashima, Shi-Jin Chen, Shoichi Masui, Shouhong Ding, Sin wai Chan, Siyu Chen, Tallal El-Shabrawy, Tao He, Thomas Baltzer Moeslund, Wan-Chi Siu, Wei Zhang, W. Li, Xiangwei Wang, Xiao Tan, Xiaochuan Li, Xiaolin Wei, Xiaoqing Ye, Xing Liu, Xinying Wang, Yan Guo, Yaqian Zhao, Yi Yu, Yingying Li, Yue He, Yujie Zhong, Zhenhua Guo, and Zhiheng Li. 2022. SoccerNet 2022 Challenges Results. Proceedings of the 5th International ACM Workshop on Multimedia Content Analysis in Sports (2022).Google ScholarDigital Library
- Konrad Habel, Fabian Deuser, and Norbert Oswald. 2022. CLIP-ReIdent: Contrastive Training for Player Re-Identification. In Proceedings of the 5th International ACM Workshop on Multimedia Content Analysis in Sports. 129--135.Google ScholarDigital Library
- Kaiming He, Georgia Gkioxari, Piotr Dollár, and Ross Girshick. 2017. Mask r-cnn. In Proceedings of the IEEE international conference on computer vision. 2961--2969.Google ScholarCross Ref
- Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition. 770--778.Google ScholarCross Ref
- Sergey Ioffe and Christian Szegedy. 2015. Batch normalization: Accelerating deep network training by reducing internal covariate shift. In International conference on machine learning. pmlr, 448--456.Google Scholar
- Pavel Izmailov, Dmitrii Podoprikhin, Timur Garipov, Dmitry Vetrov, and Andrew Gordon Wilson. 2018. Averaging weights leads to wider optima and better generalization. arXiv preprint arXiv:1803.05407 (2018).Google Scholar
- Alexander Kirillov, Kaiming He, Ross Girshick, Carsten Rother, and Piotr Dollár. 2019. Panoptic segmentation. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 9404--9413.Google ScholarCross Ref
- Alexander Kirillov, Eric Mintun, Nikhila Ravi, Hanzi Mao, Chloe Rolland, Laura Gustafson, Tete Xiao, Spencer Whitehead, Alexander C. Berg, Wan-Yen Lo, Piotr Dollár, and Ross Girshick. 2023. Segment Anything. arXiv:2304.02643 (2023).Google Scholar
- Tsung-Yi Lin, Michael Maire, Serge Belongie, James Hays, Pietro Perona, Deva Ramanan, Piotr Dollár, and C Lawrence Zitnick. 2014. Microsoft coco: Common objects in context. In Computer Vision--ECCV 2014: 13th European Conference, Zurich, Switzerland, September 6--12, 2014, Proceedings, Part V 13. Springer, 740-- 755.Google Scholar
- Ilya Loshchilov and Frank Hutter. 2017. Decoupled weight decay regularization. arXiv preprint arXiv:1711.05101 (2017).Google Scholar
- Adrien Maglo, Astrid Orcesi, and Quoc-Cuong Pham. 2022. KaliCalib: A Framework for Basketball Court Registration. In Proceedings of the 5th International ACM Workshop on Multimedia Content Analysis in Sports. 111--116.Google ScholarDigital Library
- Xiaohan Nie, Shixing Chen, and Raffay Hamid. 2021. A robust and efficient framework for sports-field registration. In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision. 1936--1944.Google ScholarCross Ref
- Alec Radford, Jong Wook Kim, Chris Hallacy, Aditya Ramesh, Gabriel Goh, Sandhini Agarwal, Girish Sastry, Amanda Askell, Pamela Mishkin, Jack Clark, et al. 2021. Learning transferable visual models from natural language supervision. In International conference on machine learning. PMLR, 8748--8763.Google Scholar
- Prajit Ramachandran, Barret Zoph, and Quoc V Le. 2017. Searching for activation functions. arXiv preprint arXiv:1710.05941 (2017).Google Scholar
- AZisserman RHartley. 2003. MultipleViewGeometryinComputer Vision.Google Scholar
- Vladimir Somers, Christophe De Vleeschouwer, and Alexandre Alahi. 2022. Body Part-Based Representation Learning for Occluded Person Re-Identification. 2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) (2022), 1613--1623.Google Scholar
- Gabriel Van Zandycke, Vladimir Somers, Maxime Istasse, Carlo Del Don, and Davide Zambrano. 2022. Deepsportradar-v1: Computer vision dataset for sports understanding with high quality annotations. In Proceedings of the 5th International ACM Workshop on Multimedia Content Analysis in Sports. 1--8.Google ScholarDigital Library
- Bo Yan, Fengliang Qi, Zhuang Li, Yadong Li, and Hongbin Wang. 2022. Strong Instance Segmentation Pipeline for MMSports Challenge. arXiv preprint arXiv:2209.13899 (2022).Google Scholar
Index Terms
- DeepSportradar-v2: A Multi-Sport Computer Vision Dataset for Sport Understandings
Recommendations
DeepSportradar-v1: Computer Vision Dataset for Sports Understanding with High Quality Annotations
MMSports '22: Proceedings of the 5th International ACM Workshop on Multimedia Content Analysis in SportsWith the recent development of Deep Learning applied to Computer Vision, sport video understanding has gained a lot of attention, providing much richer information for both sport consumers and leagues. This paper introduces DeepSportradar-v1, a suite of ...
Game idea jam for sport and exertion games
CHI PLAY '14: Proceedings of the first ACM SIGCHI annual symposium on Computer-human interaction in playGame Jams have successfully been introduced to the CHI Community during the past two years. Game developers meet to plan, design, and create one or more games within a short time span (ranging from 24 to 48 hours). We propose a Game Idea Jam focusing on ...
Can Some Computer Games Be a Sport?: Issues with Legitimization of eSport as a Sporting Activity
This paper focuses on the problem of social legitimization of eSport in the context of traditional sports. Its objective is to investigate knowledge and attitudes towards eSports, as well as their recognition as legitimate sports. The first part of the ...
Comments