research-article

DeepSportradar-v2: A Multi-Sport Computer Vision Dataset for Sport Understandings

Authors:
Maxime Istasse

Sportradar AG & UCLouvain, Louvain-la-Neuve, Belgium

Sportradar AG & UCLouvain, Louvain-la-Neuve, Belgium

0000-0002-5153-1106
View Profile

,
Vladimir Somers

Sportradar AG & EPFL & UCLouvain, Louvain-la-Neuve, Belgium

Sportradar AG & EPFL & UCLouvain, Louvain-la-Neuve, Belgium

0000-0001-5787-4276
View Profile

,
Pratheeban Elancheliyan

Sportradar AG, St. Gallen, Switzerland

Sportradar AG, St. Gallen, Switzerland

0009-0005-5166-4151
View Profile

,
Jaydeep De

Sportradar AG, St. Gallen, Switzerland

Sportradar AG, St. Gallen, Switzerland

0009-0004-7706-4480
View Profile

,
Davide Zambrano

Sportradar AG, St. Gallen, Switzerland

Sportradar AG, St. Gallen, Switzerland

0000-0003-3977-4647
View Profile

MMSports '23: Proceedings of the 6th International Workshop on Multimedia Content Analysis in SportsOctober 2023Pages 23–29https://doi.org/10.1145/3606038.3616160

Published:29 October 2023Publication History

MMSports '23: Proceedings of the 6th International Workshop on Multimedia Content Analysis in Sports

Pages 23–29

ABSTRACT

Advanced data collection technologies, computational tools, and sophisticated algorithms have a revolutionary impact on sports analytics on various aspects of sports, from athletes performance to fan engagement. Computer Vision (CV) and Deep Learning (DL) technologies play a crucial role in predicting players and game states from videos, but their effectiveness depends on the quantity and quality of training data, especially in sports with unique dynamics and camera angles. Each sport comes with its own set of challenges.

This paper introduces DeepSportradar-v2, a multi-sport suite of CV tasks that address the need for high-quality datasets for different sports. Supporting multi-sport allows academic researchers to better understand the dynamics of each sport and their specific challenges. In this paper, we first report the results from the 2022 competition, and provide all resources to replicate each result. Then, we present a newly released Cricket dataset and task, given the global popularity and relevance of this sport for the automated analysis and video understanding.

Similarly to the first edition, a competition has been organized as part of the MMSports workshop, where participants are invited to develop state-of-the-art methods for solving the proposed tasks using the publicly available datasets, development kits, and baselines.

Supplemental Material

mmsp022-video.mp4

mp4

247.5 MB

Download

References

Antony Anuraj, Gurtej S Boparai, Carson K Leung, Evan WR Madill, Darshan A Pandhi, Ayush Dilipkumar Patel, and Ronak K Vyas. 2023. Sports data mining for cricket match prediction. In International Conference on Advanced Information Networking and Applications. Springer, 668--680.Google ScholarCross Ref
Ben Athiwaratkun, Marc Finzi, Pavel Izmailov, and Andrew Gordon Wilson. 2018. There are many consistent explanations of unlabeled data: Why you should average. arXiv preprint arXiv:1806.05594 (2018).Google Scholar
Kai Chen, Jiaqi Wang, Jiangmiao Pang, Yuhang Cao, Yu Xiong, Xiaoxiao Li, Shuyang Sun, Wansen Feng, Ziwei Liu, Jiarui Xu, Zheng Zhang, Dazhi Cheng, Chenchen Zhu, Tianheng Cheng, Qijie Zhao, Buyu Li, Xin Lu, Rui Zhu, Yue Wu, Jifeng Dai, Jingdong Wang, Jianping Shi, Wanli Ouyang, Chen Change Loy, and Dahua Lin. 2019. MMDetection: Open MMLab Detection Toolbox and Benchmark. arXiv preprint arXiv:1906.07155 (2019).Google Scholar
Anthony Cioppa, Adrien Deliège, Silvio Giancola, Bernard Ghanem, and Marc Van Droogenbroeck. 2022. Scaling up SoccerNet with multi-view spatial localization and re-identification. Scientific Data 9, 1 (2022), 1--9.Google ScholarCross Ref
Anthony Cioppa, Silvio Giancola, Adrien Deliege, Le Kang, Xin Zhou, Zhiyu Cheng, Bernard Ghanem, and Marc Van Droogenbroeck. 2022. SoccerNetTracking: Multiple Object Tracking Dataset and Benchmark in Soccer Videos. In Proceedings of the IEEE/CVF Conference on CVPR. 3491--3502.Google Scholar
Adrien Deliege, Anthony Cioppa, Silvio Giancola, Meisam J Seikavandi, Jacob V Dueholm, Kamal Nasrollahi, Bernard Ghanem, Thomas B Moeslund, and Marc Van Droogenbroeck. 2021. Soccernet-v2: A dataset and benchmarks for holistic understanding of broadcast soccer videos. In Proceedings of the IEEE/CVF Conference on CVPR. 4508--4519.Google ScholarCross Ref
Jia Deng, Wei Dong, Richard Socher, Li-Jia Li, Kai Li, and Li Fei-Fei. 2009. Imagenet: A large-scale hierarchical image database. In 2009 IEEE conference on computer vision and pattern recognition. Ieee, 248--255.Google ScholarCross Ref
Martin A Fischler and Robert C Bolles. 1981. Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography. Commun. ACM 24, 6 (1981), 381--395.Google ScholarDigital Library
Golnaz Ghiasi, Yin Cui, Aravind Srinivas, Rui Qian, Tsung-Yi Lin, Ekin D Cubuk, Quoc V Le, and Barret Zoph. 2021. Simple copy-paste is a strong data augmentation method for instance segmentation. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 2918--2928.Google ScholarCross Ref
Silvio Giancola, Mohieddine Amine, Tarek Dghaily, and Bernard Ghanem. 2018. Soccernet: A scalable dataset for action spotting in soccer videos. In Proceedings of CVPR workshops. 1711--1721.Google ScholarCross Ref
Silvio Giancola, Anthony Cioppa, Adrien Deliège, Floriane Magera, Vladimir Somers, Le Kang, Xinxing Zhou, Olivier Barnich, Christophe De Vleeschouwer, Alexandre Alahi, Bernard Ghanem, Marc Van Droogenbroeck, Abdulrahman Darwish, Adrien Maglo, Albert Clapés, Andreas Luyts, Andrei A. Boiarov, Artur Xarles, Astrid Orcesi, Avijit Shah, Baoyu Fan, Bharath Comandur, Chen Chen, Chenle Zhang, Chen Zhao, Che-Hsien Lin, Cheuk-Yiu Chan, Chun Chuen Hui, Dengjie Li, Fan Yang, Fan Liang, Fang Da, F. L. Yan, Fufu Yu, Guanshuo Wang, H. Anthony Chan, He Zhu, Hongwei Kan, Jiaming Chu, Jianming Hu, Jianyang Gu, Jin Chen, João Victor Bentes Soares, Jonas Theiner, Jorge De Corte, José Henrique Brito, Jun Zhang, Junjie Li, Junwei Liang, Leqi Shen, Lin Ma, Lin Chen, M L Santos Marqués, Mike Azatov, N. I. Kasatkin, Ning Wang, Qi Jia, Quoc Cuong Pham, Ralph Ewerth, Ran Song, Rengang Li, Rikke Gade, Ruben Debien, Runze Zhang, Sangrok Lee, Sergio Escalera, Shan Jiang, Shigeyuki Odashima, Shi-Jin Chen, Shoichi Masui, Shouhong Ding, Sin wai Chan, Siyu Chen, Tallal El-Shabrawy, Tao He, Thomas Baltzer Moeslund, Wan-Chi Siu, Wei Zhang, W. Li, Xiangwei Wang, Xiao Tan, Xiaochuan Li, Xiaolin Wei, Xiaoqing Ye, Xing Liu, Xinying Wang, Yan Guo, Yaqian Zhao, Yi Yu, Yingying Li, Yue He, Yujie Zhong, Zhenhua Guo, and Zhiheng Li. 2022. SoccerNet 2022 Challenges Results. Proceedings of the 5th International ACM Workshop on Multimedia Content Analysis in Sports (2022).Google ScholarDigital Library
Konrad Habel, Fabian Deuser, and Norbert Oswald. 2022. CLIP-ReIdent: Contrastive Training for Player Re-Identification. In Proceedings of the 5th International ACM Workshop on Multimedia Content Analysis in Sports. 129--135.Google ScholarDigital Library
Kaiming He, Georgia Gkioxari, Piotr Dollár, and Ross Girshick. 2017. Mask r-cnn. In Proceedings of the IEEE international conference on computer vision. 2961--2969.Google ScholarCross Ref
Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition. 770--778.Google ScholarCross Ref
Sergey Ioffe and Christian Szegedy. 2015. Batch normalization: Accelerating deep network training by reducing internal covariate shift. In International conference on machine learning. pmlr, 448--456.Google Scholar
Pavel Izmailov, Dmitrii Podoprikhin, Timur Garipov, Dmitry Vetrov, and Andrew Gordon Wilson. 2018. Averaging weights leads to wider optima and better generalization. arXiv preprint arXiv:1803.05407 (2018).Google Scholar
Alexander Kirillov, Kaiming He, Ross Girshick, Carsten Rother, and Piotr Dollár. 2019. Panoptic segmentation. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 9404--9413.Google ScholarCross Ref
Alexander Kirillov, Eric Mintun, Nikhila Ravi, Hanzi Mao, Chloe Rolland, Laura Gustafson, Tete Xiao, Spencer Whitehead, Alexander C. Berg, Wan-Yen Lo, Piotr Dollár, and Ross Girshick. 2023. Segment Anything. arXiv:2304.02643 (2023).Google Scholar
Tsung-Yi Lin, Michael Maire, Serge Belongie, James Hays, Pietro Perona, Deva Ramanan, Piotr Dollár, and C Lawrence Zitnick. 2014. Microsoft coco: Common objects in context. In Computer Vision--ECCV 2014: 13th European Conference, Zurich, Switzerland, September 6--12, 2014, Proceedings, Part V 13. Springer, 740-- 755.Google Scholar
Ilya Loshchilov and Frank Hutter. 2017. Decoupled weight decay regularization. arXiv preprint arXiv:1711.05101 (2017).Google Scholar
Adrien Maglo, Astrid Orcesi, and Quoc-Cuong Pham. 2022. KaliCalib: A Framework for Basketball Court Registration. In Proceedings of the 5th International ACM Workshop on Multimedia Content Analysis in Sports. 111--116.Google ScholarDigital Library
Xiaohan Nie, Shixing Chen, and Raffay Hamid. 2021. A robust and efficient framework for sports-field registration. In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision. 1936--1944.Google ScholarCross Ref
Alec Radford, Jong Wook Kim, Chris Hallacy, Aditya Ramesh, Gabriel Goh, Sandhini Agarwal, Girish Sastry, Amanda Askell, Pamela Mishkin, Jack Clark, et al. 2021. Learning transferable visual models from natural language supervision. In International conference on machine learning. PMLR, 8748--8763.Google Scholar
Prajit Ramachandran, Barret Zoph, and Quoc V Le. 2017. Searching for activation functions. arXiv preprint arXiv:1710.05941 (2017).Google Scholar
AZisserman RHartley. 2003. MultipleViewGeometryinComputer Vision.Google Scholar
Vladimir Somers, Christophe De Vleeschouwer, and Alexandre Alahi. 2022. Body Part-Based Representation Learning for Occluded Person Re-Identification. 2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) (2022), 1613--1623.Google Scholar
Gabriel Van Zandycke, Vladimir Somers, Maxime Istasse, Carlo Del Don, and Davide Zambrano. 2022. Deepsportradar-v1: Computer vision dataset for sports understanding with high quality annotations. In Proceedings of the 5th International ACM Workshop on Multimedia Content Analysis in Sports. 1--8.Google ScholarDigital Library
Bo Yan, Fengliang Qi, Zhuang Li, Yadong Li, and Hongbin Wang. 2022. Strong Instance Segmentation Pipeline for MMSports Challenge. arXiv preprint arXiv:2209.13899 (2022).Google Scholar

Index Terms

DeepSportradar-v2: A Multi-Sport Computer Vision Dataset for Sport Understandings
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision problems
        Image segmentation
      2. Computer vision tasks
        Activity recognition and understanding
        Scene understanding
        Video summarization

Recommendations

DeepSportradar-v1: Computer Vision Dataset for Sports Understanding with High Quality Annotations
MMSports '22: Proceedings of the 5th International ACM Workshop on Multimedia Content Analysis in Sports

With the recent development of Deep Learning applied to Computer Vision, sport video understanding has gained a lot of attention, providing much richer information for both sport consumers and leagues. This paper introduces DeepSportradar-v1, a suite of ...
Read More
Game idea jam for sport and exertion games
CHI PLAY '14: Proceedings of the first ACM SIGCHI annual symposium on Computer-human interaction in play

Game Jams have successfully been introduced to the CHI Community during the past two years. Game developers meet to plan, design, and create one or more games within a short time span (ranging from 24 to 48 hours). We propose a Game Idea Jam focusing on ...
Read More
Can Some Computer Games Be a Sport?: Issues with Legitimization of eSport as a Sporting Activity

This paper focuses on the problem of social legitimization of eSport in the context of traditional sports. Its objective is to investigate knowledge and attitudes towards eSports, as well as their recognition as legitimate sports. The first part of the ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
MMSports '23: Proceedings of the 6th International Workshop on Multimedia Content Analysis in Sports
October 2023
174 pages
ISBN:9798400702693
DOI:10.1145/3606038
Program Chairs:
Rainer Lienhart
University of Augsburg
,
Thomas B. Moeslund
Aalborg University
,
Hideo Saito
Keio University
Copyright © 2023 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 29 October 2023
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
basketball
computer vision
cricket
datasets
deep learning
neural networks
sport understanding
sports
Qualifiers
- research-article
Conference

Acceptance Rates
Overall Acceptance Rate29of49submissions,59%
Upcoming Conference
MM '24

Sponsor:

sigmm

MM '24: The 32nd ACM International Conference on Multimedia

October 28 - November 1, 2024

Melbourne , VIC , Australia
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 1
  Total Citations
  View Citations
- 110
  Total Downloads
- Downloads (Last 12 months)110
- Downloads (Last 6 weeks)15
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

DeepSportradar-v2: A Multi-Sport Computer Vision Dataset for Sport Understandings

MMSports '23: Proceedings of the 6th International Workshop on Multimedia Content Analysis in Sports

ABSTRACT

Supplemental Material

References

Cited By

Index Terms

Recommendations

DeepSportradar-v1: Computer Vision Dataset for Sports Understanding with High Quality Annotations

Game idea jam for sport and exertion games

Can Some Computer Games Be a Sport?: Issues with Legitimization of eSport as a Sporting Activity