soccersum
SoccerSum: Dataset for Automated Detection, Segmentation, and Tracking of Objects on the Soccer Pitch
Science Score: 67.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
✓DOI references
Found 9 DOI reference(s) in README -
✓Academic publication links
Links to: zenodo.org -
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (12.7%) to scientific vocabulary
Keywords
Repository
SoccerSum: Dataset for Automated Detection, Segmentation, and Tracking of Objects on the Soccer Pitch
Basic Info
- Host: GitHub
- Owner: simula
- License: mit
- Language: Python
- Default Branch: main
- Homepage: https://dl.acm.org/doi/10.1145/3625468.3652197
- Size: 6.65 MB
Statistics
- Stars: 1
- Watchers: 5
- Forks: 1
- Open Issues: 0
- Releases: 0
Topics
Metadata Files
README.md
SoccerSum
Abstract
The rapid advancement of technology has been revolutionizing the field of sports media, where there is a growing need for sophisticated data processing methods. Current methodologies for extracting information from soccer broadcast videos to generate game highlights and summaries for social media are predominantly manual and rely heavily on text-based NLP techniques, overlooking the rich visual and auditory information available. In response to this challenge, our research introduces SoccerSum, a tool that innovates in the field by integrating computer vision, audio analysis with advanced language models like GPT-4. This multimodal approach enables automated, enriched content summarization, including detection of players and key field elements, thereby enhancing the metadata used in summarization algorithms. SoccerSum uniquely combines textual and visual data, offering a comprehensive solution for generating accurate, platform-specific content. This development represents a significant advancement in automated, data-driven sports media dissemination, and sets a new benchmark in the realm of soccer information extraction. A video of the demo can be found here: Demonstration Video.
Dataset
The SoccerSum dataset was curated by capturing and annotating soccer videos from the Norwegian Eliteserien league. This collection spans three years, covering 2021, 2022, and 2023. It comprises 750 frames from 41 unique sequences, with 4 to 40 frames per sequence, carefully chosen to represent a diverse selection of scenarios encountered in professional soccer games. - Zenodo Record: SoccerSum dataset on Zenodo - Simula Dataset Page: Link to page
[!TIP] How to use SoccerSum?
Source Code
- Repository: SoccerSum GitHub
- Programming Languages: Python, JavaScript, HTML, CSS, AJAX
- Python Version: Python 3.11
Packages and Libraries
- Required Python Packages:
flask,re,torch,torchvision,ultralytics,cv2,numpy,requests,openai,whisper,librosa,sklearn.feature_extraction.text,sklearn.metrics.pairwise,matplotlib,mpl_toolkits.mplot3d
Hardware Requirements
- System Tested On: Linux
- GPU Requirement: At least 1 GPU with 4 GB VRAM
Installation and Experimentation
Setup
Clone the SoccerSum repository:
git clone https://github.com/simula/SoccerSumInstall the necessary requirements:
cd SoccerSum pip install -r requirements.txtDownload the model weights for Detection, Segmentation, and Classification: SoccerSum HuggingFace repository
Place the weights in the
weightsfolder. Adjust paths inmain.py.
Running the Application
Ensure port 5000 is free on your machine.
Start the Flask application:
python3 app.pyThe GUI will open in your default web browser.
Usage
- Provide the path to a Goal event file (MP4 or M3U8 URL).
- A valid OPENAI API Access Token for GPT-4 is required.
- The system processes a 30-second clip in about 1 minute; longer clips take more time.
- Progress is shown on the GUI sidebar.
Video
Acknowledgements
$\color{black}{This\ research\ was\ partly\ funded\ by\ the\ Research\ Council\ of\ Norway\,\ project\ number\ 346671.\ }$ (AI-storyteller).
Citations
[!IMPORTANT] Please cite our research using the following BibTeX entries:
@incollection{Houshmand_MMSYS_ODS,
author = {Houshmand Sarkhoosh, Mehdi and Midoglu, Cise and Shafiee Sabet, Saeed and Halvorsen, P{\aa}l},
title = {{The SoccerSum Dataset for Automated Detection, Segmentation, and Tracking of Objects on the Soccer Pitch}},
booktitle = {{MMSys'24 : The 15th ACM Multimedia Systems Conference}},
year = {2024},
month = apr,
date = {2024-04-15},
urldate = {2024-04-15},
isbn = {979-8-4007-0412-3/24/04},
publisher = {Association for Computing Machinery},
address = {New York, NY, USA},
doi = {10.1145/3625468.3652180}
}
@incollection{Houshmand_MMSYS_demo,
author = {Houshmand Sarkhoosh, Mehdi and Midoglu, Cise and Shafiee Sabet, Saeed and Halvorsen, P{\aa}l},
title = {{Multimodal AI-Based Summarization and Storytelling for Soccer on Social Media}},
booktitle = {{MMSys'24 : The 15th ACM Multimedia Systems Conference}},
year = {2024},
month = apr,
date = {2024-04-15},
urldate = {2024-04-15},
isbn = {979-8-4007-0412-3/24/04},
publisher = {Association for Computing Machinery},
address = {New York, NY, USA},
doi = {10.1145/3625468.3652197}
}
[!CAUTION] Please refer to the
CITATION.cfffile provided in the root directory if you use the software in this repository.
Owner
- Name: Simula
- Login: simula
- Kind: organization
- Location: Oslo, Norway
- Website: www.simula.no
- Repositories: 93
- Profile: https://github.com/simula
Citation (CITATION.cff)
cff-version: 1.0.0
message: "If you use this software, please cite it as below."
authors:
- family-names: "Houshmand Sarkhoosh"
given-names: "Mehdi"
orcid: "https://orcid.org/0009-0008-4616-4592"
- family-names: "Gautam"
given-names: "Sushant"
orcid: "https://orcid.org/0000-0001-9232-2661"
- family-names: "Midoglu"
given-names: "Cise"
orcid: "https://orcid.org/0000-0003-0991-4418"
- family-names: "Shafiee Sabet"
given-names: "Saeed"
orcid: "https://orcid.org/0000-0001-5348-8546"
- family-names: "Halvorsen"
given-names: "Pål"
orcid: "https://orcid.org/0000-0003-2073-7029"
title: "Multimodal AI-Based Summarization and Storytelling for Soccer on Social Media"
version: 1.0
doi: 10.5281/zenodo.10612084
date-released: 2024-03-01
url: "https://github.com/simula/SoccerSum"
GitHub Events
Total
- Watch event: 2
Last Year
- Watch event: 2