soccersum

SoccerSum: Dataset for Automated Detection, Segmentation, and Tracking of Objects on the Soccer Pitch

https://github.com/simula/soccersum

Science Score: 67.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 9 DOI reference(s) in README
  • Academic publication links
    Links to: zenodo.org
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (12.7%) to scientific vocabulary

Keywords

ai computer-vision gpt-4 ml whisper
Last synced: 6 months ago · JSON representation ·

Repository

SoccerSum: Dataset for Automated Detection, Segmentation, and Tracking of Objects on the Soccer Pitch

Basic Info
Statistics
  • Stars: 1
  • Watchers: 5
  • Forks: 1
  • Open Issues: 0
  • Releases: 0
Topics
ai computer-vision gpt-4 ml whisper
Created almost 2 years ago · Last pushed almost 2 years ago
Metadata Files
Readme License Citation

README.md

SoccerSum

DOI Demonstration Paper Dataset Paper Hugging Face Build Status Tests Coverage

Abstract

The rapid advancement of technology has been revolutionizing the field of sports media, where there is a growing need for sophisticated data processing methods. Current methodologies for extracting information from soccer broadcast videos to generate game highlights and summaries for social media are predominantly manual and rely heavily on text-based NLP techniques, overlooking the rich visual and auditory information available. In response to this challenge, our research introduces SoccerSum, a tool that innovates in the field by integrating computer vision, audio analysis with advanced language models like GPT-4. This multimodal approach enables automated, enriched content summarization, including detection of players and key field elements, thereby enhancing the metadata used in summarization algorithms. SoccerSum uniquely combines textual and visual data, offering a comprehensive solution for generating accurate, platform-specific content. This development represents a significant advancement in automated, data-driven sports media dissemination, and sets a new benchmark in the realm of soccer information extraction. A video of the demo can be found here: Demonstration Video.

Dataset

The SoccerSum dataset was curated by capturing and annotating soccer videos from the Norwegian Eliteserien league. This collection spans three years, covering 2021, 2022, and 2023. It comprises 750 frames from 41 unique sequences, with 4 to 40 frames per sequence, carefully chosen to represent a diverse selection of scenarios encountered in professional soccer games. - Zenodo Record: SoccerSum dataset on Zenodo - Simula Dataset Page: Link to page

[!TIP] How to use SoccerSum?

Source Code

  • Repository: SoccerSum GitHub
  • Programming Languages: Python, JavaScript, HTML, CSS, AJAX
  • Python Version: Python 3.11

Packages and Libraries

  • Required Python Packages: flask, re, torch, torchvision, ultralytics, cv2, numpy, requests, openai, whisper, librosa, sklearn.feature_extraction.text, sklearn.metrics.pairwise, matplotlib, mpl_toolkits.mplot3d

Hardware Requirements

  • System Tested On: Linux
  • GPU Requirement: At least 1 GPU with 4 GB VRAM

Installation and Experimentation

Setup

  1. Clone the SoccerSum repository: git clone https://github.com/simula/SoccerSum

  2. Install the necessary requirements: cd SoccerSum pip install -r requirements.txt

  3. Download the model weights for Detection, Segmentation, and Classification: SoccerSum HuggingFace repository

  4. Place the weights in the weights folder. Adjust paths in main.py.

Running the Application

  1. Ensure port 5000 is free on your machine.

  2. Start the Flask application: python3 app.py

  3. The GUI will open in your default web browser.

Usage

  • Provide the path to a Goal event file (MP4 or M3U8 URL).
  • A valid OPENAI API Access Token for GPT-4 is required.
  • The system processes a 30-second clip in about 1 minute; longer clips take more time.
  • Progress is shown on the GUI sidebar.

Video

SoccerSum Demonstration

Acknowledgements

$\color{black}{This\ research\ was\ partly\ funded\ by\ the\ Research\ Council\ of\ Norway\,\ project\ number\ 346671.\ }$ (AI-storyteller).

Citations

[!IMPORTANT] Please cite our research using the following BibTeX entries:


@incollection{Houshmand_MMSYS_ODS,
  author = {Houshmand Sarkhoosh, Mehdi and Midoglu, Cise and Shafiee Sabet, Saeed and Halvorsen, P{\aa}l},
  title = {{The SoccerSum Dataset for Automated Detection, Segmentation, and Tracking of Objects on the Soccer Pitch}},
  booktitle = {{MMSys'24 : The 15th ACM Multimedia Systems Conference}},
  year = {2024},
  month = apr,
  date = {2024-04-15},
  urldate = {2024-04-15},
  isbn = {979-8-4007-0412-3/24/04},
  publisher = {Association for Computing Machinery},
  address = {New York, NY, USA},
  doi = {10.1145/3625468.3652180}
}

@incollection{Houshmand_MMSYS_demo,
  author = {Houshmand Sarkhoosh, Mehdi and Midoglu, Cise and Shafiee Sabet, Saeed and Halvorsen, P{\aa}l},
  title = {{Multimodal AI-Based Summarization and Storytelling for Soccer on Social Media}},
  booktitle = {{MMSys'24 : The 15th ACM Multimedia Systems Conference}},
  year = {2024},
  month = apr,
  date = {2024-04-15},
  urldate = {2024-04-15},
  isbn = {979-8-4007-0412-3/24/04},
  publisher = {Association for Computing Machinery},
  address = {New York, NY, USA},
  doi = {10.1145/3625468.3652197}
}

[!CAUTION] Please refer to the CITATION.cff file provided in the root directory if you use the software in this repository.

Owner

  • Name: Simula
  • Login: simula
  • Kind: organization
  • Location: Oslo, Norway

Citation (CITATION.cff)

cff-version: 1.0.0
message: "If you use this software, please cite it as below."
authors:
  - family-names: "Houshmand Sarkhoosh"
    given-names: "Mehdi"
    orcid: "https://orcid.org/0009-0008-4616-4592"
  - family-names: "Gautam"
    given-names: "Sushant"
    orcid: "https://orcid.org/0000-0001-9232-2661"
  - family-names: "Midoglu"
    given-names: "Cise"
    orcid: "https://orcid.org/0000-0003-0991-4418"
  - family-names: "Shafiee Sabet"
    given-names: "Saeed"
    orcid: "https://orcid.org/0000-0001-5348-8546"
  - family-names: "Halvorsen"
    given-names: "Pål"
    orcid: "https://orcid.org/0000-0003-2073-7029"
title: "Multimodal AI-Based Summarization and Storytelling for Soccer on Social Media"
version: 1.0
doi: 10.5281/zenodo.10612084
date-released: 2024-03-01
url: "https://github.com/simula/SoccerSum"

GitHub Events

Total
  • Watch event: 2
Last Year
  • Watch event: 2