Science Score: 44.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (9.9%) to scientific vocabulary
Last synced: 10 months ago · JSON representation ·

Repository

Basic Info
  • Host: GitHub
  • Owner: Aker-BP-Open-Research
  • Language: HTML
  • Default Branch: main
  • Size: 13.3 MB
Statistics
  • Stars: 2
  • Watchers: 0
  • Forks: 0
  • Open Issues: 0
  • Releases: 1
Created over 1 year ago · Last pushed over 1 year ago
Metadata Files
Readme Citation

README.md

DeepSee: Detecting benthic lifeforms in deep-sea environments

Contributors

Affiliations

This work is a collboration between AkerBP and Bergwerk.

Introduction

The deep sea remains one of the least explored frontiers on Earth, with limited high-resolution data available on its biological diversity. Recent advances in technology have resulted in large amounts of high-resolution footage captured by remotely operated vehicles (ROVs). While this wealth of data holds immense potential for scientific discovery, the sheer volume makes manual annotation impractical. This is where machine learning (ML) can play a pivotal role, automating the annotation process and enabling more efficient analysis of these vast datasets. Our objective is to contribute to the emerging, high-resolution database on benthic life by leveraging ML to process and annotate the extensive video footage. This approach is not intended to replace biologists but to augment their capabilities, allowing them to focus on more complex analytical tasks. We present the DeepSee dataset, a comprehensive collection of annotated images from the Arctic Mid-Ocean Ridge, the Norwegian Sea, and the Greenland Sea. This dataset is designed to support the development of ML models capable of detecting and classifying benthic organisms. The DeepSee object detection model, trained on this dataset, is capable of processing vast amounts of footage quickly with high precision and recall. The model provides a valuable addition to the traditional workflow of manual annotation by significantly reducing the load on human annotators. By deploying such models, we aim to streamline the annotation process, making it easier for biologists to conduct their research and ultimately aiding in the informed decision-making regarding deep-sea resource management and protection.

Dataset

The DeepSee dataset is largely constructed using frame grabs from videos captured by remotely operated vehicles (ROVs) during deep-sea surveys in the Arctic Mid-Ocean Ridge, the Norwegian Sea, and the Greenland Sea. The video material is provided, free of charge, by the Norwegian Offshore Directorate. High-quality images, where instances of one or more classes are distinctly visible, are selected for the training dataset. Classes are chosen based on distinct morphospecies and groups of morphospecies that share a similar morphology. Additional non-biological classes (ROV parts) that occasionally occur in the images are also included to reduce false positives of biological classes. Note that class in this context is a ML-term, not to confused with the biological taxon class.

Classes in the DeepSee dataset
Classes in the DeepSee dataset
Instance and image count in the DeepSee dataset
Instance and image count in the DeepSee dataset


Model Performance

The DeepSee model is trained on 4 Tesla T4 GPUs. Inference time for a single image is ~60-120 ms. Inference on video frames is much faster and requires between 30 and 60 ms/frame.

Training Metrics
Training Metrics
Confusion Matrix
Confusion Matrix
F1 Curve
F1 Curve


Results

Video Inference

Below we demonstrate inference performed on media released by the Norwegian Broadcasting Corporation (NRK release). Note that this video is not part of the training dataset.

https://github.com/user-attachments/assets/bffbb086-8e6e-41e5-a86e-801efd39e525

DeepSee with Photogrammetry

Usually, detection is carried out on video frames that only include the general location of the ROV. In areas of particular interest where high quality videos are available, frames can be stitched together into orthomosaics using photogrammetry. Object detection using the DeepSee model on such orthomosaics allows for pinpointing the location of individual organisms on the sea floor. The image below shows the exact coordinates of detected lifeforms using this technique.

Lifeform detection on an orthomosaic

Benthic Lifeform Maps

The results obtained from the DeepSee model can be used to generate high resolution (down to the meter scale) maps of benthic life in the deep ocean. Due to general sparseness of such data, maps are usually created with a granularity of hundreds to tens of thousands of square kiolometers (e.g. report by the Institue of Marine Research, Norway, recent mapping paper by Ramirez-Llodra et al., 2024) which may result in an incomplete or inaccurate assessment of the region of interest. We have created hex maps using our results going from a 50,000 km² hex grid down to a 1 km² grid to illustrate how changing granularity changes perceived distribution and can greatly influence decision making when it comes to deepsea resource management. The hex grid images display an overview grid with 50,000 km² down to 100 km² hexes next to a zoomed grid that goes down to 1 km² hexes for number of organisms (abundance) and the number of different species (richness) detected in a hex. Note that the hex grids are limited to 1 km² simply for illustration purposes and not result/data constrained. Interactive htmls of the abundance and species richness hex grids can be accessed by the provided links. Note that the html raw code needs to be downloaded and saved as an html file before viewing it in a browser.

Classes in the DeepSee dataset
Overview hex grid showing lifeform abundance detected by the DeepSee model. The black rectangle shows the coverage of the adjacent image.
Instance and image count in the DeepSee dataset
Zoomed hex grid showing lifeform abundance detected by the DeepSee model


Classes in the DeepSee dataset
Overview hex grid showing species richness detected by the DeepSee model. The total number of detectable species is 17. The black rectangle shows the coverage of the adjacent image.
Instance and image count in the DeepSee dataset
Zoomed hex grid showing species richness detected by the DeepSee model. The total number of detectable species is 17.


DeepSee enables the production of comprehensive benthic ecosystem maps with high spatial resolution. The following example demonstrates mapping marine species distribution over a small-scale area based on a ROV video track. Note that taxa are desribed by phylum and lowest posible taxanomical level. Fig A and B are the same area, just separating phyla.
Detailed map

Comparison with Other Data

DeepSee has mapped approximately 500,000 individual life forms in the Norwegian Proposed Opening Area for Mineral Exploration. Below, we compare this data to GBIF and OBIS data for the same region.

Count comparison between DeepSee, GBIF and OBIS
Count comparison between DeepSee, GBIF and OBIS in the Norwegian Proposed Opening Area for Mineral Exploration.
Relative proportions of phyla detected by DeepSee
Relative proportions of phyla detected by DeepSee.
Heatmap of Porifera detections using DeepSee
Heatmap showing Porifera records in the GBIF database.
Heatmap of Porifera records in GBIF
Heatmap showing Porifera records in the GBIF database and detections using DeepSee.

Conclusion

By deploying machine learning models like DeepSee, we aim to streamline the annotation process, enhancing biologists' capabilities in conducting research and supporting informed decision-making regarding deep-sea resource management and protection.

Owner

  • Name: Aker BP - Open Research
  • Login: Aker-BP-Open-Research
  • Kind: organization

Citation (CITATION.cff)

# This CITATION.cff file was generated with cffinit.
# Visit https://bit.ly/cffinit to generate yours today!

cff-version: 1.2.0
title: DeepSee
message: >-
  If you use this dataset, please cite it using the metadata
  from this file.
type: dataset
authors:
  - given-names: Ebbe H.
    family-names: Hartz
    affiliation: Aker BP
  - given-names: Karthik H.
    family-names: Iyer
    affiliation: Bergwerk
  - given-names: Camilla M.
    family-names: Marnor
    affiliation: Bergwerk
  - given-names: Daniel W.
    family-names: Schmid
    affiliation: Bergwerk
repository-code: 'https://github.com/Aker-BP-Open-Research/DeepSee'

GitHub Events

Total
  • Release event: 1
  • Watch event: 2
  • Push event: 6
  • Create event: 1
Last Year
  • Release event: 1
  • Watch event: 2
  • Push event: 6
  • Create event: 1