Science Score: 44.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
○DOI references
-
○Academic publication links
-
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (9.9%) to scientific vocabulary
Repository
Basic Info
- Host: GitHub
- Owner: Aker-BP-Open-Research
- Language: HTML
- Default Branch: main
- Size: 13.3 MB
Statistics
- Stars: 2
- Watchers: 0
- Forks: 0
- Open Issues: 0
- Releases: 1
Metadata Files
README.md
DeepSee: Detecting benthic lifeforms in deep-sea environments
Contributors
- Ebbe H. Hartz (AkerBP)
- Karthik H. Iyer (Bergwerk)
- Camilla Marnor(Bergwerk)
- Daniel W. Schmid (Bergwerk)
Affiliations
This work is a collboration between AkerBP and Bergwerk.
Introduction
The deep sea remains one of the least explored frontiers on Earth, with limited high-resolution data available on its biological diversity. Recent advances in technology have resulted in large amounts of high-resolution footage captured by remotely operated vehicles (ROVs). While this wealth of data holds immense potential for scientific discovery, the sheer volume makes manual annotation impractical. This is where machine learning (ML) can play a pivotal role, automating the annotation process and enabling more efficient analysis of these vast datasets. Our objective is to contribute to the emerging, high-resolution database on benthic life by leveraging ML to process and annotate the extensive video footage. This approach is not intended to replace biologists but to augment their capabilities, allowing them to focus on more complex analytical tasks. We present the DeepSee dataset, a comprehensive collection of annotated images from the Arctic Mid-Ocean Ridge, the Norwegian Sea, and the Greenland Sea. This dataset is designed to support the development of ML models capable of detecting and classifying benthic organisms. The DeepSee object detection model, trained on this dataset, is capable of processing vast amounts of footage quickly with high precision and recall. The model provides a valuable addition to the traditional workflow of manual annotation by significantly reducing the load on human annotators. By deploying such models, we aim to streamline the annotation process, making it easier for biologists to conduct their research and ultimately aiding in the informed decision-making regarding deep-sea resource management and protection.
Dataset
The DeepSee dataset is largely constructed using frame grabs from videos captured by remotely operated vehicles (ROVs) during deep-sea surveys in the Arctic Mid-Ocean Ridge, the Norwegian Sea, and the Greenland Sea. The video material is provided, free of charge, by the Norwegian Offshore Directorate. High-quality images, where instances of one or more classes are distinctly visible, are selected for the training dataset. Classes are chosen based on distinct morphospecies and groups of morphospecies that share a similar morphology. Additional non-biological classes (ROV parts) that occasionally occur in the images are also included to reduce false positives of biological classes. Note that class in this context is a ML-term, not to confused with the biological taxon class.
Classes in the DeepSee dataset |
Instance and image count in the DeepSee dataset |
Model Performance
The DeepSee model is trained on 4 Tesla T4 GPUs. Inference time for a single image is ~60-120 ms. Inference on video frames is much faster and requires between 30 and 60 ms/frame.
Training Metrics |
Confusion Matrix |
F1 Curve |
Results
Video Inference
Below we demonstrate inference performed on media released by the Norwegian Broadcasting Corporation (NRK release). Note that this video is not part of the training dataset.
https://github.com/user-attachments/assets/bffbb086-8e6e-41e5-a86e-801efd39e525
DeepSee with Photogrammetry
Usually, detection is carried out on video frames that only include the general location of the ROV. In areas of particular interest where high quality videos are available, frames can be stitched together into orthomosaics using photogrammetry. Object detection using the DeepSee model on such orthomosaics allows for pinpointing the location of individual organisms on the sea floor. The image below shows the exact coordinates of detected lifeforms using this technique.

Benthic Lifeform Maps
The results obtained from the DeepSee model can be used to generate high resolution (down to the meter scale) maps of benthic life in the deep ocean. Due to general sparseness of such data, maps are usually created with a granularity of hundreds to tens of thousands of square kiolometers (e.g. report by the Institue of Marine Research, Norway, recent mapping paper by Ramirez-Llodra et al., 2024) which may result in an incomplete or inaccurate assessment of the region of interest. We have created hex maps using our results going from a 50,000 km² hex grid down to a 1 km² grid to illustrate how changing granularity changes perceived distribution and can greatly influence decision making when it comes to deepsea resource management. The hex grid images display an overview grid with 50,000 km² down to 100 km² hexes next to a zoomed grid that goes down to 1 km² hexes for number of organisms (abundance) and the number of different species (richness) detected in a hex. Note that the hex grids are limited to 1 km² simply for illustration purposes and not result/data constrained. Interactive htmls of the abundance and species richness hex grids can be accessed by the provided links. Note that the html raw code needs to be downloaded and saved as an html file before viewing it in a browser.
Overview hex grid showing lifeform abundance detected by the DeepSee model. The black rectangle shows the coverage of the adjacent image. |
Zoomed hex grid showing lifeform abundance detected by the DeepSee model |
Overview hex grid showing species richness detected by the DeepSee model. The total number of detectable species is 17. The black rectangle shows the coverage of the adjacent image. |
Zoomed hex grid showing species richness detected by the DeepSee model. The total number of detectable species is 17. |
DeepSee enables the production of comprehensive benthic ecosystem maps with high spatial resolution. The following example demonstrates mapping marine species distribution over a small-scale area based on a ROV video track. Note that taxa are desribed by phylum and lowest posible taxanomical level. Fig A and B are the same area, just separating phyla.

Comparison with Other Data
DeepSee has mapped approximately 500,000 individual life forms in the Norwegian Proposed Opening Area for Mineral Exploration. Below, we compare this data to GBIF and OBIS data for the same region.
Count comparison between DeepSee, GBIF and OBIS in the Norwegian Proposed Opening Area for Mineral Exploration. |
Relative proportions of phyla detected by DeepSee. |
Heatmap showing Porifera records in the GBIF database. |
Heatmap showing Porifera records in the GBIF database and detections using DeepSee. |
Conclusion
By deploying machine learning models like DeepSee, we aim to streamline the annotation process, enhancing biologists' capabilities in conducting research and supporting informed decision-making regarding deep-sea resource management and protection.
Owner
- Name: Aker BP - Open Research
- Login: Aker-BP-Open-Research
- Kind: organization
- Repositories: 1
- Profile: https://github.com/Aker-BP-Open-Research
Citation (CITATION.cff)
# This CITATION.cff file was generated with cffinit.
# Visit https://bit.ly/cffinit to generate yours today!
cff-version: 1.2.0
title: DeepSee
message: >-
If you use this dataset, please cite it using the metadata
from this file.
type: dataset
authors:
- given-names: Ebbe H.
family-names: Hartz
affiliation: Aker BP
- given-names: Karthik H.
family-names: Iyer
affiliation: Bergwerk
- given-names: Camilla M.
family-names: Marnor
affiliation: Bergwerk
- given-names: Daniel W.
family-names: Schmid
affiliation: Bergwerk
repository-code: 'https://github.com/Aker-BP-Open-Research/DeepSee'
GitHub Events
Total
- Release event: 1
- Watch event: 2
- Push event: 6
- Create event: 1
Last Year
- Release event: 1
- Watch event: 2
- Push event: 6
- Create event: 1