tos2ca-containerization

Container files for TOS2CA

https://github.com/nasa-jpl/tos2ca-containerization

Science Score: 75.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 3 DOI reference(s) in README
  • Academic publication links
    Links to: zenodo.org
  • Academic email domains
  • Institutional organization owner
    Organization nasa-jpl has institutional domain (www.jpl.nasa.gov)
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (11.8%) to scientific vocabulary
Last synced: 10 months ago · JSON representation ·

Repository

Container files for TOS2CA

Basic Info
  • Host: GitHub
  • Owner: nasa-jpl
  • License: apache-2.0
  • Language: Python
  • Default Branch: main
  • Size: 10.7 KB
Statistics
  • Stars: 0
  • Watchers: 0
  • Forks: 0
  • Open Issues: 0
  • Releases: 1
Created 11 months ago · Last pushed 11 months ago
Metadata Files
Readme Contributing License Code of conduct Citation Governance

README.md

Containerization

DOI

Language Language

The containerization library is part of the of the TOS2CA Project. For more information, visit the TOS2CA website at https://nasa-jpl.github.io/tos2ca-documentation/.

This python library is responsible for containerizing various stages of the anomaly-detection repo. The project uses this with AWS Fargate services, though it could be run locally on a single server if needed.

Requirements

  • Will need an NASA Earthdata login to use any tools DAAC tools/applications (and have credentials in a .netrc file)
  • Should have access to the us-west-2 AWS region to access any NASA DAAC data over S3
  • Access to additional AWS services required by the anomaly-detection repo

How to run the tos2ca anomaly detection drivers

The next steps are all in the script SetUp_Docker_tos2ca_jobs.sh if you choose but here is how to setup everything

  1. Into the top directory, make sure to have your Earthdata login .netrc file
  2. Create a subdirectory called code
  3. Either check out from git or copy over into the code subdirectory
    • tos2ca-anomaly-detection.git
    • tos2ca-data-dictionaries.git
    • tos2ca-fortracc-module.git
  4. Change to the directory and run the commands to build the images:
    • Example: docker build -t data_driver . -f Dockerfile_tos2ca_data_driver
  5. Once an image is built, a container can be run like:
    • Example: docker run -it -e JOBID=469 -e CHUNKID=1 data_driver
    • Note that each container will have enviornmental variables that you will need to pass to it.
  6. Optionally, you can then tag image:
    • docker tag <image ID> <AWS URI>/<repo name>:<tag>
  7. Optionally, you can also then push the image to a repository, like AWS ECR:
    • aws ecr get-login-password --region <region> | docker login --username AWS --password-stdin <AWS URI>
    • docker push <AWS URI>/<repo name>:<tag>

Owner

  • Name: NASA Jet Propulsion Laboratory
  • Login: nasa-jpl
  • Kind: organization
  • Email: github@jpl.nasa.gov
  • Location: Pasadena, California, US

A world leader in the robotic exploration of space

Citation (CITATION.cff)

# This CITATION.cff file was generated with cffinit.
# Visit https://bit.ly/cffinit to generate yours today!

cff-version: 1.2.0
title: >-
  Thematic Observation Search, Segmentation, Collation and
  Analysis (TOS2CA)
message: >-
  If you use this software, please cite it using the
  metadata from this file.
type: software
authors:
  - given-names: Ziad
    family-names: Haddad
    email: zsh@jpl.nasa.gov
    affiliation: Jet Propulsion Laboratory
    orcid: 'https://orcid.org/0000-0002-2608-6274'
  - given-names: Jason
    family-names: Eriksen
    affiliation: Columbia University
  - given-names: Ryan
    family-names: Fuller
    email: Ryan.A.Fuller@jpl.nasa.gov
    affiliation: Jet Propulsion Laboratory
    orcid: 'https://orcid.org/0000-0001-5785-9949'
  - given-names: Alexandre
    family-names: Guillaume
    email: Alexandre.Guillaume@jpl.nasa.gov
    affiliation: Jet Propulsion Laboratory
    orcid: 'https://orcid.org/0000-0002-4322-5862'
  - given-names: Svetla
    family-names: Hristova-Veleva
    email: Svetla.Hristova@jpl.nasa.gov
    affiliation: Jet Propulsion Laboratory
    orcid: 'https://orcid.org/0000-0003-2048-5167'
  - given-names: Brian
    family-names: Knosp
    email: Brian.Knosp@jpl.nasa.gov
    affiliation: Jet Propulsion Laboratory
    orcid: 'https://orcid.org/0000-0002-2266-8883'
  - given-names: Phillip
    family-names: Le
    affiliation: Columbia University
  - given-names: Flynn
    family-names: Platt
    email: Flynn.Platt@jpl.nasa.gov
    affiliation: Jet Propulsion Laboratory
    orcid: 'https://orcid.org/0009-0000-9246-2576'
  - given-names: Federica
    family-names: Polverari
    email: federica.polverari@jpl.nasa.gov
    affiliation: Jet Propulsion Laboratory
    orcid: 'https://orcid.org/0000-0002-2038-4983'
  - given-names: Randy
    family-names: Sawaya
    email: randy.sawaya@primer.ai
    affiliation: Primer
    orcid: 'https://orcid.org/0000-0003-1771-634X'
  - given-names: Quoc
    family-names: Vu
    email: qvu@jpl.nasa.gov
    affiliation: Jet Propulsion Laboratory
identifiers:
  - type: doi
    value: 10.5281/zenodo.16053702
    description: >-
      Link to an archived copy of the TOS2CA v2 software on
      Zenodo
repository-code: 'https://github.com/nasa-jpl/tos2ca-documentation'
url: 'https://nasa-jpl.github.io/tos2ca-documentation/'
abstract: >-
  Thematic Observation Search, Segmentation, Collation and
  Analysis (TOS2CA) is a user-driven, data-centric system
  that identifies, collates, statistically characterizes,
  and serves Earth science data relevant to a given
  phenomenon. It facilitates the collation and analysis of
  data from disparate sources and makes it possible for
  scientists to establish science traceability requirements,
  quantify detection thresholds, define uncertainty
  requirements, and establish data sufficiency to formulate
  truly innovative missions and research investigations.
license: Apache-2.0
version: v2.0
date-released: '2025-08-01'
preferred-citation:
  authors:
    - family-names: Haddad
      given-names: Ziad
    - family-names: Eriksen
      given-names: Jason
    - family-names: Fuller
      given-names: Ryan
    - family-names: Guillaume
      given-names: Alexandre
    - family-names: Hristova-Veleva
      given-names: Svetla
    - family-names: Knosp
      given-names: Brian
    - family-names: Le
      given-names: Phillip
    - family-names: Platt
      given-names: Flynn
    - family-names: Polverarai
      given-names: Federica
    - family-names: Sawaya
      given-names: Randy
    - family-names: Vu
      given-names: Quoc
  title: Thematic Observation Search, Segmentation, Collation and Analysis (TOS2CA)
  type: software
  year: 2025
  version: v2.0
  doi: 10.5281/zenodo.16053702

GitHub Events

Total
  • Release event: 1
  • Push event: 1
  • Public event: 1
  • Create event: 1
Last Year
  • Release event: 1
  • Push event: 1
  • Public event: 1
  • Create event: 1

Dependencies

requirements.txt pypi
  • Pillow ==9.4.0
  • PyMySQL ==1.0.2
  • Shapely ==1.8.1
  • blosc ==1.11.2
  • boto3 ==1.24.28
  • botocore ==1.27.59
  • cartopy ==0.22.0
  • cryptography ==39.0.1
  • geojson ==3.0.1
  • geos ==0.2.3
  • h5netcdf ==1.2.0
  • h5py ==3.7.0
  • matplotlib ==3.6.2
  • netCDF4 ==1.6.2
  • numpy ==1.24.3
  • pandas ==1.5.3
  • redis ==5.0.1
  • requests ==2.29.0
  • s3fs ==2023.3.0
  • s3transfer ==0.6.0
  • scikit-image ==0.19.3
  • scipy ==1.10.1
  • sip ==6.6.2
  • six ==1.16.0
  • urllib3 ==1.26.16
  • valkey ==6.1.0
  • xarray ==2022.11.0