dashboard-prototype

Prototype data dashboard for Imageomics Data

https://github.com/imageomics/dashboard-prototype

Science Score: 65.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 3 DOI reference(s) in README
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
    Organization imageomics has institutional domain (imageomics.osu.edu)
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (13.5%) to scientific vocabulary

Keywords

dashboard data data-visualization eda evaluation visualization
Last synced: 6 months ago · JSON representation ·

Repository

Prototype data dashboard for Imageomics Data

Basic Info
  • Host: GitHub
  • Owner: Imageomics
  • License: mit
  • Language: Python
  • Default Branch: main
  • Homepage: http://dash.imageomics.org
  • Size: 75.3 MB
Statistics
  • Stars: 5
  • Watchers: 10
  • Forks: 2
  • Open Issues: 8
  • Releases: 0
Topics
dashboard data data-visualization eda evaluation visualization
Created almost 3 years ago · Last pushed almost 2 years ago
Metadata Files
Readme License Citation

README.md

Dashboard Prototype

Prototype data dashboard using the Cuthill Gold Standard Dataset, which was processed from Cuthill, et. al. (original dataset available at doi:10.5061/dryad.2hp1978). Test datasets (the processed version of Cuthill's data with and without filepath URLs) are available in test_data.

This dashboard focuses on images labeled at the species and subspecies level as described in a CSV.

How it works

For full dashboard functionality, upload a CSV or XLS file with the following columns: - Species: Species of each sample. - Subspecies: Subspecies of each sample. - View: View of the sample (eg., 'ventral' or 'dorsal' for butterflies). - Sex: Sex of each sample. - hybrid_stat: Hybrid status of each sample (eg., 'validsubspecies', 'subspeciessynonym', or 'unknown'). - lat: Latitude at which image was taken or specimen was collected: number in [-90,90]. - lon: Longitude at which image was taken or specimen was collected: number in [-180,180]. long will also be accepted. - file_url: URL to access file. *Note:** Images should be in PNG or JPEG format, TIFF may fail to render in the sample image display.

**Note:* - Column names are not case-sensitive. - lat and lon columns are not required to utilize the dashboard, but there will be no map view if they are not included. Blank (or null) entries are recorded as unknown, and thus excluded from map view. - file_url is not required, but there will be no sample images option if it is not included. - locality may be provided, otherwise it will take on the value lat|lon or unknown if these are not provided.

Running Dashboard

Create and activate a new (python) virtual environment. Then install the required packages (if using conda, first run conda install pip):

pip install -r requirements.txt

and run

python dashboard.py

Then navigate to http://127.0.0.1:8050/ in your browser to see the graphs.

Running with Docker

To run the dashboard in a more scalable manner a Dockerfile is provided. This container uses gunicorn to support more users at the same time. Building and running the container requires that docker is installed.

Building the container

docker build -t dashboard .

Running the container

To deploy the dashboard with 6 workers run the following command: docker run --env BACKEND_WORKERS=6 -p 5000:5000 -it dashboard Then open the following URL http://0.0.0.0:5000/.

Preview

Histogram View

image

Map View

image

Testing

Test Requirements

The testing suite requires Dash Testing and pytest-mock, which can be installed in your python environment by running: pip install dash\[testing] pytest-mock

Running Tests

Within your python environment run the following command to run all tests: pytest

Owner

  • Name: Imageomics Institute
  • Login: Imageomics
  • Kind: organization

Citation (CITATION.cff)

# following https://github.com/citation-file-format/citation-file-format/blob/main/schema-guide.md

abstract: "To simplify new dataset evaluation for suitability, this data dashboard allows the user to visualize metadata distribution information and sample images efficiently without coding. The current focus is on visualizations for datasets containing metadata of plant and animal images, though images are not used to gather metadata distribution statistics and are optional for displaying samples."
authors:
- family-names: "Campolongo"
  given-names: "Elizabeth G."
  orcid: "https://orcid.org/0000-0003-0846-2413"
- family-names: "Bradley"
  given-names: "John"
  orcid: "https://orcid.org/0000-0003-3858-848X"
- family-names: "Thompson"
  given-names: "Matthew J."
  orcid: "https://orcid.org/0000-0003-0583-8585"
- family-names: "Jebbia"
  given-names: "Dom"
  orcid: "https://orcid.org/0000-0002-9587-8718"
- family-names: "Lapp"
  given-names: "Hilmar"
  orcid: "https://orcid.org/0000-0001-9107-0714"
cff-version: 1.2.0
date-released: "2024-05-21"
identifiers:
  - description: "The GitHub release URL of tag 1.3.0."
    type: url
    value: "https://github.com/Imageomics/dashboard-prototype/releases/tag/v1.3.0"
  - description: "The GitHub URL of the commit tagged with 1.3.0."
    type: url
    value: "https://github.com/Imageomics/dashboard-prototype/tree/77779a50a9631b9180a077ae8b99883d97f3042b"
keywords:
  - "EDA"
  - "data"
  - "evaluation"
  - "dashboard"
  - "visualization"
license: MIT
message: "If you use this software, please cite it using these metadata."
repository-code: "https://github.com/Imageomics/dashboard-prototype"
title: "Data Dashboard: Facilitating Data Exploration"
version: 1.3.0

GitHub Events

Total
Last Year