https://github.com/catalystneuro/dandi-access-vis

https://github.com/catalystneuro/dandi-access-vis

Science Score: 26.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (12.4%) to scientific vocabulary
Last synced: 6 months ago · JSON representation

Repository

Basic Info
  • Host: GitHub
  • Owner: catalystneuro
  • Language: Python
  • Default Branch: main
  • Size: 6.34 MB
Statistics
  • Stars: 0
  • Watchers: 0
  • Forks: 0
  • Open Issues: 0
  • Releases: 0
Created 7 months ago · Last pushed 7 months ago
Metadata Files
Readme

README.md

DANDI Access Visualization Tools

This repository contains tools for creating geographic visualizations of DANDI data access patterns, including choropleth maps and scatter plots showing data download patterns by country and region.

The source data that is being visualized is here: https://github.com/dandi/access-summaries. By default, the access-summaries repo is expected to be downloaded next to this repo, though this can be adjusted using optional CLI args.

Features

  • Choropleth Maps: Country-level data visualization with color-coded regions
  • Scatter Maps: Region-level visualization with proportional point sizes
  • Multiple Dandiset Support: Process specific dandisets or combinations of dandisets
  • Flexible Data Paths: Configure custom data directory locations
  • Centralized Styling: Consistent color schemes across all visualizations
  • Publication Quality: High-resolution SVG and PDF outputs
  • Flexible Scaling: Linear and logarithmic scale options

Installation

Navigate to the visualization directory and install dependencies:

bash cd visualization pip install -r requirements.txt

Dependencies

  • pandas: Data manipulation
  • matplotlib: Plotting framework
  • numpy: Numerical operations
  • cartopy: Geographic projections and mapping
  • pyyaml: YAML configuration file parsing

Usage

Basic Usage

All Dandisets (Default)

```bash

Process all available dandisets

python create_choropleth.py --log-scale ```

Creates: Global country-level visualization showing 7.69 PB across 117 countries

Global Choropleth Map

Global DANDI downloads by country (logarithmic scale) - showing Netherlands and US as top consumers

Single Dandiset

```bash

Process specific dandiset with automatic filename

python create_choropleth.py --dandiset 000026 --log-scale ```

Creates: Focused view of single dandiset (114.23 TB across 44 countries)

Single Dandiset Choropleth

Dandiset 000026 downloads by country - US and Netherlands dominate usage

Multiple Dandisets

```bash

Process multiple specific dandisets

python createscattermap.py --dandiset 000026,000409,000488 ```

Creates: Regional scatter plot showing precise geographic distribution

Multi-Dandiset Scatter Map

Combined regional view of 3 dandisets - points show both location and download volume with color/size coding

All Dandisets (Regional View)

```bash

Process all available dandisets as scatter plot

python createscattermap.py ```

Creates: Comprehensive regional scatter plot showing global access patterns (655 regions across 470 dandisets)

Global Scatter Map

Global DANDI regional access patterns - comprehensive view of all dandisets showing worldwide download distribution

Temporal Analysis

```bash

Show downloads over time with top dandisets

python createtemporalchart.py ```

Creates: Cumulative stacked area chart showing growth of DANDI downloads (4.49 PB across 469 dandisets, 2021-2025)

Temporal Chart

DANDI cumulative downloads over time - stacked visualization showing growth of top 10 dandisets individually with others grouped as "Other"

Command Reference

Choropleth Maps (Country-level)

```bash python create_choropleth.py [options]

Options: --log-scale, -l Use logarithmic scale (recommended for wide ranges) --output, -o FILE Output filename (default: output/choropleth_map.svg) --data-path, -d PATH Data directory (default: ../access-summaries/content) --dandiset DANDISETS Comma-separated dandiset IDs (default: all) --help Show help message ```

Scatter Maps (Region-level)

```bash python createscattermap.py [options]

Options: --output, -o FILE Output filename (default: output/scatter_map.svg) --data-path, -d PATH Data directory (default: ../access-summaries/content) --dandiset DANDISETS Comma-separated dandiset IDs (default: all) --help Show help message ```

Temporal Charts (Time-series)

```bash python createtemporalchart.py [options]

Options: --output, -o FILE Output filename (default: output/temporal_chart.svg) --data-path, -d PATH Data directory (default: ../access-summaries/content) --dandiset DANDISETS Comma-separated dandiset IDs (default: all) --top-n, -n NUMBER Number of top dandisets to show individually (default: 10) --help Show help message ```

Output Files

Both scripts generate: - SVG files: Vector format for publications (300 DPI equivalent) - PDF files: Alternative format for presentations - Console output: Summary statistics and processing information

Owner

  • Name: CatalystNeuro
  • Login: catalystneuro
  • Kind: organization
  • Email: hello@catalystneuro.com

GitHub Events

Total
  • Push event: 2
  • Public event: 1
Last Year
  • Push event: 2
  • Public event: 1

Dependencies

requirements.txt pypi
  • cartopy >=0.21.0
  • matplotlib >=3.5.0
  • numpy >=1.21.0
  • pandas >=1.5.0
  • pyyaml >=6.0