jupyter-scatter-tutorial

Jupyter Scatter Tutorial (that was first presented at SciPy '23)

https://github.com/flekschas/jupyter-scatter-tutorial

Science Score: 44.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (12.3%) to scientific vocabulary

Keywords

data-visualization embeddings jupyter-widget jupyterlab python scatter-plot
Last synced: 6 months ago · JSON representation ·

Repository

Jupyter Scatter Tutorial (that was first presented at SciPy '23)

Basic Info
Statistics
  • Stars: 22
  • Watchers: 4
  • Forks: 0
  • Open Issues: 2
  • Releases: 0
Topics
data-visualization embeddings jupyter-widget jupyterlab python scatter-plot
Created over 2 years ago · Last pushed 9 months ago
Metadata Files
Readme License Citation

README.md

Jupyter Scatter Tutorial

SciPy 2023 Talk

:wave: Welcome! Here you will find the notebooks for the Jupyter Scatter tutorial, first presented at SciPy 2023. These notebooks offer an in-depth guide to interactive scatter plot visualizations using jupyter-scatter. Specifically, the tutorial covers

  1. How to get started with Jupyter Scatter and visualize medium to large-scale datasets as interactive scatter plots.
  2. How to compose and link/synchronize multiple scatter plots
  3. How to integrate Jupyter Scatter with other widgets to build bespoke interfaces for:
    1. Exploring LLM-based sentence embeddings
    2. Comparing multiple embedding method of the Fashion MNIST dataset
    3. Browsing genomic data with HiGlass and loci embeddings
    4. Comparing a pair of single-cell embeddings by their label abundance differences
  4. How to use the tooltip feature, introduced in v0.15.0 (Added after the SciPy 2023):
    1. Tooltip with text previews for the LLM-based sentence embeddings
    2. Tooltip with image previews for the Fashion MNIST embedding
    3. Tooltip for a single-cell embededding
    4. Tooltip with audio previews for Google's Magenta Nsynth dataset
  5. How to add features to Jupyter Scatter through composition with other Jupyter Widgets or other Python libraries (Added after the SciPy 2023):
    1. Search
    2. Cluster Outlines and Contours

Note

You can find my SciPy '23 talk on YouTube and the accompanying slides at Speaker Deck.

Run the Tutorial

Online

If you have a Google/Gmail account, you can run this tutorial from your browser using Colab: Open In Colab.

[!IMPORTANT] You need to manually install Jupyter Scatter when running the notebooks in Google Colab via !pip install jupyter-scatter. Make sure to not install jscatter as that is a different package.

Locally

To run the notebook locally we recommend using uv as follows:

sh uv run jupyter-lab

Owner

  • Name: Fritz Lekschas
  • Login: flekschas
  • Kind: user
  • Location: Somerville, MA

Computer scientist researching visualization systems for large-scale exploration of biomedical data. Harvard CS PhD '21.

Citation (CITATION.cff)

cff-version: "1.2.0"
authors:
- family-names: Lekschas
  given-names: Fritz
  orcid: "https://orcid.org/0000-0001-8432-4835"
doi: 10.25080/gerudo-f2bc6f59-030
message: If you use or reference examples from these notebooks, please cite
  our slides from the SciPy Proceedings.
preferred-citation:
  authors:
  - family-names: Lekschas
    given-names: Fritz
    orcid: "https://orcid.org/0000-0001-8432-4835"
  date-published: 2023-08-07
  doi: 10.25080/gerudo-f2bc6f59-030
  publisher:
    name: Zenodo
  title: "Interactive Exploration of Large-Scale Datasets with Jupyter-Scatter"
  url: "https://doi.org/10.25080/gerudo-f2bc6f59-030"
title: "Interactive Exploration of Large-Scale Datasets with Jupyter-Scatter"

GitHub Events

Total
  • Issues event: 1
  • Watch event: 3
  • Issue comment event: 7
  • Push event: 2
Last Year
  • Issues event: 1
  • Watch event: 3
  • Issue comment event: 7
  • Push event: 2

Issues and Pull Requests

Last synced: 9 months ago

All Time
  • Total issues: 3
  • Total pull requests: 3
  • Average time to close issues: 3 days
  • Average time to close pull requests: about 10 hours
  • Total issue authors: 3
  • Total pull request authors: 2
  • Average comments per issue: 4.67
  • Average comments per pull request: 0.33
  • Merged pull requests: 3
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 3
  • Pull requests: 1
  • Average time to close issues: 3 days
  • Average time to close pull requests: about 11 hours
  • Issue authors: 3
  • Pull request authors: 1
  • Average comments per issue: 4.67
  • Average comments per pull request: 1.0
  • Merged pull requests: 1
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • declann (1)
  • faroit (1)
  • mireklzicar (1)
Pull Request Authors
  • manzt (2)
  • flekschas (2)
Top Labels
Issue Labels
Pull Request Labels

Dependencies

.github/workflows/book.yml actions
  • actions/checkout v3 composite
  • actions/deploy-pages v2 composite
  • actions/setup-python v4 composite
  • actions/upload-pages-artifact v1 composite
requirements.txt pypi
  • anywidget *
  • bioframe *
  • higlass *
  • jupyter-scatter >=0.12.4,<0.13.0
  • jupyterlab >=3.0.0
  • pyarrow *
  • watchfiles *