https://github.com/jku-vds-lab/loops

Loops is a JupyterLab extension to support iterative and exploratory data analysis in computational notebooks.

https://github.com/jku-vds-lab/loops

Science Score: 13.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
  • .zenodo.json file
  • DOI references
    Found 4 DOI reference(s) in README
  • Academic publication links
  • Committers with academic emails
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (12.0%) to scientific vocabulary

Keywords

data-analysis data-science data-visualization jupyter jupyter-notebook notebook provenance
Last synced: 5 months ago · JSON representation

Repository

Loops is a JupyterLab extension to support iterative and exploratory data analysis in computational notebooks.

Basic Info
Statistics
  • Stars: 6
  • Watchers: 0
  • Forks: 1
  • Open Issues: 21
  • Releases: 0
Topics
data-analysis data-science data-visualization jupyter jupyter-notebook notebook provenance
Created about 3 years ago · Last pushed about 1 year ago
Metadata Files
Readme Changelog License

README.md

loops

Github Actions StatusBinder

Loops is a JupyterLab extension to support iterative and exploratory data analysis in computational notebooks.

Loops automatically tracks the notebook's history and visualizes it next to the notebook. Loops shows the evolution of the notebook over time and highlights differences between versions to reveal the impact of changes made within a notebook. Loops visualizes differences in code, markdown, tables, visualizations, and images. For a quick overview of loops, see our preview video on YouTube:

Try loops yourself on Binder with two example notebooks for which the analysis process has been recorded. Loops is part of JuypterLab's sidebar and can be opened from there.

  • Use Case 1: Concert Data Analaysis Binder
  • Use Case 2: What-If Analysis on Cancer Patient Data Binder

The Use Case data and notebooks are also available on OSF to try loops in a local environment (see Usage).

To learn more about loops, read our paper.
Abstract:

Exploratory data science is an iterative process of obtaining, cleaning, profiling, analyzing, and interpreting data. This cyclical way of working creates challenges within the linear structure of computational notebooks, leading to issues with code quality, recall, and reproducibility. To remedy this, we present Loops, a set of visual support techniques for iterative and exploratory data analysis in computational notebooks. Loops leverages provenance information to visualize the impact of changes made within a notebook. In visualizations of the notebook provenance, we trace the evolution of the notebook over time and highlight differences between versions. Loops visualizes the provenance of code, markdown, tables, visualizations, and images and their respective differences. Analysts can explore these differences in detail in a separate view. Loops not only improves the reproducibility of notebooks but also supports analysts in their data science work by showing the effects of changes and facilitating comparison of multiple versions. We demonstrate our approach's utility and potential impact in two use cases and feedback from notebook users from various backgrounds.

Usage

Requires JupyterLab >= 4.0.0.

You can install loops with JupyterLab's extension manager: image

Or from the command line using pip:

bash pip install loops-diff

Feedback

Your comments and feedback are welcome. Write an email to klaus.eckelt@jku.at and let us know what you think!
If you have discovered an issue or have a feature suggestion, feel free to create an issue on GitHub.

Citing Loops

Klaus Eckelt, Kiran Gadhave, Alexander Lex, Marc Streit.
Loops: Leveraging Provenance and Visualization to Support Exploratory Data Analysis in Notebooks.
IEEE Transactions on Visualization and Computer Graphics (IEEE VIS '24, to appear), doi:10.31219/osf.io/79eyn, 2024.

@article{2024_loops, title = {Loops: Leveraging Provenance and Visualization to Support Exploratory Data Analysis in Notebooks}, author = {Klaus Eckelt and Kiran Gadhave and Alexander Lex and Marc Streit}, journal = {IEEE Transactions on Visualization and Computer Graphics (IEEE VIS '24, to appear)}, doi = {10.31219/osf.io/79eyn}, url = {https://doi.org/10.31219/osf.io/79eyn}, year = {2024} }


Contributing

There are two ways to set up loops for development:

  • with VS Code devContainers (recommended), which sets up Jupyter and the dependencies in a container.
  • with a local setup, the default way for JupyterLab extension. Works with every code editor.

Development in DevContainer

Requirements

See the official Dev Containers Tutorial for more detailed instructions and alternatives.

A devcontainer.json file is provided that uses the official scipy-notebook container for development (includes JupyterLab, numpy, pandas, matplotlib, seaborn, altair, scikit-learn, and more).

Setup

With the above requirements fullfilled, you will see the following popup when opening this project in VS Code: VS Code popup

Alternatively, you can also reopen the project in a devcontainer via the command prompt: VS Code command prmpt

By reopening in the container, you will get an environment with Jupyter Lab and the packages from the docker image and it will also install all dependencies of the extension as well as the extension itself. Therefore, this process will take a while when doing it for the first time. You can watch the set up process by opening the log in the terminal. When the extension is installed, the terminal should look similar to this:

image

All you need to do, is running jlpm watch in the VS Code terminal afterwards so that the extension gets updated when you make code changes.
The terminal can also be used to add further python packages.

Local Development

Note: You will need NodeJS to build the extension package.

Tested with Node 18 and Python 3.9

You also may want to create a virtual environment, i.e.

```bash python -m venv env source env/bin/activate

Install Jupyterlab and any other python packages you want to use

python -m pip install "jupyterlab>=4.0.0" ```

The jlpm command is JupyterLab's pinned version of yarn that is installed with JupyterLab. You may use yarn or npm in lieu of jlpm below.

```bash

Clone the repo to your local environment

Change directory to the loops directory

Install package in development mode

python -m pip install -e "."

Link your development version of the extension with JupyterLab

python -m jupyter labextension develop . --overwrite

Rebuild extension Typescript source after making changes

jlpm build ```

You can watch the source directory and run JupyterLab at the same time in different terminals to watch for changes in the extension's source and automatically rebuild the extension.

```bash

Watch the source directory in one terminal, automatically rebuilding when needed

jlpm watch

Run JupyterLab in another terminal

python -m jupyter lab ```

With the watch command running, every saved change will immediately be built locally and available in your running JupyterLab. Refresh JupyterLab to load the change in your browser (you may need to wait several seconds for the extension to be rebuilt).

By default, the jlpm build command generates the source maps for this extension to make it easier to debug using the browser dev tools. To also generate source maps for the JupyterLab core extensions, you can run the following command:

bash python -m jupyter lab build --minimize=False

Testing the extension

Frontend tests

This extension is using Jest for JavaScript code testing.

To execute them, execute:

sh jlpm jlpm test

Integration tests

This extension uses Playwright for the integration tests (aka user level tests). More precisely, the JupyterLab helper Galata is used to handle testing the extension in JupyterLab.

More information are provided within the ui-tests README.

Packaging the extension

See RELEASE

Owner

  • Name: JKU Visual Data Science Lab
  • Login: jku-vds-lab
  • Kind: organization
  • Email: contact@jku-vds-lab.at
  • Location: Linz, Austria

GitHub Events

Total
  • Issues event: 1
  • Watch event: 4
  • Create event: 1
Last Year
  • Issues event: 1
  • Watch event: 4
  • Create event: 1

Committers

Last synced: 9 months ago

All Time
  • Total Commits: 93
  • Total Committers: 1
  • Avg Commits per committer: 93.0
  • Development Distribution Score (DDS): 0.0
Past Year
  • Commits: 4
  • Committers: 1
  • Avg Commits per committer: 4.0
  • Development Distribution Score (DDS): 0.0
Top Committers
Name Email Commits
Klaus Eckelt K****t@j****t 93
Committer Domains (Top 20 + Academic)
jku.at: 1

Issues and Pull Requests

Last synced: 6 months ago

All Time
  • Total issues: 55
  • Total pull requests: 44
  • Average time to close issues: about 1 month
  • Average time to close pull requests: 3 days
  • Total issue authors: 2
  • Total pull request authors: 1
  • Average comments per issue: 0.75
  • Average comments per pull request: 1.07
  • Merged pull requests: 42
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 2
  • Pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 1
  • Pull request authors: 0
  • Average comments per issue: 0.0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • keckelt (49)
  • declann (1)
Pull Request Authors
  • keckelt (56)
Top Labels
Issue Labels
enhancement (2) bug (1)
Pull Request Labels
enhancement (42) bug (8)

Packages

  • Total packages: 1
  • Total downloads:
    • pypi 18 last-month
  • Total dependent packages: 0
  • Total dependent repositories: 0
  • Total versions: 3
  • Total maintainers: 1
pypi.org: loops-diff

A JupyterLab extension to support iterative and exploratory data analysis in computational notebooks.

  • Homepage: https://github.com/jku-vds-lab/loops
  • Documentation: https://loops-diff.readthedocs.io/
  • License: BSD 3-Clause License Copyright (c) 2024, Klaus Eckelt All rights reserved. Redistribution and use in source and binary forms, with or without modification, are permitted provided that the following conditions are met: 1. Redistributions of source code must retain the above copyright notice, this list of conditions and the following disclaimer. 2. Redistributions in binary form must reproduce the above copyright notice, this list of conditions and the following disclaimer in the documentation and/or other materials provided with the distribution. 3. Neither the name of the copyright holder nor the names of its contributors may be used to endorse or promote products derived from this software without specific prior written permission. THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS "AS IS" AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT HOLDER OR CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.
  • Latest release: 1.1.1
    published almost 2 years ago
  • Versions: 3
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Downloads: 18 Last month
Rankings
Dependent packages count: 9.6%
Average: 36.4%
Dependent repos count: 63.2%
Maintainers (1)
Last synced: 6 months ago

Dependencies

.github/workflows/binder-on-pr.yml actions
  • actions/github-script v3 composite
.github/workflows/build.yml actions
  • actions/cache v2 composite
  • actions/checkout v2 composite
  • actions/download-artifact v2 composite
  • actions/setup-python v2 composite
  • actions/upload-artifact v2 composite
  • jupyterlab/maintainer-tools/.github/actions/base-setup v1 composite
  • jupyterlab/maintainer-tools/.github/actions/check-links v1 composite
.github/workflows/check-release.yml actions
  • actions/checkout v2 composite
  • actions/upload-artifact v2 composite
  • jupyter-server/jupyter_releaser/.github/actions/check-release v2 composite
  • jupyterlab/maintainer-tools/.github/actions/base-setup v1 composite
.github/workflows/update-integration-tests.yml actions
  • actions/checkout v2 composite
  • jupyterlab/maintainer-tools/.github/actions/update-snapshots v1 composite
package.json npm
  • @babel/core ^7.0.0 development
  • @babel/preset-env ^7.0.0 development
  • @jupyterlab/builder ^3.1.0 development
  • @jupyterlab/testutils ^3.0.0 development
  • @types/jest ^26.0.0 development
  • @typescript-eslint/eslint-plugin ^4.8.1 development
  • @typescript-eslint/parser ^4.8.1 development
  • eslint ^7.14.0 development
  • eslint-config-prettier ^6.15.0 development
  • eslint-plugin-prettier ^3.1.4 development
  • jest ^26.0.0 development
  • npm-run-all ^4.1.5 development
  • prettier ^2.1.1 development
  • rimraf ^3.0.2 development
  • stylelint ^14.3.0 development
  • stylelint-config-prettier ^9.0.4 development
  • stylelint-config-recommended ^6.0.0 development
  • stylelint-config-standard ~24.0.0 development
  • stylelint-prettier ^2.0.0 development
  • ts-jest ^26.0.0 development
  • typescript ~4.1.3 development
  • @jupyterlab/application ^3.1.0
ui-tests/package.json npm
  • @jupyterlab/galata ^4.3.0 development
.github/workflows/create-image.yml actions
  • actions/checkout v3 composite
  • docker/build-push-action v3 composite
  • docker/login-action v2 composite
  • docker/metadata-action v4 composite
.github/workflows/deploy-image.yml actions
  • actions/checkout v3 composite
  • docker/build-push-action v3 composite
  • docker/login-action v2 composite
  • docker/metadata-action v4 composite
.github/workflows/enforce-label.yml actions
  • jupyterlab/maintainer-tools/.github/actions/enforce-label v1 composite
.github/workflows/prep-release.yml actions
  • jupyter-server/jupyter_releaser/.github/actions/prep-release v2 composite
  • jupyterlab/maintainer-tools/.github/actions/base-setup v1 composite
.github/workflows/publish-release.yml actions
  • jupyter-server/jupyter-releaser/.github/actions/finalize-release v2 composite
  • jupyter-server/jupyter_releaser/.github/actions/populate-release v2 composite
  • jupyterlab/maintainer-tools/.github/actions/base-setup v1 composite
.devcontainer/docker-compose.yml docker
  • jupyter/scipy-notebook lab-4.0.2
Dockerfile docker
  • jupyter/base-notebook lab-4.0.2 build
ui-tests/yarn.lock npm
yarn.lock npm
  • 1055 dependencies
pyproject.toml pypi
setup.py pypi
binder/environment.yml conda
  • jupyterlab >=4.0.0,<5
  • matplotlib
  • mscorefonts
  • nodejs >=18,<19
  • numpy
  • pandas
  • pip
  • python >=3.10,<3.11.0a0
  • python-duckdb 0.6.1
  • scikit-learn
  • seaborn
  • wheel
  • yellowbrick
environment.yml conda
  • jupyterlab >=4.0.0b0,<5
  • nodejs >=18,<19
  • pip
  • python >=3.10,<3.11.0a0