SetVis

SetVis: Visualizing Large Numbers of Sets and Intersections - Published in JOSS (2024)

https://github.com/alan-turing-institute/setvis

Science Score: 98.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 7 DOI reference(s) in README and JOSS metadata
  • Academic publication links
    Links to: joss.theoj.org
  • Committers with academic emails
    4 of 8 committers (50.0%) from academic institutions
  • Institutional organization owner
    Organization alan-turing-institute has institutional domain (turing.ac.uk)
  • JOSS paper metadata
    Published in Journal of Open Source Software

Keywords

bokeh hut23 hut23-845 jupyter-notebook missing-data python set-visualization

Scientific Fields

Mathematics Computer Science - 84% confidence
Artificial Intelligence and Machine Learning Computer Science - 62% confidence
Last synced: 4 months ago · JSON representation

Repository

A tool for visualising set membership and patterns of missingness in data

Basic Info
  • Host: GitHub
  • Owner: alan-turing-institute
  • License: apache-2.0
  • Language: Python
  • Default Branch: main
  • Homepage: https://setvis.readthedocs.io
  • Size: 3.73 MB
Statistics
  • Stars: 4
  • Watchers: 10
  • Forks: 0
  • Open Issues: 14
  • Releases: 2
Topics
bokeh hut23 hut23-845 jupyter-notebook missing-data python set-visualization
Created over 4 years ago · Last pushed about 1 year ago
Metadata Files
Readme

README.md

setvis

Python Package Documentation Status DOI

Setvis is a python library for visualising set membership and patterns of missingness in data.

The plotting and interactive workflow of Setvis is designed for use within a Jupyter notebook (although it is possible to run outside Jupyter). The other components of Setvis can be used interactively or programmatically. The interactive plots are powered by Bokeh widgets.

It operates on data using a memory efficient architecture, and supports loading data from flat files, Pandas dataframes, and directly from a Postgres database.

Documentation

The setvis documentation is hosted on Read the Docs.

Installation (quick start)

For the complete installation instructions, consult the installation page of the documentation, which includes information on some extra installation options and setting up a suitable environment on several platforms.

We recommend installing setvis in a python virtual environment or Conda environment.

To install setvis, most users should run:

pip install 'setvis[notebooks]'

This will include everything to run setvis in a notebook, and to run the tutorial examples that do not need a database connection.

The Bokeh plots produced by setvis require the package notebook >= 6.4 to display properly. This will be included when installing setvis using the command above.

Tutorials

For basic examples, please see the two example notebooks: - Missingness example - Set example

Additionally, there is a series of Tutorials notebooks, starting with Tutorial 1.

After installing setvis, to follow theses tutorials interactively you will need to clone or download this repository. Then start jupyter from within it:

python -m jupyter notebook notebooks

Notice

The setvis software is released under the Apache Licence, version 2.0. See LICENCE for details.

The data files ./examples/datasets/simpsons - Format 1.csv and ./examples/datasets/simpsons - Format 2.csv, are based on a data file included in UpSet, copyright Visual Computing Group, Harvard, and distributed here under the terms of the MIT Licence.

The other data files in ./examples/datasets/ are released under the Creative Commons Attribution 4.0 International Licence (CC-BY-4.0).

Citing setvis

bibtex @article{Ruddle2024, doi = {10.21105/joss.06925}, url = {https://doi.org/10.21105/joss.06925}, year = {2024}, publisher = {The Open Journal}, volume = {9}, number = {103}, pages = {6925}, author = {R.a. Ruddle and L. Hama and P Wochner and O.t. Strickson}, title = {SetVis: Visualizing Large Numbers of Sets and Intersections}, journal = {Journal of Open Source Software} }

Acknowledgements

The development of the setvis software was supported by funding from the Engineering and Physical Sciences Research Council (EP/N013980/1; EP/R511717/1) and the Alan Turing Institute.

Owner

  • Name: The Alan Turing Institute
  • Login: alan-turing-institute
  • Kind: organization
  • Email: info@turing.ac.uk

The UK's national institute for data science and artificial intelligence.

JOSS Publication

SetVis: Visualizing Large Numbers of Sets and Intersections
Published
November 12, 2024
Volume 9, Issue 103, Page 6925
Authors
R.a. Ruddle ORCID
University of Leeds, Leeds, United Kingdom, Alan Turing Institute, London, United Kingdom
L. Hama ORCID
University of Leeds, Leeds, United Kingdom
P Wochner ORCID
Alan Turing Institute, London, United Kingdom
O.t. Strickson ORCID
Alan Turing Institute, London, United Kingdom
Editor
Marcel Stimberg ORCID
Tags
set visualization sets

GitHub Events

Total
  • Create event: 5
  • Issues event: 1
  • Release event: 1
  • Watch event: 1
  • Delete event: 3
  • Issue comment event: 12
  • Push event: 19
  • Pull request event: 12
  • Fork event: 1
Last Year
  • Create event: 5
  • Issues event: 1
  • Release event: 1
  • Watch event: 1
  • Delete event: 3
  • Issue comment event: 12
  • Push event: 19
  • Pull request event: 12
  • Fork event: 1

Committers

Last synced: 5 months ago

All Time
  • Total Commits: 405
  • Total Committers: 8
  • Avg Commits per committer: 50.625
  • Development Distribution Score (DDS): 0.358
Past Year
  • Commits: 42
  • Committers: 5
  • Avg Commits per committer: 8.4
  • Development Distribution Score (DDS): 0.69
Top Committers
Name Email Commits
Oliver Strickson o****n@t****k 260
Layik Hama l****a@g****m 78
pwochner p****r@t****k 39
Roy Ruddle 5****e 18
Marcel Stimberg m****g@s****r 5
pwochner 7****r 3
Iain 2****S 1
Roy Ruddle s****r@l****k 1
Committer Domains (Top 20 + Academic)

Issues and Pull Requests

Last synced: 4 months ago

All Time
  • Total issues: 37
  • Total pull requests: 76
  • Average time to close issues: 6 months
  • Average time to close pull requests: 2 days
  • Total issue authors: 4
  • Total pull request authors: 6
  • Average comments per issue: 1.46
  • Average comments per pull request: 1.07
  • Merged pull requests: 68
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 3
  • Pull requests: 10
  • Average time to close issues: 20 days
  • Average time to close pull requests: 4 days
  • Issue authors: 2
  • Pull request authors: 4
  • Average comments per issue: 3.67
  • Average comments per pull request: 1.3
  • Merged pull requests: 8
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • ots22 (26)
  • layik (5)
  • pwochner (4)
  • royruddle (1)
  • ChristinaLast (1)
Pull Request Authors
  • ots22 (42)
  • layik (26)
  • pwochner (7)
  • royruddle (5)
  • mstimberg (1)
  • Iain-S (1)
Top Labels
Issue Labels
bug (3) documentation (2) build & deploy (2)
Pull Request Labels
build & deploy (2) documentation (1)

Packages

  • Total packages: 1
  • Total downloads:
    • pypi 30 last-month
  • Total dependent packages: 0
  • Total dependent repositories: 0
  • Total versions: 3
  • Total maintainers: 1
pypi.org: setvis

Visualize set membership and missing data

  • Versions: 3
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Downloads: 30 Last month
Rankings
Dependent packages count: 6.6%
Average: 24.0%
Stargazers count: 28.2%
Forks count: 30.5%
Dependent repos count: 30.6%
Maintainers (1)
Last synced: 4 months ago

Dependencies

.github/workflows/main.yml actions
  • actions/checkout v2 composite
  • actions/setup-python v2 composite
poetry.lock pypi
  • 129 dependencies
pyproject.toml pypi
  • Bottleneck ^1.3
  • Sphinx ^4.3
  • bokeh ^2.3
  • matplotlib ^3.4.3
  • notebook ^6.4
  • numexpr ^2.7
  • numpy ^1.21
  • pandas ^1.3
  • psycopg2 ^2.9
  • pydantic ^1.8
  • pydata-sphinx-theme ^0.7
  • pytest ^6.2
  • python >=3.8,<3.12
  • scikit-learn >=0.2
  • scipy ^1.7
  • tomli ^2.0