Lumen

Lumen: A software for the interactive visualization of probabilistic models together with data - Published in JOSS (2021)

https://github.com/lumen-org/lumen

Keywords

web-application

Scientific Fields

Artificial Intelligence and Machine Learning Computer Science - 40% confidence

Last synced: 6 months ago · JSON representation ·

Repository

Interactive web application for the exploration, comparision and validation of probability models and its data

Basic Info

Host: GitHub
Owner: lumen-org
License: lgpl-3.0
Language: JavaScript
Default Branch: master
Homepage:
Size: 12.7 MB

Statistics

Stars: 7
Watchers: 5
Forks: 4
Open Issues: 46
Releases: 1

Topics

web-application

Created over 7 years ago · Last pushed over 4 years ago

Metadata Files

Readme License Citation

What is `lumen`?

lumen is an interactive web-application for the visualization and exploration of probabilistic machine learning models. Its main feature is the ability to rapidly and incrementally build flexible and potentially complex visualizations of both probabilistic machine learning models and the data these models were trained on.

Using `lumen`

lumen aims to make a particular class of machine learning/statistical models, namely probabilistic models, more easily accessible to humans. A probabilistic model models a set of target variables by means of a probability distribution. That is, different to many classic ML methods which predict a particular value of the target variable(s), probabilistic models instead capture the distribution of the target variables. lumen lets you 'see' your model, understand how it performs, where it 'fails', and compare this to previous versions of the model or alternative models.

Manual

A manual-style description of the UI, the visual encodings, lumens usage, its features, and available interactions is available here.

Walk-Through

A walk-through-style introduction to lumen is available here. It demonstrates some of the feature for exploration of probabilistic models in lumen.

`lumens` user interface displaying a variety of visualizations of a probabilistic model on a socio-economic data set

In particular lumen lets you:

plot any marginals of your models. Here, marginal means not just 1d but also higher dimensional marginals. Studying multiple multi-variate 'slices' of a model may help to understand the interactions between the variables. We believe this also helps you to fix model degrading artifacts. Such artifacts may indicate a problem in your model specification, model parameterization or possibly a bug in the machine learning algorithm of your model.
plot the model marginals together with data marginals. This lets you directly check the models fit to data.
plot predictions of your model along side corresponding data aggregations. This lets you understand its predictive behaviour, and also compare it observed quantities.
combine any of the above 'layers' into a single visualization.
change visualizations by flexibly assigning variables/data attributes to visual channels.
create as many of these visualizations side by side on an virtually infinite canvas. This lets you compare various stages of a model, compare different modelling approaches, and get a better overall understanding by combining many different visualizations of the same model.

Augmenting Probabilistic Programming

Probabilistic programming language (PPLs), such as PyMC3, BLOG, or Stan, provide a framework to define probabilistic models by explicitly declaring the likelihood of the observed data as a probability density function. The analyst typically starts with an exploration of the data. Based on insights gained from data exploration and on the analyst's domain knowledge, the analyst creates an initial simple model involving only some data. Subsequently, this model is iteratively made more complex until it meets the expert's goals. In particular, the model must be validated after each iteration. lumen supports this model building process by (i) enabling visual-interactive data exploration, (ii) supporting model validation by means of a visual comparison of data queries to semantically equivalent model queries, and (iii) enabling a direct comparison of model iterates.

Model Debugging

Even for a machine learning expert it may be hard to know whether a model has been trained on the data as expected. Possible reasons for artifacts in a model include an inappropriate application of the machine learning method, implementation bugs in the machine learning method, and issues in the training data. Direct visual inspection of the probabilistic model provides an approach to model debugging that enables the analyst to literally spot model artifacts that may cause degrading performance. Classical approaches to validation would rely on aggregating measures like information criterions or predictive accuracy scores.

Education / Teaching

By its intuitive visual representations of models, Lumen aims to promote understanding of the underlying modelling techniques. For instance, the effect of varying a parameter value for a modelling method on the probabilistic model can be observed visually rather than remaining an abstract description in a textbook. Similarly, the differences between models/model types can be visually illustrated by plotting them side by side. Also, probabilistic concepts such as conditioning or marginalization, which are often difficult to grasp, can be tried out interactively, providing immediate feedback.

Data-only exploration

You don't do any Machine Learning but simply would like to conveniently browse, explore, and compare tabular data? lumen is the right place for you too! This is not what lumen was built for originally, but regard it as your 'free lunch'.

Installing `lumen`

This explains how to install and configure lumen and its dependencies.

Note that lumen is build on top of the modelbase back-end, which provides a SQL-like interface for querying models and its data.

Requirements

lumen is a web application that requires access to a web-service instance of the Python3-based modelbase backend. lumen allows a user to interactively compile data/model queries and visualize the queries results. modelbase does the computation and actually answers the queries. You can get modelbase here where you also find information on how to set it up and run it as a web-service.
lumen and modelbase need to be configured correctly with 'matching' settings. By default (both run locally on the same physical machine) this is the case and you do not need to change these settings:
- hostname set in the configuration of lumen must match the actual hostname of modelbase.
- port must match
- protocol must match (http or https)
lumen allows you to explore the models and data that are hosted by the modelbase backend. You can use the modelbase Python package to (1) train/create models from data, and then (2) host them by an instance of the modelbase web-service. See the documentation and introductory jupyter notebooks in the doc folder for more information. Also, a number of example models are created during the setup process of modelbase for your convenience.

Setup

Clone/download this repository into a folder <path> of your choice.

Updating it

Just pull/download the lasted branch/version you'd like.

Running it

make sure the modelbase backend is running and hosting the models that you'd like to explore.
it's dead simple: Open <path>/index.html in your browser. If everything is fine you should now see a model dialog that lists the available models. Select one and start exploring it!

Notes: * Using chrome/chromium as a browser is recommended, since it provides the best performance from our experience.

Trouble Shooting

If you have any trouble using lumen, need some additional explanation, or even just want to provide some feedback, please don't hesitate to contact us at philipp.lucas@dlr.de. If you encounter any bugs you can also submit an issue.

Typical Issues

When open `lumen` in my browser I get the error message: "Could not load remote model from server!"

Confirm that the backend server actually running
Check the developer console log of the browser where you are loading the front-end. If it shows something like:

Failed to load http://127.0.0.1:5000/web-service: Response to preflight request doesn't pass access control check: The 'Access-Control-Allow-Origin' header has a value 'null' that is not equal to the supplied origin. Origin 'null' is therefore not allowed access.

Then your probably run into some CORS issue because you serve the file directly from the file system, instead from a webserver running locally. See here for the issues: * problem description: answer 1, point 2

Solutions: * serve it from a local web-service (preferred) * disable CORS control in chrome (kind of hacky)

I get the error message: "Could not load remote model 'XXXX' from server 'XXXX' !"

Confirm that the backend server is actually running
Did the backend server load the particular model that you are trying to retrieve? Loaded models are listed in the terminal output of the backend server on its start up.

Contributing

You wanna contribute? Awesome! Let's get in touch: philipp.lucas@dlr.de !

Development Setup

This is only for you, if you want to contribute to the project.

Do the steps as described in the Setup section above.
Install node-js. For questions refer to the getting started guide.
Update npm (part of node-js): sudo npm install -g npm
Install all npm-dependencies as provided by the projects package.json:
* run from <path>: npm install

Contact

For any questions, feedback, bug reports, feature requests, spam, rants, etc please contact: philipp.lucas@dlr.de

Copyright and Licence

This program is free software: you can redistribute it and/or modify it under the terms of the GNU Lesser General Public License as published by the Free Software Foundation, either version 3 of the License, or (at your option) any later version.

This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU Lesser General Public License for more details.

You should have received a copy of the GNU Lesser General Public License along with this program. If not, see https://www.gnu.org/licenses/.

Owner

Name: Lumen
Login: lumen-org
Kind: organization
Email: philipp.lucas@gmail.com
Location: Jena, Germany

Repositories: 4
Profile: https://github.com/lumen-org

A project for the visual-interactive building, validation and exploration of probabilistic models of all kinds.

JOSS Publication

Lumen: A software for the interactive visualization of probabilistic models together with data

Published

July 17, 2021

DOI

10.21105/joss.03395

Volume 6, Issue 63, Page 3395

Authors

Philipp Lucas

Institute of Data Science, German Aerospace Center

Joachim Giesen
Friedrich-Schiller-University Jena

Editor

Vissarion Fisikopoulos

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If you use this software, please cite it as below."
authors:
  - given-names: Philipp
    family-names: Lucas
    email: philipp.lucas@gmail.com
    orcid: 'https://orcid.org/0000-0002-6687-8209'
    affiliation: Friedrich-Schiller-Universität Jena
  - given-names: Joachim
    family-names: Giesen
    email: joachim.giesen@uni-jena.de
    affiliation: Friedrich-Schiller-Universität Jena
title: "Lumen: A software for the interactive visualization of probabilistic models together with data"
version: 1.0.0
doi: https://doi.org/10.21105/joss.03395
date-released: 2021-07-17
url: "https://github.com/lumen-org/lumen"

GitHub Events

Total

Last Year

Committers

Last synced: 7 months ago

All Time

Total Commits: 904
Total Committers: 8
Avg Commits per committer: 113.0
Development Distribution Score (DDS): 0.209

Past Year

Commits: 0
Committers: 0
Avg Commits per committer: 0.0
Development Distribution Score (DDS): 0.0

Top Committers

Name	Email	Commits
Philipp Lucas	p**s@u**e	715
Philipp Lucas	p**s@d**e	82
Philipp Lucas	p**s@g**m	75
=	l**s@w**e	12
Schmalwasser	s**i@d**e	12
Christoph Saffer	c**r@u**e	4
Joachim Giesen	j**n@u**e	2
Andreas Goral	a**l@u**e	2

Committer Domains (Top 20 + Academic)

uni-jena.de: 4 dw-00037sl.intra.dlr.de: 1 dlr.de: 1

Issues and Pull Requests

Last synced: 6 months ago

All Time

Total issues: 82
Total pull requests: 18
Average time to close issues: 21 days
Average time to close pull requests: 5 days
Total issue authors: 6
Total pull request authors: 3
Average comments per issue: 0.7
Average comments per pull request: 0.33
Merged pull requests: 13
Bot issues: 0
Bot pull requests: 0

Past Year

Issues: 0
Pull requests: 0
Average time to close issues: N/A
Average time to close pull requests: N/A
Issue authors: 0
Pull request authors: 0
Average comments per issue: 0
Average comments per pull request: 0
Merged pull requests: 0
Bot issues: 0
Bot pull requests: 0

View more stats

Top Authors

Issue Authors

ghost (38)
nandaloo (15)
Feathergunner (15)
asver12 (12)
jong42 (1)
vissarion (1)

Pull Request Authors

asver12 (12)
nandaloo (5)
keli95566 (1)

Top Labels

Issue Labels

Feature Request (37) bug (23) Visual Improvement (8) Usability (7) enhancement (4) invalid (4) CHI2019 (4) wontfix (3) EuroVis2020 (2) good first issue (1) Feature Idea (1) doc (1)

Lumen

Science Score: 100.0%

Keywords

Scientific Fields

Repository

Basic Info

Statistics

Topics

Metadata Files

README.md

What is lumen?

Using lumen

Manual

Walk-Through

Augmenting Probabilistic Programming

Model Debugging

Education / Teaching

Data-only exploration

Installing lumen

Requirements

Setup

Updating it

Running it

Trouble Shooting

Typical Issues

When open lumen in my browser I get the error message: "Could not load remote model from server!"

I get the error message: "Could not load remote model 'XXXX' from server 'XXXX' !"

Contributing

Development Setup

* run from <path>: npm install

Contact

Copyright and Licence

Owner

JOSS Publication

Lumen: A software for the interactive visualization of probabilistic models together with data

Authors

Editor

Tags

Citation (CITATION.cff)

GitHub Events

Total

Last Year

Committers

All Time

Past Year

Top Committers

Committer Domains (Top 20 + Academic)

Issues and Pull Requests

All Time

Past Year

Top Authors

Issue Authors

Pull Request Authors

Top Labels

Issue Labels

Pull Request Labels

What is `lumen`?

Using `lumen`

Installing `lumen`

When open `lumen` in my browser I get the error message: "Could not load remote model from server!"

* run from `<path>`: `npm install`