falcon

Brushing and linking for big data

https://github.com/vega/falcon

Science Score: 49.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 1 DOI reference(s) in README
  • Academic publication links
  • Committers with academic emails
    2 of 8 committers (25.0%) from academic institutions
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (13.1%) to scientific vocabulary

Keywords

scalable-visualization

Keywords from Contributors

embedded interactive plagiarism-checker packaging xunit-test xunit-framework actions yolov5 graph-generation optim
Last synced: 7 months ago · JSON representation

Repository

Brushing and linking for big data

Basic Info
Statistics
  • Stars: 965
  • Watchers: 24
  • Forks: 53
  • Open Issues: 12
  • Releases: 0
Topics
scalable-visualization
Created over 9 years ago · Last pushed 8 months ago
Metadata Files
Readme License Citation

README.md

Falcon: Interactive Visual Analysis for Big Data

npm version Tests code style: prettier

Crossfilter millions of records without latencies.

The largest experiments we have done so far is 10M flights in the browser, 33M flights in the browser with DuckDB, and ~180M flights or ~1.7B stars when connected to OmniSciDB (formerly known as MapD).

We have written a paper about the research behind Falcon. Please cite us if you use Falcon in a publication.

bib @inproceedings{moritz2019falcon, doi = {10.1145/3290605}, year = {2019}, publisher = {{ACM} Press}, author = {Dominik Moritz and Bill Howe and Jeffrey Heer}, title = {Falcon: Balancing Interactive Latency and Resolution Sensitivity for Scalable Linked Visualizations}, booktitle = {Proceedings of the 2019 {CHI} Conference on Human Factors in Computing Systems - {CHI} {\textquotesingle}19} }

This project was developed for the paper above. A lot of the functionality is now in the falcon-vis library, VegaFusion, and Mosaic.

Demos

  • 1M flights in the browser: https://vega.github.io/falcon/flights/
  • 10M flights in the browser with DuckDB-WASM: https://vega.github.io/falcon/flights-duckdb/
  • 7M flights in OmniSci Core: https://vega.github.io/falcon/flights-mapd/
  • 500k weather records: https://vega.github.io/falcon/weather/

Falcon demo

Usage

Install with yarn add falcon-vis. You can use two query engines. First ArrowDB reading data from Apache Arrow. This engine works completely in the browser and scales up to ten million rows. Second, MapDDB, which connects to OmniSci Core. The indexes are created as ndarrays. Check out the examples to see how to set up an app with your own data. More documentation will follow.

Features

Zoom

You can zoom histograms. Falcon automatically re-bins the data.

Show and hide unfiltered data

The original counts without filters, can be displayed behind the filtered counts to provide context. Hiding the unfiltered data shows the relative distribution of the data.

With unfiltered data.

Without unfiltered data.

Circles or Color Heatmap

Heatmap with circles (default). Can show the data without filters.

Heatmap with colored cells.

Vertical bar, horizontal bar, or text for counts

Horizontal bar.

Vertical bar.

Text only.

Timeline visualization

You can visualize the timeline of brush interactions in Falcon.

Falcon with 1.7 Billion Stars from the GAIA Dataset

The GAIA spacecraft measured the positions and distances of stars with unprecedented precision. It collected about 1.7 billion objects, mainly stars, but also planets, comets, asteroids and quasars among others. Below, we show the dataset loaded in Falcon (with OmniSci Core). There is also a video of me interacting with the dataset through Falcon.

Developers

Install the dependencies with yarn. Then run yarn start to start the flight demo with in memory data. Have a look at the other script commands in package.json.

Experiments

First version that turned out to be too complicated is at https://github.com/vega/falcon/tree/complex and the client-server version is at https://github.com/vega/falcon/tree/client-server.

Owner

  • Name: Vega
  • Login: vega
  • Kind: organization

Data Visualization Languages & Tools

GitHub Events

Total
  • Issues event: 1
  • Watch event: 15
  • Delete event: 33
  • Issue comment event: 16
  • Push event: 28
  • Pull request review event: 17
  • Pull request event: 72
  • Fork event: 1
  • Create event: 37
Last Year
  • Issues event: 1
  • Watch event: 15
  • Delete event: 33
  • Issue comment event: 16
  • Push event: 28
  • Pull request review event: 17
  • Pull request event: 72
  • Fork event: 1
  • Create event: 37

Committers

Last synced: 10 months ago

All Time
  • Total Commits: 1,110
  • Total Committers: 8
  • Avg Commits per committer: 138.75
  • Development Distribution Score (DDS): 0.496
Past Year
  • Commits: 41
  • Committers: 2
  • Avg Commits per committer: 20.5
  • Development Distribution Score (DDS): 0.122
Top Committers
Name Email Commits
Dominik Moritz d****z@g****m 559
dependabot-preview[bot] 2****] 289
dependabot[bot] 4****] 239
greenkeeper[bot] g****] 14
Leilani Battle l****t@.****u 5
Tarek Rached t****d@s****m 2
p42-ai[bot] 7****] 1
Donny Bertucci b****d@o****u 1
Committer Domains (Top 20 + Academic)

Issues and Pull Requests

Last synced: 7 months ago

All Time
  • Total issues: 71
  • Total pull requests: 95
  • Average time to close issues: 3 months
  • Average time to close pull requests: 29 days
  • Total issue authors: 6
  • Total pull request authors: 5
  • Average comments per issue: 0.69
  • Average comments per pull request: 0.47
  • Merged pull requests: 52
  • Bot issues: 0
  • Bot pull requests: 67
Past Year
  • Issues: 1
  • Pull requests: 59
  • Average time to close issues: about 2 hours
  • Average time to close pull requests: 5 days
  • Issue authors: 1
  • Pull request authors: 1
  • Average comments per issue: 1.0
  • Average comments per pull request: 0.49
  • Merged pull requests: 24
  • Bot issues: 0
  • Bot pull requests: 59
Top Authors
Issue Authors
  • domoritz (58)
  • mathisonian (5)
  • leibatt (4)
  • saulshanabrook (2)
  • tarekrached (1)
  • Sondro (1)
Pull Request Authors
  • dependabot[bot] (128)
  • domoritz (14)
  • mathisonian (12)
  • greenkeeper[bot] (2)
  • tarekrached (2)
Top Labels
Issue Labels
enhancement (8) bug (4)
Pull Request Labels
dependencies (128) javascript (128)