ascvit

Repository for the article in the online magazine Towards Data Science.

https://github.com/stefanpietrusky/ascvit

Science Score: 44.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (15.6%) to scientific vocabulary

Keywords

data data-science data-visualization education flask-application llm ollama plotly python
Last synced: 6 months ago · JSON representation ·

Repository

Repository for the article in the online magazine Towards Data Science.

Basic Info
Statistics
  • Stars: 10
  • Watchers: 2
  • Forks: 2
  • Open Issues: 0
  • Releases: 0
Topics
data data-science data-visualization education flask-application llm ollama plotly python
Created over 1 year ago · Last pushed 8 months ago
Metadata Files
Readme License Citation

README.md

ASCVIT V2 APP

Automatic Statistical Calculation, Visualization, and Interpretation Tool

This repository, developed by Stefan Pietrusky, is based on the article published at Towards Data Science [1]. In this article, I discuss the development of a first version (V1) of a local app that can be used to automatically apply various statistical methods to any datasets. The enhanced version of ASCVIT (V1.5) now includes automatic interpretation of all generated visualisations [appv1.5.py]. The latest version of ASCVIT is now available [appv2.py]. Streamlit is no longer used in this version. The app is completely based on customizable HTML, CSS and JS. An LLM defined via the code is no longer used, but the models installed locally via Ollama can be selected as required. The navigation has been adapted and all data visualizations are now interactive through Plotly.

ASCVIT can be used in social work, specifically in associations and organizations that collect a wide range of data and can use the tool to evaluate it in order to better understand financial needs [2]. This is an open source project for educational and research purposes.

ASCVIT V1.5 Automatic interpretation by LLM (Image by author)

Overview of the statistical procedures

The following statistical procedures are supported by the first Version: ASCVIT V1.5 Overview of analysis methods (Image by author)

Structure and function of ASCVIT [V1.5]

The code to run the app is already in the repository as well as a script (clean.py) to remove missing values from data records. Below is a short GIF showing the structure and function of the app. ASCVIT V1.5 (Gif by author

The latest version of ASCVIT [V2]

The new version of ASCVIT looks as follows. ASCVIT V2 (Gif by author

Installing and running the application

  1. Clone this repository on your local computer: bash git clone https://github.com/stefanpietrusky/ascvitv2.git
  2. Install the required dependencies: bash pip install -r requirements.txt
  3. Install Ollama and load the model Llama3.1 (8B) or another.
  4. Remove missing data, if available, with the clean.py script bash python clean.py
  5. If you use V1.5 start the Streamlit app: bash streamlit run appv1.5.py
  6. If you use V2 start app with: bash python appv2.py 7 Use the file data.csv to test the application.

References

[1] Pietrusky, S. (2024). ASCVIT V1: Automatic Statistical Calculation, Visualization, and Interpretation Tool. Towards Data Science

[2] Pietrusky, S. (2025). Datafizierung und KI in der Sozialen Arbeit: Automatisierte Auswertung, Visualisierung und Interpretation von Daten zur Optimierung von Prozessen. peDOCs.

Owner

  • Login: stefanpietrusky
  • Kind: user

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If you cite this repository, please use the following reference."
title: "AUTOMATIC STATISTICAL CALCULATION, VISUALIZATION AND INTERPRETATION TOOL (ASCVIT) V2"
authors:
  - family-names: "Pietrusky"
    given-names: "Stefan"
    orcid: "https://orcid.org/0009-0008-9739-5542"
version: "1.0.0"
date-released: "2024-03-01"

GitHub Events

Total
Last Year

Dependencies

requirements_V1.5.txt pypi
  • matplotlib *
  • numpy *
  • pandas *
  • plotly *
  • scikit-learn *
  • scipy *
  • seaborn *
  • statsmodels *
  • streamlit *
requirements_V2.txt pypi
  • Flask ==3.1.1
  • MarkupSafe ==2.1.5
  • matplotlib ==3.8.4
  • numpy ==1.25.2
  • pandas ==1.5.3
  • plotly ==5.24.1
  • scikit_learn ==1.7.0
  • scipy ==1.16.0
  • seaborn ==0.13.2
  • statsmodels ==0.14.4