data-cleaning-ui

A general-purpose time-series data cleaning user interface

https://github.com/marci-1004/data-cleaning-ui

Science Score: 44.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (8.3%) to scientific vocabulary
Last synced: 7 months ago · JSON representation ·

Repository

A general-purpose time-series data cleaning user interface

Basic Info
  • Host: GitHub
  • Owner: Marci-1004
  • Language: Python
  • Default Branch: main
  • Size: 5.33 MB
Statistics
  • Stars: 0
  • Watchers: 1
  • Forks: 0
  • Open Issues: 0
  • Releases: 2
Created almost 2 years ago · Last pushed almost 2 years ago
Metadata Files
Readme Citation

README.md

This is a general-purpose time-series data cleaning user interface.

It's main purpose is to perform the data cleaning task of the preparation of ML training data.

A video guide is available at: https://youtu.be/vf5UN73QOW4

The functions of the UI include: 1) Calculation of periodicities within each column of a dataset 2) Missing value imputation 3) Data transformation into trend, seasonal, and residual components 4) Data outlier detection for each of these components 5) Data anomaly rectification 6) Various visualization methods interspersed along the way

The possible outputs obtained from the UI: 1) Missing value imputation results 2) Decomposition into trend, seasonal, and residual results 3) Outlier detection results 4) Anomaly rectification results

All of these outputs should be sufficient input for the training of ML models

Running the UI can be done through VS-code, where: 1) The python and shiny for python extensions need to be installed to vscode 2) A correct interpreter has to be selected, one which includes all the necessary packages listed on the first cell of the app.py file 3) After this, the app.py file can be started, and can be opened in the browser via vscode

Owner

  • Login: Marci-1004
  • Kind: user

Citation (CITATION.cff)

cff-version: 1.1.0
message: "If you use this software, please cite it as below."
authors:
- family-names: Kovács
  given-names: Márton
orcid: 
doi: 10.5281/zenodo.11057684
url: https://github.com/Marci-1004/Data-Cleaning-UI
title: Time-Series Data Cleaning User Interface
version: v1.0.1
date-released: 2024-04-24

GitHub Events

Total
Last Year