https://github.com/anselmoo/csv_first_insight
A sklearn-based correlation- and prediction-maker for small *csv-data
Science Score: 23.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
✓codemeta.json file
Found codemeta.json file -
○.zenodo.json file
-
○DOI references
-
✓Academic publication links
Links to: zenodo.org -
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (14.1%) to scientific vocabulary
Keywords
Repository
A sklearn-based correlation- and prediction-maker for small *csv-data
Basic Info
- Host: GitHub
- Owner: Anselmoo
- License: mit
- Language: Python
- Default Branch: master
- Size: 171 KB
Statistics
- Stars: 0
- Watchers: 2
- Forks: 0
- Open Issues: 1
- Releases: 3
Topics
Metadata Files
README.md
CSV-First-Insights
A sklearn-based correlation- and prediction-maker for small csv-data < 10,000 entries. Consquently, no Neural Network will be used and so far the following Models are implemented:
Furthermore, for a first analysis, the cluster- and aprori-pair-plots can be easily generated for checking dependencies in the data.
The CSV-First-Insights-application can be installed like this:
python setup.py install
The options of the Command Line Interface of CSV-First-Insights are:
python -m pyinsights --help
usage: __main__.py [-h] [--fname FNAME FNAME] [--mode MODE] [--export]
Analyzer for small (# < 10,000) csv-Databases with binary content via scikit-learn!
Training-Set and Test-Set is separately stored in two databases.
optional arguments:
-h, --help show this help message and exit
--fname FNAME FNAME Two filenames have to be defined for the train- and test-set.
Default names are: train-data.csv','test-data.csv'
--mode MODE Please chose the model for the forecaset:
*Ridge-Regression as a Variation of Linear-Regressions -> rig(deafault)
*Gradient-Boosting-Trees -> grad
*Random-Forest -> fors
*All three models, please choose -> all
--export Export the Apriori-Analysis, Cluster-Maps, and Predictions as png- and txt-file
The CSV-First-Insights can be also loaded as packages like this:
python
import pyinsights
import pyinsights.dataread as dr
import pyinsights.mlmodels as ml
import pyinsights.sklsetups as skl
The Ridge-Regression-Prediction of CSV-First-Insights for the The Ultimate Halloween Candy Power Ranking of kaggle:

Owner
- Name: Anselm Hahn
- Login: Anselmoo
- Kind: user
- Location: Switzerland
- Repositories: 100
- Profile: https://github.com/Anselmoo
GitHub Events
Total
Last Year
Issues and Pull Requests
Last synced: 11 months ago
All Time
- Total issues: 0
- Total pull requests: 9
- Average time to close issues: N/A
- Average time to close pull requests: 6 minutes
- Total issue authors: 0
- Total pull request authors: 3
- Average comments per issue: 0
- Average comments per pull request: 0.11
- Merged pull requests: 8
- Bot issues: 0
- Bot pull requests: 2
Past Year
- Issues: 0
- Pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Issue authors: 0
- Pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
Pull Request Authors
- Anselmoo (7)
- imgbot[bot] (1)
- sourcery-ai[bot] (1)