https://github.com/erictleung/ml-final-proj
:wine_glass: CS559/659 Machine Learning Final Project on Predicting Wine Quality
Science Score: 13.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
✓codemeta.json file
Found codemeta.json file -
○.zenodo.json file
-
○DOI references
-
○Academic publication links
-
○Committers with academic emails
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (8.1%) to scientific vocabulary
Keywords
Repository
:wine_glass: CS559/659 Machine Learning Final Project on Predicting Wine Quality
Basic Info
Statistics
- Stars: 2
- Watchers: 1
- Forks: 0
- Open Issues: 0
- Releases: 0
Topics
Metadata Files
README.md
CS559/659 Machine Learning Final Project
Here is my machine learning project on using various methods to predict wine quality and wine type based on physiochemical measurements.
Prerequisites
Run Analysis and Create Report
shell
git clone https://github.com/erictleung/ml-final-proj.git
make report
Data
The data comes from the University of California Irvine Machine Learning Repository and can be found at the Wine Quality Data Set.
The data has two datasets: one related to red wine, another is for white wine. Each type of wine is from Portugal.
The data includes eleven input variables (such as citric acid content and pH) and there is one output variable on quality, which is on a scale between zero and ten.
Questions Asked
- Putting the data together, can we distinguish between white and red wine?
- Can we predict perceived wine quality based on the input variables?
- Are there any variables that contain redundant information? (In other words, are there any correlative variables?)
- What variables are most important in predicting perceived wine quality?
Repository Structure
``` . ├── Makefile ├── README.md ├── bin │ ├── decision-trees.R │ ├── naive-bayes.R │ ├── splitdf.R │ └── svm.R └── report ├── leung-final-report.Rmd └── refs.bib
2 directories, 8 files ```
Owner
- Name: Eric Leung
- Login: erictleung
- Kind: user
- Location: New York, NY
- Website: https://erictleung.com
- Repositories: 169
- Profile: https://github.com/erictleung
Data science generalist. Sharing knowledge and optimizing tools for learning and growth. Open-source and open-data advocate. Community learner.
GitHub Events
Total
- Watch event: 1
Last Year
- Watch event: 1
Committers
Last synced: about 1 year ago
Top Committers
| Name | Commits | |
|---|---|---|
| Eric Leung | e****c@e****m | 103 |
Committer Domains (Top 20 + Academic)
Issues and Pull Requests
Last synced: 11 months ago
All Time
- Total issues: 0
- Total pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Total issue authors: 0
- Total pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 0
- Pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Issue authors: 0
- Pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0