https://github.com/erictleung/ml-final-proj

:wine_glass: CS559/659 Machine Learning Final Project on Predicting Wine Quality

https://github.com/erictleung/ml-final-proj

Science Score: 13.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
  • DOI references
  • Academic publication links
  • Committers with academic emails
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (8.1%) to scientific vocabulary

Keywords

decision-trees machine-learning quality r svm uci wine
Last synced: 4 months ago · JSON representation

Repository

:wine_glass: CS559/659 Machine Learning Final Project on Predicting Wine Quality

Basic Info
  • Host: GitHub
  • Owner: erictleung
  • Language: R
  • Default Branch: master
  • Homepage:
  • Size: 55.7 KB
Statistics
  • Stars: 2
  • Watchers: 1
  • Forks: 0
  • Open Issues: 0
  • Releases: 0
Topics
decision-trees machine-learning quality r svm uci wine
Created over 9 years ago · Last pushed over 9 years ago
Metadata Files
Readme

README.md

CS559/659 Machine Learning Final Project

Here is my machine learning project on using various methods to predict wine quality and wine type based on physiochemical measurements.

Prerequisites

Run Analysis and Create Report

shell git clone https://github.com/erictleung/ml-final-proj.git make report

Data

The data comes from the University of California Irvine Machine Learning Repository and can be found at the Wine Quality Data Set.

The data has two datasets: one related to red wine, another is for white wine. Each type of wine is from Portugal.

The data includes eleven input variables (such as citric acid content and pH) and there is one output variable on quality, which is on a scale between zero and ten.

Questions Asked

  • Putting the data together, can we distinguish between white and red wine?
  • Can we predict perceived wine quality based on the input variables?
  • Are there any variables that contain redundant information? (In other words, are there any correlative variables?)
  • What variables are most important in predicting perceived wine quality?

Repository Structure

``` . ├── Makefile ├── README.md ├── bin │   ├── decision-trees.R │   ├── naive-bayes.R │   ├── splitdf.R │   └── svm.R └── report ├── leung-final-report.Rmd └── refs.bib

2 directories, 8 files ```

Owner

  • Name: Eric Leung
  • Login: erictleung
  • Kind: user
  • Location: New York, NY

Data science generalist. Sharing knowledge and optimizing tools for learning and growth. Open-source and open-data advocate. Community learner.

GitHub Events

Total
  • Watch event: 1
Last Year
  • Watch event: 1

Committers

Last synced: about 1 year ago

All Time
  • Total Commits: 103
  • Total Committers: 1
  • Avg Commits per committer: 103.0
  • Development Distribution Score (DDS): 0.0
Past Year
  • Commits: 0
  • Committers: 0
  • Avg Commits per committer: 0.0
  • Development Distribution Score (DDS): 0.0
Top Committers
Name Email Commits
Eric Leung e****c@e****m 103
Committer Domains (Top 20 + Academic)

Issues and Pull Requests

Last synced: 11 months ago

All Time
  • Total issues: 0
  • Total pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Total issue authors: 0
  • Total pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 0
  • Pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
Pull Request Authors
Top Labels
Issue Labels
Pull Request Labels