https://github.com/ammarlodhi255/student_performance_indicator_end-to-end_implementation

An end-to-end machine learning project, student performance indicator. The goal of this project is to understand the influence of the parents background, test preparation, and various other variables on the students performance.

https://github.com/ammarlodhi255/student_performance_indicator_end-to-end_implementation

Science Score: 13.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (13.2%) to scientific vocabulary

Keywords

aws cd-pipeline data-analysis data-science data-science-projects eda end-to-end-machine-learning machine-learning machine-learning-projects regression regression-analysis
Last synced: 5 months ago · JSON representation

Repository

An end-to-end machine learning project, student performance indicator. The goal of this project is to understand the influence of the parents background, test preparation, and various other variables on the students performance.

Basic Info
  • Host: GitHub
  • Owner: ammarlodhi255
  • Language: Jupyter Notebook
  • Default Branch: main
  • Homepage:
  • Size: 2 MB
Statistics
  • Stars: 11
  • Watchers: 1
  • Forks: 1
  • Open Issues: 0
  • Releases: 0
Topics
aws cd-pipeline data-analysis data-science data-science-projects eda end-to-end-machine-learning machine-learning machine-learning-projects regression regression-analysis
Created almost 3 years ago · Last pushed almost 3 years ago
Metadata Files
Readme

README.md

Student Performance Indicator

Information About the Dataset:

The dataset The goal of this project is to understand the influence of the parents background, test preparation, and various other variables on the students math score.

There are 8 independent variables:

  • gender : Sex of a student (Male/Female)
  • race/ethnicity : Ethnicity of a student (Group A,B,C,D,E)
  • parental level of education : parents' final education (bachelor's degree,some college,master's degree,associate's degree,high school)
  • lunch : What type of lunch the student had before test (standard or free/reduced)
  • test preparation course : Whether the student completed any preparation course before the test.
  • reading score : Reading score obtained by the student.
  • writing score : Writing score obtained by the student.

Target variable:

  • math score: Math score of a student.

Dataset Source Link : https://www.kaggle.com/datasets/spscientist/students-performance-in-exams?resource=download

Animation of UI

HomepageUI

Project Development Approach

  1. Data Ingestion :
  • In Data Ingestion phase the data is first read as csv.
  • Then the data is split into training and testing and saved as csv file.
  1. Data Transformation :
  • In this phase a ColumnTransformer Pipeline is created.
  • for Numeric Variables first SimpleImputer is applied with strategy median (because there were some outliers in the data), then standard scaling is performed on numeric data.
  • for Categorical Variables SimpleImputer is applied with most frequent strategy, then one-hot-encoding is performed, after this data is scaled with standard scaler.
  • This preprocessor is saved as pkl file inside the artifacts folder.
  1. Model Training :
  • In this phase, all the models are trained and evaluated. The best model found was Linear Regression.
  • After this hyperparameter tuning is also performed prior to selecting the best model.
  • This model is saved as pickle file to be used for the predict pipeline.
  1. Prediction Pipeline :

    • This pipeline converts given data into dataframe and has various functions to load pickle files and predict the final results in python.
  2. Flask App creation :

    • Flask app is created with User Interface to predict the math score of a student given the required features inside a Web Application.

Exploratory Data Analysis Notebook

Link : EDA Notebook

Model Training Approach Notebook

Link : Model Training Notebook

Usage:

  1. conda create -p std python=3.8 -y
  2. conda activate std/
  3. pip install -r requirements.txt
  4. Execute app.py
  5. Access http://127.0.0.1:5000/

Owner

  • Name: Ammar Ahmed
  • Login: ammarlodhi255
  • Kind: user
  • Location: Sukkur, Pakistan

A computer scientist at heart, interested in AI, software development, and space.

GitHub Events

Total
  • Watch event: 1
Last Year
  • Watch event: 1

Issues and Pull Requests

Last synced: about 1 year ago

All Time
  • Total issues: 0
  • Total pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Total issue authors: 0
  • Total pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 0
  • Average time to close issues: N/A
  • Average time to close pull requests: N/A
  • Issue authors: 0
  • Pull request authors: 0
  • Average comments per issue: 0
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
Pull Request Authors
Top Labels
Issue Labels
Pull Request Labels