predict-health-insurance-amount

Deep Neural Network web application to predict health insurance amount and to forecast the users monthly medical expense.

https://github.com/snehaveerakumar/predict-health-insurance-amount

Science Score: 44.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (14.1%) to scientific vocabulary

Keywords

deep-neural-networks machine-learning python
Last synced: 6 months ago · JSON representation ·

Repository

Deep Neural Network web application to predict health insurance amount and to forecast the users monthly medical expense.

Basic Info
Statistics
  • Stars: 1
  • Watchers: 1
  • Forks: 0
  • Open Issues: 0
  • Releases: 0
Topics
deep-neural-networks machine-learning python
Created over 3 years ago · Last pushed about 3 years ago
Metadata Files
Readme Citation

README.md


CAP4001 Capstone Project | Vellore Institute of Technology-AP

Project work carried out during the penultimate semester of study for the credits prescribed under University Core of the curriculum, related to the specialization of the programme by applying the knowledge gained in the courses we have undergone so far.


Prediction of Health Insurance Amount Using Deep Neural Network

Table of Contents
  1. About The Project
  2. Project set-up
  3. Docker Desktop to deploy app
  4. Docker Hub Repository
  5. Project folder description
  6. Contributing
  7. Dataset links

About The Project

The purpose of this project work was to build an efficient model to predict the health insurance amount. Various machine learning models were compared with Deep Neural Network model. With the evaluation metric being Mean Squared Error, DNN model outperformed the machine learning models, which is then used to serve the end users. The end-to-end application ensures the continuous integration and continuous deployment of the application to the users. Using Agile Software Development Lifecycle, the project is built following all the engineering standards, rules and regulations.

Built with

Flask, JavaScript, Azure, Docker,Visual Studio Code and Jupyter


Project set-up

Note for Windows user : Before step-3, uncomment line no.95 to 101 and delete the line no.116.

  1. Clone the repository. sh git clone https://github.com/SnehaVeerakumar/Predict-health-insurance-amount.git
  2. Activate virtual environment. sh virtualenv venv venv\Scripts\activate
  3. Install necessary python packages. sh pip install -r requirements.txt
  4. Start the server sh python app.py

Project set-up in Docker Desktop

  1. Build using Docker Image sh docker build -t app_name:tag_name .
  2. Run on port 5000 sh docker run -p 5000:5000 app_name:tag_name

Docker Hub Repository

Link : https://hub.docker.com/r/snehaveerakumar/insuranceprediction
Pull Image sh docker pull snehaveerakumar/insuranceprediction

Project folder description

  1. Parent folder contains 3 important files.

    • app.py : Acts as the entry point to the application. It contains the URL routes and the codes for machine learning models.
    • Dockerfile : Contains the instructions to build a docker image.
    • requirements.txt : Contains the packages required to deploy the application.

  2. Folder : Code

      Three sub folders in this folder contains the code for frontend and backend of the application. Jupyter Notebook has been used to analyse various machine learning models and deep neural network models and compared to select the best model using Mean Squared Error.

  3. Folder : Dataset

      Contains all the dataset that is required for the project. Processed data and user data will also get saved in this folder for the further processing

Contributing

  1. Fork the Project
  2. Create your Feature Branch (git checkout -b test)
  3. Commit your Changes (git commit -m 'message')
  4. Push to the Branch (git push origin test)
  5. Open a Pull Request

Dataset links

  1. Disease indicators : https://www.kaggle.com/datasets/cdc/behavioral-risk-factor-surveillance-system
  2. Insurance amount(insurance.csv) : https://www.kaggle.com/datasets/annetxu/health-insurance-cost-prediction

(back to top)

Citation (CITATION.cff)

cff-version: 1.0.0
message: "If you use this software, please cite it as below."
authors:
  - given-names: Sneha Veerakumar
title: "Predict Health Insurance Amount Using DNN"
version: 1.0.0
date-released: 2022-12-26

GitHub Events

Total
Last Year

Dependencies

Dockerfile docker
  • python 3.10 build
requirements.txt pypi
  • Flask ==2.2.2
  • ImageHash ==4.3.1
  • Jinja2 ==3.1.2
  • Markdown ==3.4.1
  • MarkupSafe ==2.1.1
  • Pillow ==9.3.0
  • PyWavelets ==1.4.1
  • PyYAML ==6.0
  • Pygments ==2.13.0
  • Werkzeug ==2.2.2
  • absl-py ==1.3.0
  • adjustText ==0.7.3
  • asttokens ==2.1.0
  • astunparse ==1.6.3
  • attrs ==22.1.0
  • backcall ==0.2.0
  • cachetools ==5.2.0
  • certifi ==2022.9.24
  • charset-normalizer ==2.1.1
  • click ==8.1.3
  • colorama ==0.4.6
  • comm ==0.1.1
  • cycler ==0.11.0
  • dash-core-components ==2.0.0
  • dash-html-components ==2.0.0
  • dash-table ==5.0.0
  • debugpy ==1.6.3
  • decorator ==5.1.1
  • entrypoints ==0.4
  • executing ==1.2.0
  • fastjsonschema ==2.16.2
  • flatbuffers ==22.11.23
  • fonttools ==4.38.0
  • gast ==0.4.0
  • google-auth ==2.14.1
  • google-auth-oauthlib ==0.4.6
  • google-pasta ==0.2.0
  • grpcio ==1.50.0
  • h5py ==3.7.0
  • htmlmin ==0.1.12
  • idna ==3.4
  • ipykernel ==6.18.1
  • ipympl ==0.9.2
  • ipython ==8.7.0
  • ipython-genutils ==0.2.0
  • ipywidgets ==8.0.2
  • itsdangerous ==2.1.2
  • jedi ==0.18.2
  • joblib ==1.2.0
  • jsonschema ==4.17.0
  • jupyter_client ==7.4.7
  • jupyter_core ==5.1.0
  • jupyterlab-widgets ==3.0.3
  • keras ==2.11.0
  • kiwisolver ==1.4.4
  • libclang ==14.0.6
  • matplotlib ==3.5.3
  • matplotlib-inline ==0.1.6
  • multimethod ==1.9
  • nbformat ==5.7.0
  • nest-asyncio ==1.5.6
  • networkx ==2.8.8
  • numpy ==1.23.4
  • oauthlib ==3.2.2
  • opt-einsum ==3.3.0
  • packaging ==21.3
  • pandas ==1.5.1
  • parso ==0.8.3
  • patsy ==0.5.3
  • pickleshare ==0.7.5
  • platformdirs ==2.5.4
  • plotly ==5.11.0
  • prompt-toolkit ==3.0.32
  • protobuf ==3.19.6
  • psutil ==5.9.3
  • pure-eval ==0.2.2
  • pyasn1 ==0.4.8
  • pyasn1-modules ==0.2.8
  • pydantic ==1.10.2
  • pyparsing ==3.0.9
  • pyrsistent ==0.19.2
  • python-dateutil ==2.8.2
  • pytz ==2022.6
  • pyzmq ==24.0.1
  • requests ==2.28.1
  • requests-oauthlib ==1.3.1
  • rsa ==4.9
  • scikit-learn ==1.1.3
  • scipy ==1.9.3
  • seaborn ==0.12.1
  • six ==1.16.0
  • sklearn ==0.0.post1
  • stack-data ==0.6.2
  • tangled-up-in-unicode ==0.2.0
  • tenacity ==8.1.0
  • tensorboard ==2.11.0
  • tensorboard-data-server ==0.6.1
  • tensorboard-plugin-wit ==1.8.1
  • tensorflow ==2.11.0
  • tensorflow-estimator ==2.11.0
  • tensorflow-intel ==2.11.0
  • tensorflow-io-gcs-filesystem ==0.28.0
  • termcolor ==2.1.1
  • threadpoolctl ==3.1.0
  • tornado ==6.2
  • tqdm ==4.64.1
  • traitlets ==5.5.0
  • typing_extensions ==4.4.0
  • urllib3 ==1.26.12
  • visions ==0.7.5
  • wcwidth ==0.2.5
  • widgetsnbextension ==4.0.3
  • wrapt ==1.14.1
  • xgboost ==1.7.1