predict-health-insurance-amount
Deep Neural Network web application to predict health insurance amount and to forecast the users monthly medical expense.
https://github.com/snehaveerakumar/predict-health-insurance-amount
Science Score: 44.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
○DOI references
-
○Academic publication links
-
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (14.1%) to scientific vocabulary
Keywords
Repository
Deep Neural Network web application to predict health insurance amount and to forecast the users monthly medical expense.
Basic Info
- Host: GitHub
- Owner: SnehaVeerakumar
- Language: Jupyter Notebook
- Default Branch: main
- Homepage: https://insuranceprediction.azurewebsites.net
- Size: 22.3 MB
Statistics
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0
- Releases: 0
Topics
Metadata Files
README.md
CAP4001 Capstone Project | Vellore Institute of Technology-AP
Project work carried out during the penultimate semester of study for the credits prescribed under University Core of the curriculum, related to the specialization of the programme by applying the knowledge gained in the courses we have undergone so far.
Prediction of Health Insurance Amount Using Deep Neural Network
Table of Contents
About The Project
The purpose of this project work was to build an efficient model to predict the health insurance amount. Various machine learning models were compared with Deep Neural Network model. With the evaluation metric being Mean Squared Error, DNN model outperformed the machine learning models, which is then used to serve the end users. The end-to-end application ensures the continuous integration and continuous deployment of the application to the users. Using Agile Software Development Lifecycle, the project is built following all the engineering standards, rules and regulations.
Built with
Flask, JavaScript, Azure, Docker,Visual Studio Code and Jupyter
Project set-up
Note for Windows user : Before step-3, uncomment line no.95 to 101 and delete the line no.116.
- Clone the repository.
sh git clone https://github.com/SnehaVeerakumar/Predict-health-insurance-amount.git - Activate virtual environment.
sh virtualenv venv venv\Scripts\activate - Install necessary python packages.
sh pip install -r requirements.txt - Start the server
sh python app.py
Project set-up in Docker Desktop
- Build using Docker Image
sh docker build -t app_name:tag_name . - Run on port 5000
sh docker run -p 5000:5000 app_name:tag_name
Docker Hub Repository
Link : https://hub.docker.com/r/snehaveerakumar/insuranceprediction
Pull Image
sh
docker pull snehaveerakumar/insuranceprediction
Project folder description
Parent folder contains 3 important files.
- app.py : Acts as the entry point to the application. It contains the URL routes and the codes for machine learning models.
- Dockerfile : Contains the instructions to build a docker image.
- requirements.txt : Contains the packages required to deploy the application.
Folder : Code
- Three sub folders in this folder contains the code for frontend and backend of the application. Jupyter Notebook has been used to analyse various machine learning models and deep neural network models and compared to select the best model using Mean Squared Error.
Folder : Dataset
- Contains all the dataset that is required for the project. Processed data and user data will also get saved in this folder for the further processing
Contributing
- Fork the Project
- Create your Feature Branch (
git checkout -b test) - Commit your Changes (
git commit -m 'message') - Push to the Branch (
git push origin test) - Open a Pull Request
Dataset links
- Disease indicators : https://www.kaggle.com/datasets/cdc/behavioral-risk-factor-surveillance-system
- Insurance amount(insurance.csv) : https://www.kaggle.com/datasets/annetxu/health-insurance-cost-prediction
Citation (CITATION.cff)
cff-version: 1.0.0 message: "If you use this software, please cite it as below." authors: - given-names: Sneha Veerakumar title: "Predict Health Insurance Amount Using DNN" version: 1.0.0 date-released: 2022-12-26
GitHub Events
Total
Last Year
Dependencies
- python 3.10 build
- Flask ==2.2.2
- ImageHash ==4.3.1
- Jinja2 ==3.1.2
- Markdown ==3.4.1
- MarkupSafe ==2.1.1
- Pillow ==9.3.0
- PyWavelets ==1.4.1
- PyYAML ==6.0
- Pygments ==2.13.0
- Werkzeug ==2.2.2
- absl-py ==1.3.0
- adjustText ==0.7.3
- asttokens ==2.1.0
- astunparse ==1.6.3
- attrs ==22.1.0
- backcall ==0.2.0
- cachetools ==5.2.0
- certifi ==2022.9.24
- charset-normalizer ==2.1.1
- click ==8.1.3
- colorama ==0.4.6
- comm ==0.1.1
- cycler ==0.11.0
- dash-core-components ==2.0.0
- dash-html-components ==2.0.0
- dash-table ==5.0.0
- debugpy ==1.6.3
- decorator ==5.1.1
- entrypoints ==0.4
- executing ==1.2.0
- fastjsonschema ==2.16.2
- flatbuffers ==22.11.23
- fonttools ==4.38.0
- gast ==0.4.0
- google-auth ==2.14.1
- google-auth-oauthlib ==0.4.6
- google-pasta ==0.2.0
- grpcio ==1.50.0
- h5py ==3.7.0
- htmlmin ==0.1.12
- idna ==3.4
- ipykernel ==6.18.1
- ipympl ==0.9.2
- ipython ==8.7.0
- ipython-genutils ==0.2.0
- ipywidgets ==8.0.2
- itsdangerous ==2.1.2
- jedi ==0.18.2
- joblib ==1.2.0
- jsonschema ==4.17.0
- jupyter_client ==7.4.7
- jupyter_core ==5.1.0
- jupyterlab-widgets ==3.0.3
- keras ==2.11.0
- kiwisolver ==1.4.4
- libclang ==14.0.6
- matplotlib ==3.5.3
- matplotlib-inline ==0.1.6
- multimethod ==1.9
- nbformat ==5.7.0
- nest-asyncio ==1.5.6
- networkx ==2.8.8
- numpy ==1.23.4
- oauthlib ==3.2.2
- opt-einsum ==3.3.0
- packaging ==21.3
- pandas ==1.5.1
- parso ==0.8.3
- patsy ==0.5.3
- pickleshare ==0.7.5
- platformdirs ==2.5.4
- plotly ==5.11.0
- prompt-toolkit ==3.0.32
- protobuf ==3.19.6
- psutil ==5.9.3
- pure-eval ==0.2.2
- pyasn1 ==0.4.8
- pyasn1-modules ==0.2.8
- pydantic ==1.10.2
- pyparsing ==3.0.9
- pyrsistent ==0.19.2
- python-dateutil ==2.8.2
- pytz ==2022.6
- pyzmq ==24.0.1
- requests ==2.28.1
- requests-oauthlib ==1.3.1
- rsa ==4.9
- scikit-learn ==1.1.3
- scipy ==1.9.3
- seaborn ==0.12.1
- six ==1.16.0
- sklearn ==0.0.post1
- stack-data ==0.6.2
- tangled-up-in-unicode ==0.2.0
- tenacity ==8.1.0
- tensorboard ==2.11.0
- tensorboard-data-server ==0.6.1
- tensorboard-plugin-wit ==1.8.1
- tensorflow ==2.11.0
- tensorflow-estimator ==2.11.0
- tensorflow-intel ==2.11.0
- tensorflow-io-gcs-filesystem ==0.28.0
- termcolor ==2.1.1
- threadpoolctl ==3.1.0
- tornado ==6.2
- tqdm ==4.64.1
- traitlets ==5.5.0
- typing_extensions ==4.4.0
- urllib3 ==1.26.12
- visions ==0.7.5
- wcwidth ==0.2.5
- widgetsnbextension ==4.0.3
- wrapt ==1.14.1
- xgboost ==1.7.1