probabilisticml
Seminar in Probabilistic Machine Learning
Science Score: 44.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
○DOI references
-
○Academic publication links
-
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (12.2%) to scientific vocabulary
Keywords
Repository
Seminar in Probabilistic Machine Learning
Basic Info
Statistics
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
- Releases: 0
Topics
Metadata Files
README.md
Probabilistic Machine Learning Seminar
About
This repository contains the code and experiments for my seminar thesis in the Probabilistic Machine Learning course (SoSe 2025).
The work focuses on exploring uncertainty in neural networks using partial Bayesian neural networks, with a particular emphasis on:
- Bayesian last-layer (Neural Linear Approximations)
- Comparison to fully deterministic neural networks
- Evaluation of uncertainty calibration and overconfidence on in-distribution and out-of-distribution (OOD) data
Core Topics
- Probabilistic modeling in neural networks
- Bayesian linear regression as output layer
- Predictive entropy and epistemic uncertainty
- OOD detection via uncertainty quantification
- Comparison to MAP-trained deterministic models
Project Structure
src/– Jupyter notebooks for experiments and visualizationsSimulation.ipynb– 1D toy regression to visualize predictive uncertaintyRealData.ipynb– Evaluation on real-world dataBayesianLLNN.py- Implementation of Bayesian Last LayerbaseNN.py- base deterministic NN
models/– ModelsbaseBayes.pth– base Neural Network for Classficationbasenn.pth- Standard ReLU neural networkbest_lastlayer.pth- – Neural Linear Model (Bayesian output layer)
Data/– DataDataset of Diabetes.csv- Dataset in csv format
results/– Generated plots and saved evaluation resultsrequirements.txt– Python dependenciesREADME.md– This file
To set up the project environment and reproduce the results, follow these steps:
- Clone the repository
bash: git clone https://github.com/yourusername/probabilistic-seminar.git cd probabilistic-seminar - create virtual environment ``` # on Mac os: python -m venv venv source venv/bin/activate
# on Windows:
venv\Scripts\activate
3. **install dependencies**
pip install -r requirements.txt
```
- Run experiments
For simluation study:
Open and run src/Simulation.ipynb
For real-world classification:
Open and run src/RealData.ipynb
Datasets Used
- Synthetic (toy) regression dataset — for visualizing predictive uncertainty
- Real-world dataset
- Diabetes Diagnosis: available at Kaggle
- In-distribution: known label classes
- OOD: unrelated or random inputs for uncertainty evaluation
Goal of the Work
To evaluate whether a simple Bayesian approximation (only in the last layer) is sufficient to capture meaningful uncertainty in predictions — particularly in OOD settings — and to compare its behavior to a fully deterministic ReLU network.
LICENSE
This project is not open-source. All rights reserved. More infos found on License Unauthorized use, copying, or distribution is prohibited.
Owner
- Name: Bakir Chaban
- Login: Kingmopser
- Kind: user
- Location: Munich, Germany
- Repositories: 1
- Profile: https://github.com/Kingmopser
Undergraduate in Data Science and Statistics
Citation (Citation.CFF)
cff-version: 1.2.0 message: "If you use this software, please cite it as below." authors: - family-names: "Chaban" given-names: "Bakir Munir" title: "ProbabilisticML" version: 1.0.0 doi: 10.5281/zenodo.1234 date-released: 2025-05-22 url: "https://github.com/Kingmopser/ProbabilisticML"
GitHub Events
Total
- Push event: 40
- Create event: 2
Last Year
- Push event: 40
- Create event: 2
Dependencies
- matplotlib *
- numpy <2
- pandas ==2.2.3
- scikit-learn ==1.6.1
- seaborn *
- torch *