https://github.com/cty20010831/adsp_32014_final_project

https://github.com/cty20010831/adsp_32014_final_project

Science Score: 26.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
  • DOI references
    Found 5 DOI reference(s) in README
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (9.3%) to scientific vocabulary
Last synced: 10 months ago · JSON representation

Repository

Basic Info
  • Host: GitHub
  • Owner: cty20010831
  • Language: Jupyter Notebook
  • Default Branch: main
  • Size: 1.24 MB
Statistics
  • Stars: 0
  • Watchers: 1
  • Forks: 0
  • Open Issues: 0
  • Releases: 0
Created over 1 year ago · Last pushed over 1 year ago
Metadata Files
Readme

README.md

ADSP32014Final_Project - Bayesian Multimodal Fusion for Depression Classification

📌 Overview

This project applies Bayesian Machine Learning techniques to classify depression using speech embeddings from Wav2Vec 2.0 and features extracted from EEG signals. The goal is to quantify uncertainty in predictions and enhance model interpretability for clinical decision-making.

🚀 Key Bayesian Methods Used: - Bayesian Logistic Regression (BLR) - Bayesian Gaussian Process Classification (GPC) - Bayesian Neural Networks (BNN) (with uncertainty-aware predictions)


🎯 Motivation

Major Depressive Disorder (MDD) is often underdiagnosed due to subjective assessments.

🔹 Why Bayesian Learning? It provides: - ✅ Uncertainty estimation (crucial for clinical AI)
- ✅ Better generalization over small datasets
- ✅ Robustness to noise in speech data


📂 Dataset

Overview

We utilize an open‐source multimodal dataset for mental‐disorder analysis as described in Cai et al., 2022. This dataset contains EEG recordings and spoken language data collected from clinically depressed patients (MDD) and matching healthy controls (HC). The dataset was carefully curated by clinical experts and includes both physiological signals and behavioral assessments, providing a comprehensive foundation for developing machine learning models for depression classification.

Data Components

The dataset is organized into three main components:

1. Full Brain 128-Electrodes EEG Data

  • Acquisition:
    • Collected using a traditional 128-electrodes mounted elastic cap (HydroCel Geodesic Sensor Net, Electrical Geodesics Inc., Oregon, USA).
  • Tasks:
    • Resting State: Five minutes of eyes-closed resting-state EEG.
    • Dot-Probe Task: EEG recordings during a dot-probe experiment that assesses attentional bias with facial stimuli.
  • Participants:
    • 53 subjects (24 diagnosed with MDD, 29 healthy controls).

2. 3-Electrodes EEG Data

  • Acquisition:
    • Recorded using a wearable 3-electrode EEG collector designed for pervasive computing.
  • Task:
    • Resting-state EEG.
  • Participants:
    • 55 subjects (26 diagnosed with MDD, 29 healthy controls).

3. Recordings of Spoken Language

  • Acquisition:
    • Audio recordings in uncompressed WAV format, captured in a quiet, soundproof environment.
  • Tasks:
    • Interview: 18 questions covering positive, neutral, and negative emotional content (e.g., travel plans, gifts, self-evaluation, sleep difficulties).
    • Reading: A short story ("The North Wind and the Sun") and readings of word groups with distinct emotional valences (positive, neutral, negative).
    • Picture Description: Participants describe images (four images in total), with three images selected for positive, neutral, and negative expressions and one image from the Thematic Apperception Test (TAT).
  • Participants:
    • 52 subjects (23 diagnosed with MDD, 29 healthy controls).

Experimental Protocol

  • Participant Selection:
    Participants were carefully diagnosed by professional psychiatrists using structured interviews such as the Mini-International Neuropsychiatric Interview (MINI) and clinical scales like the PHQ-9. Inclusion and exclusion criteria ensured high-quality data collection.

  • Data Recording:

    • EEG Data: Recorded at a sampling rate of 250 Hz. For full-brain EEG, impedance was maintained below 50 kΩ. Data are provided in multiple formats (e.g., .mff, .mat, .EDF) and in BIDS format.
    • Audio Data: Recorded using high-fidelity microphones (e.g., Neumann TLM102) at 44.1 kHz and 24-bit depth. Recordings were manually segmented and labeled.
  • Data Storage:
    The dataset is available as safeguarded data on the UK Data Archive’s ReShare repository. Access requires registration with the UK Data Service and agreement to their End User License conditions.

Accessing the Dataset

The dataset can be downloaded from the publicly accessible repository after registration: - UK Data Service ReShare

It is also available at: - MODMA Dataset

Reference

For full details, please refer to the original publication:

Cai, H., Yuan, Z., Gao, Y., Sun, S., Li, N., Tian, F., et al. (2022). A multi-modal open dataset for mental-disorder analysis. Scientific Data, 9:178.
DOI: 10.1038/s41597-022-01211-x


Owner

  • Login: cty20010831
  • Kind: user

GitHub Events

Total
  • Watch event: 1
  • Public event: 1
  • Push event: 21
  • Fork event: 2
Last Year
  • Watch event: 1
  • Public event: 1
  • Push event: 21
  • Fork event: 2

Dependencies

Sam/requirements.txt pypi
  • ipykernel *
  • ipywidgets *
  • librosa *
  • matplotlib *
  • numpy *
  • openpyxl *
  • pandas *
  • pymc *
  • pytensor *
  • scikit-learn *
  • torch *
  • torchaudio *
  • torchvision *
  • transformers *