https://github.com/aksholokhov/msr3-paper

Scripts for reproducing results from my MSR3 paper

Science Score: 10.0%

This score indicates how likely this project is to be science-related based on various indicators:

○
CITATION.cff file
○
codemeta.json file
○
.zenodo.json file
○
DOI references
✓
Academic publication links
Links to: arxiv.org, pubmed.ncbi, ncbi.nlm.nih.gov
○
Academic email domains
○
Institutional organization owner
○
JOSS paper metadata
○
Scientific vocabulary similarity
Low similarity (13.2%) to scientific vocabulary

Last synced: 10 months ago · JSON representation

Repository

Scripts for reproducing results from my MSR3 paper

Basic Info

Host: GitHub
Owner: aksholokhov
License: gpl-3.0
Language: Python
Default Branch: main
Size: 956 KB

Statistics

Stars: 0
Watchers: 1
Forks: 0
Open Issues: 0
Releases: 0

Created over 3 years ago · Last pushed over 3 years ago

https://github.com/aksholokhov/msr3-paper/blob/main/

## MSR3 Reproducibility Guide

This notebook (**walkthrough.ipynb**) explains how to reproduce our results from [Sholokhov et. al. (2022): "A Relaxation Approach to Feature Selection for Linear Mixed Effects Models"](https://arxiv.org/abs/2205.06925?context=stat).

This repository only contains the code that uses [pysr3](https://github.com/aksholokhov/pysr3) library to produce figures and tables from the paper above. To learn more about pysr3 go here: [quickstart](https://github.com/aksholokhov/pysr3), [documentation](https://aksholokhov.github.io/pysr3/), [models overview](https://aksholokhov.github.io/pysr3/models_overview.html).

In our research group we take reproducibility seriously. Please open an issue in this repository if you can not reproduce our results with this notebook. We developed it using our current best knowledge on reproducibility which is imperfect and evolves as we receive more feedback.

## Installation

First, checkout this repo

`git clone https://github.com/aksholokhov/msr3-paper`

You'll need to open **walkthrough.ipynb** -- this jupyter notebook. 

As of today, the code works with any version of python starting 3.8.11.

You need to install the following packages to your python environment to run this notebook:
  - pysr3>=0.3.4
  - pandas
  - dask
  - distributed
  - matplotlib
  - seaborn
  - tqdm
  - rpy2

Alternatively, you can use our *environment.yml* to create a new clean environment (**recommended**):

Execute the lines below in your terminal

`conda env create -f environment.yml`

`conda activate msr3-paper`

`python -m ipykernel install --user --name msr3`

Now in the menu above (Kernel->Change Kernel) you should be able to change your Jupyter Kernel to "msr3" that has all the necessary packages installed


```python
%matplotlib auto
```

    Using matplotlib backend:

	Regilarizer	Metric	PGD	MSR3	MSR3-fast
0	L0	Accuracy	0.90	0.95	0.95
1	L0	Time	23.18	109.44	0.56
2	SCAD	Accuracy	0.80	0.95	0.95
3	SCAD	Time	50.17	9.43	0.75
4	L1	Accuracy	0.80	0.95	0.95
5	L1	Time	17.37	10.34	0.51
6	ALASSO	Accuracy	0.95	0.95	0.95
7	ALASSO	Time	28.02	77.06	0.85

	Unnamed: 0	SR3-L1-P	glmmLasso	lmmLasso
0	Accuracy	95 (95-95)	45 (45-45)	72 (72-72)
1	FE Accuracy	95 (95-95)	45 (45-45)	45 (45-45)
2	RE Accuracy	95 (95-95)	45 (45-45)	100 (100-100)
3	F1	94 (94-94)	61 (61-61)	77 (77-77)
4	FE F1	95 (95-95)	59 (59-59)	62 (62-62)
5	RE F1	94 (94-94)	62 (62-62)	100 (100-100)
6	Time	0.14 (0.14-0.14)	0.55 (0.55-0.55)	6.13 (6.13-6.13)
7	Iterations	90 (90-90)	53 (53-53)	0 (0-0)

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Open Source Science

https://github.com/aksholokhov/msr3-paper

Science Score: 10.0%

Repository

Basic Info

Statistics

https://github.com/aksholokhov/msr3-paper/blob/main/

Owner

GitHub Events

Total

Last Year