https://github.com/chris-santiago/met
Reproducing the MET framework with PyTorch
Science Score: 36.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
○DOI references
-
○Academic publication links
-
✓Committers with academic emails
1 of 1 committers (100.0%) from academic institutions -
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (13.1%) to scientific vocabulary
Keywords
Repository
Reproducing the MET framework with PyTorch
Basic Info
Statistics
- Stars: 5
- Watchers: 1
- Forks: 1
- Open Issues: 1
- Releases: 0
Topics
Metadata Files
README.md
MET - PyTorch
This repo reproduces the MET (Masked Encoding for Tabular Data) framework for self-supervised learning with tabular data.
Authors: Kushal Majmundar, Sachin Goyal, Praneeth Netrapalli, Prateek Jain
Reference: Kushal Majmundar, Sachin Goyal, Praneeth Netrapalli, Prateek Jain, "MET: Masked Encoding for Tabular Data," Neural Information Processing Systems (NeurIPS), 2022.
Original paper: https://table-representation-learning.github.io/assets/papers/metmaskedencodingfortabula.pdf
Original repo: https://github.com/google-research/met
Install
Clone this repository, create a new Conda environment and
bash
git clone https://github.com/chris-santiago/met.git
conda env create -f environment.yml
cd met
pip install -e .
Use
Prerequisites
Hydra
This project uses Hydra for managing configuration CLI arguments. See met/conf for full
configuration details.
Task
This project uses Task as a task runner. Though the underlying Python
commands can be executed without it, we recommend installing Task
for ease of use. Details located in Taskfile.yml.
Current commands
```bash
task -l task: Available tasks for this project: * check-config: Check Hydra configuration * compare: Compare using linear baselines * train: Train a model * wandb: Login to Weights & Biases ```
Example: Train model and for adult-income dataset experiment
The -- forwards CLI arguments to Hydra.
bash
task train -- experiment=income
PDM
This project was built using this cookiecutter and is setup to use PDM for dependency management, though it's not required for package installation.
Weights and Biases
This project is set up to log experiment results with Weights and Biases. It
expects an API key within a .env file in the root directory:
toml
WANDB_KEY=<my-super-secret-key>
Users can configure different logger(s) within the conf/trainer/default.yaml file.
Owner
- Name: Chris Santiago
- Login: chris-santiago
- Kind: user
- Repositories: 64
- Profile: https://github.com/chris-santiago
GitHub Events
Total
- Issues event: 1
- Watch event: 1
- Fork event: 1
Last Year
- Issues event: 1
- Watch event: 1
- Fork event: 1
Committers
Last synced: 7 months ago
Top Committers
| Name | Commits | |
|---|---|---|
| chris-santiago | c****o@g****u | 40 |
Committer Domains (Top 20 + Academic)
Issues and Pull Requests
Last synced: 7 months ago
All Time
- Total issues: 0
- Total pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Total issue authors: 0
- Total pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 0
- Pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Issue authors: 0
- Pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
- jawwada (1)
Pull Request Authors
Top Labels
Issue Labels
Pull Request Labels
Dependencies
- actions/cache v2 composite
- actions/checkout v3 composite
- actions/setup-python v4 composite
- actions/checkout v3 composite
- actions/setup-python v3 composite
- pypa/gh-action-pypi-publish 27b31702a0e7fc50959f5ad993c78deac1bdfc29 composite
- feature-engine >=1.6.2
- hydra-core >=1.3.2
- hydra-joblib-launcher >=1.2.0
- python-dotenv >=1.0.0
- pytorch-lightning >=2.1.2
- rich >=13.7.0
- scikit-learn >=1.3.2
- torch >=2.1.1
- torchmetrics >=1.2.0
- torchvision >=0.16.1
- wandb >=0.16.0
- Click ==8.1.7
- GitPython ==3.1.40
- MarkupSafe ==2.1.3
- PyYAML ==6.0.1
- aiohttp ==3.9.1
- aiosignal ==1.3.1
- antlr4-python3-runtime ==4.9.3
- appdirs ==1.4.4
- async-timeout ==4.0.3
- attrs ==23.1.0
- certifi ==2023.11.17
- charset-normalizer ==3.3.2
- docker-pycreds ==0.4.0
- feature-engine ==1.6.2
- filelock ==3.13.1
- frozenlist ==1.4.0
- fsspec ==2023.10.0
- gitdb ==4.0.11
- hydra-core ==1.3.2
- hydra-joblib-launcher ==1.2.0
- idna ==3.6
- jinja2 ==3.1.2
- joblib ==1.3.2
- lightning-utilities ==0.10.0
- markdown-it-py ==3.0.0
- mdurl ==0.1.2
- mpmath ==1.3.0
- multidict ==6.0.4
- networkx ==3.2.1
- numpy ==1.26.2
- omegaconf ==2.3.0
- packaging ==23.2
- pandas ==2.1.3
- patsy ==0.5.4
- pillow ==10.1.0
- protobuf ==4.25.1
- psutil ==5.9.6
- pygments ==2.17.2
- python-dateutil ==2.8.2
- python-dotenv ==1.0.0
- pytorch-lightning ==2.1.2
- pytz ==2023.3.post1
- requests ==2.31.0
- rich ==13.7.0
- scikit-learn ==1.3.2
- scipy ==1.11.4
- sentry-sdk ==1.37.1
- setproctitle ==1.3.3
- setuptools ==69.0.2
- six ==1.16.0
- smmap ==5.0.1
- statsmodels ==0.14.0
- sympy ==1.12
- threadpoolctl ==3.2.0
- torch ==2.1.1
- torchmetrics ==1.2.0
- torchvision ==0.16.1
- tqdm ==4.66.1
- typing-extensions ==4.8.0
- tzdata ==2023.3
- urllib3 ==2.1.0
- wandb ==0.16.0
- yarl ==1.9.3
- bzip2 1.0.8.*
- ca-certificates 2022.12.7.*
- libffi 3.4.2.*
- libsqlite 3.40.0.*
- libzlib 1.2.13.*
- ncurses 6.3.*
- openssl 3.1.0.*
- pip 23.1.*
- python 3.10.*
- readline 8.2.*
- setuptools 67.6.1.*
- tk 8.6.12.*
- tzdata 2023c.*
- wheel 0.40.0.*
- xz 5.2.6.*