https://github.com/google-research/ingestables

https://github.com/google-research/ingestables

Science Score: 13.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (8.7%) to scientific vocabulary
Last synced: 5 months ago · JSON representation

Repository

Basic Info
  • Host: GitHub
  • Owner: google-research
  • License: apache-2.0
  • Language: Python
  • Default Branch: main
  • Size: 200 KB
Statistics
  • Stars: 11
  • Watchers: 1
  • Forks: 1
  • Open Issues: 0
  • Releases: 0
Created over 1 year ago · Last pushed 12 months ago
Metadata Files
Readme Changelog Contributing License

README.md

IngesTables: Scalable and Efficient Training of LLM-Enabled Tabular Foundation Models

Unittests PyPI version

This repository contains the implementation of IngesTables, a tabular foundation model. An earlier version of the model was accepted at the NeurIPS'23 Tabular Representation Learning Workshop. Stay tuned for updates!

It contains library code that defines the data preprocessing, training, and evaluation. It also contains scripts for running it locally or on Google Cloud Platform (GCP). Do note that using the GCP scripts may incur costs and would transmit data to GCP and be accessible to those who can access your GCP project.

🧑‍🏫 Tutorials

Here is the list of tutorials and reproducible experiments to get started with IngesTables for various tasks:

BibTeX

If you found any part of this codebase to be useful, please consider citing our work:

bibtex @inproceedings{yak2023ingestables, title={IngesTables: Scalable and Efficient Training of LLM-Enabled Tabular Foundation Models}, author={Scott Yak and Yihe Dong and Javier Gonzalvo and Sercan Arik}, booktitle={NeurIPS'23 Table Representation Learning Workshop}, year={2023} }

This is not an officially supported Google product.

Owner

  • Name: Google Research
  • Login: google-research
  • Kind: organization
  • Location: Earth

GitHub Events

Total
  • Watch event: 9
  • Delete event: 4
  • Issue comment event: 1
  • Public event: 1
  • Push event: 20
  • Pull request event: 9
  • Fork event: 1
  • Create event: 4
Last Year
  • Watch event: 9
  • Delete event: 4
  • Issue comment event: 1
  • Public event: 1
  • Push event: 20
  • Pull request event: 9
  • Fork event: 1
  • Create event: 4

Issues and Pull Requests

Last synced: 6 months ago

All Time
  • Total issues: 0
  • Total pull requests: 5
  • Average time to close issues: N/A
  • Average time to close pull requests: about 19 hours
  • Total issue authors: 0
  • Total pull request authors: 2
  • Average comments per issue: 0
  • Average comments per pull request: 0.2
  • Merged pull requests: 3
  • Bot issues: 0
  • Bot pull requests: 4
Past Year
  • Issues: 0
  • Pull requests: 5
  • Average time to close issues: N/A
  • Average time to close pull requests: about 19 hours
  • Issue authors: 0
  • Pull request authors: 2
  • Average comments per issue: 0
  • Average comments per pull request: 0.2
  • Merged pull requests: 3
  • Bot issues: 0
  • Bot pull requests: 4
Top Authors
Issue Authors
Pull Request Authors
  • copybara-service[bot] (4)
  • mononitogoswami (1)
Top Labels
Issue Labels
Pull Request Labels

Packages

  • Total packages: 1
  • Total downloads:
    • pypi 10 last-month
  • Total dependent packages: 0
  • Total dependent repositories: 0
  • Total versions: 1
  • Total maintainers: 1
pypi.org: ingestables

IngesTables: A Recipe for Tabular Foundation Models

  • Versions: 1
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Downloads: 10 Last month
Rankings
Dependent packages count: 9.6%
Average: 31.8%
Dependent repos count: 54.0%
Maintainers (1)
Last synced: 5 months ago

Dependencies

.github/workflows/pytest_and_autopublish.yml actions
  • actions/checkout v4 composite
  • actions/setup-python v5 composite
  • etils-actions/pypi-auto-publish v1 composite
ingestables/Dockerfile docker
  • pytorch/pytorch 2.4.0-cuda12.4-cudnn9-runtime build
pyproject.toml pypi
  • absl-py >=2.1
  • arff >=0.9
  • etils [epath]>=1.12
  • fiddle >=0.3
  • pandas >=2.1
  • scikit-learn >=1.6
  • tensorboard >=2.18
  • tensorflow-cpu >=2.18
  • torch >=2.4
  • transformers >=4.44