https://github.com/amazon-science/robust-tableqa
Two approaches for robust TableQA: 1) ITR is a general-purpose retrieval-based approach for handling long tables in TableQA transformer models. 2) LI-RAGE is a robust framework for open-domain TableQA which addresses several limitations. (ACL 2023)
https://github.com/amazon-science/h3-indexer
The h3-indexer is an open-source package for indexing geospatial data using PySpark, Apache Sedona, and the H3 hierarchical spatial indexing system. It maps any number of vector-type geospatial datasets to H3 grids for efficient spatial analysis and querying.
https://github.com/amazon-science/idioms-incontext-mt
Idioms in context dataset.
https://github.com/amazon-science/job-posting-structure
Extract structured information from job postings.
https://github.com/amazon-science/spherical_diffusion_policy
[ICML 2025] Official implementation of Spherical Diffusion Policy: A SE(3) Equivariant Visuomotor Policy with Spherical Fourier Representation
https://github.com/amazon-science/omnimatch
OmniMatch: Joinability Discovery in Data Products
https://github.com/amazon-science/wqa-multi-sentence-inference
This repository contains code used for our Multi Sentence Inference NAACL'22 paper.
https://github.com/amazon-science/repoformer
Repoformer: Selective Retrieval for Repository-Level Code Completion (ICML 2024)
https://github.com/amazon-science/dq-bart
DQ-BART: Efficient Sequence-to-Sequence Model via Joint Distillation and Quantization (ACL 2022)
https://github.com/amazon-science/boon
Datasets and code for results presented in the BOON paper
https://github.com/amazon-science/street-reasoning
STREET: a multi-task and multi-step reasoning dataset
https://github.com/amazon-science/transformers-data-augmentation
Code associated with the "Data Augmentation using Pre-trained Transformer Models" paper
https://github.com/amazon-science/isometric-slt
Isometric Spoken Language Translation (Isometric SLT).