https://github.com/abertsch72/lattice-generation
Code for Massive-scale Decoding for Text Generation using Lattices
Science Score: 10.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
○codemeta.json file
-
○.zenodo.json file
-
○DOI references
-
✓Academic publication links
Links to: arxiv.org -
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (9.1%) to scientific vocabulary
Last synced: 10 months ago
·
JSON representation
Repository
Code for Massive-scale Decoding for Text Generation using Lattices
Statistics
- Stars: 0
- Watchers: 2
- Forks: 0
- Open Issues: 0
- Releases: 0
Fork of jiacheng-xu/lattice-generation
Created over 3 years ago
· Last pushed almost 4 years ago
https://github.com/abertsch72/lattice-generation/blob/release/
# [Massive-scale Decoding for Text Generation using Lattices](https://arxiv.org/abs/2112.07660)
[Jiacheng Xu](https://jiacheng-xu.github.io/), Sid Reddy, [Greg Durrett](https://www.cs.utexas.edu/~gdurrett/)
TL;DR: a new search algorithm to construct lattices encoding many generation options;
two key technical contributions: (1) best-first search, (2) path recombination.
## Visualization
We provide a few examples in the ```vis``` folder and on [my homepage](https://www.cs.utexas.edu/~jcxu/data/summarization/). You need to download the html files to view and **interact** with the model outputs.
The complete set of outputs are available on [Box](https://utexas.box.com/s/wmvhg8lol3kvgirizqyiyiblbn6ogj1a).
## Getting started
- ```model``` contains all of the methods, including baselines like beam search, nucleus sampling, and our methods.
- ```evaluation``` contains scripts for evaluation.
- ```command``` are the prompts and shells we use to run the experiment.
Beam Search:
```
PYTHONPATH=./ python src/recom_search/scripts/run_pipeline.py -nexample 100 -ngram_suffix 4 -beam_size 16 -min_len 10 -max_len 35 -model bs
```
Best-first Search:
```
PYTHONPATH=./ python src/recom_search/scripts/run_pipeline.py -nexample 100 -ngram_suffix 4 -beam_size 16 -min_len 10 -max_len 35 -model astar_baseline
```
Best-first Search with Recomb:
```
PYTHONPATH=./ python src/recom_search/scripts/run_pipeline.py -nexample 100 -ngram_suffix 4 -beam_size 16 -min_len 10 -max_len 35 -model astar -merge imp -avg_score 0.75 -dfs_expand
```
Best-first Search with Zip:
```
PYTHONPATH=./ python src/recom_search/scripts/run_pipeline.py -nexample 100 -ngram_suffix 4 -beam_size 16 -min_len 10 -max_len 35 -model astar -merge zip -avg_score 0.75 -dfs_expand
```
More detailed instructions coming soon!
## Citation
```
@misc{xu-etal-2022-massive,
title={Massive-scale Decoding for Text Generation using Lattices},
author = {Xu, Jiacheng and Jonnalagadda, Siddhartha Reddy and Durrett, Greg},
year={2022},
eprint={2112.07660},
archivePrefix={arXiv},
primaryClass={cs.CL}
}
```
## Contact
jcxu@utexas.edu
Owner
- Name: Amanda Bertsch
- Login: abertsch72
- Kind: user
- Location: Pittsburgh, PA
- Repositories: 39
- Profile: https://github.com/abertsch72
MS student @ CMU LTI