visualroberta
The first public Vietnamese visual-linguistic foundation model(s)
Updated 4 months ago
updown-baseline
Baseline model for the nocaps benchmark, from the ICCV 2019 paper "nocaps: novel object captioning at scale".