Updated 6 months ago
visualroberta
The first public Vietnamese visual linguistic foundation model(s)
Updated 4 months ago
https://github.com/lromul/gramtion
Twitter bot for generating photo descriptions (alt text)
Updated 6 months ago
https://github.com/ammarlodhi255/image-captioning-system-to-assist-the-blind
An image captioning system that is able to predict and speak out a caption of an image taken by visually impaired.
Updated 6 months ago
https://github.com/aehrc/cvt2distilgpt2
Improving Chest X-Ray Report Generation by Leveraging Warm-Starting
Updated 6 months ago
https://github.com/aehrc/cxrmate
CXRMate: Longitudinal Data and a Semantic Similarity Reward for Chest X-Ray Report Generation
Updated 6 months ago
https://github.com/aehrc/imageclefmedical_caption_23
MedICap: Code for the participation of team CSIRO at the ImageCLEFmedical Caption task of 2023.
Updated 6 months ago
https://github.com/atharvapathak/computer_vision_linear_regression_projects
This repository consists of some basic Computer Vision & Linear Regression Projects.
Updated 6 months ago
updown-baseline
Baseline model for nocaps benchmark, ICCV 2019 paper "nocaps: novel object captioning at scale".