Updated 9 months ago

https://github.com/ai4bharat/indic-tts • Science 23%

Text-to-Speech for languages of India

Updated 9 months ago

https://github.com/ai4bharat/indicinstruct • Science 10%

Code repository for "Introducing Airavata: Hindi Instruction-tuned LLM"

Updated 9 months ago

https://github.com/ai4bharat/setu • Science 13%

Setu is a comprehensive pipeline designed to clean, filter, and deduplicate diverse data sources, including web, PDF, and speech data. Built on Apache Spark, Setu comprises four key stages: document preparation, document cleaning and analysis, flagging and filtering, and deduplication.

Updated 9 months ago

https://github.com/ai4co/unsupervised-co-ucom2 • Science 23%

[ICML'24] Tackling Prevalent Conditions in Unsupervised Combinatorial Optimization: Cardinality, Minimum, Covering, and More

Updated 9 months ago

https://github.com/ai4co/rl • Science 23%

A modular, primitive-first, Python-first PyTorch library for reinforcement learning.

Updated 9 months ago

https://github.com/ai4er-cdt/earthquake-predictability • Science 23%

Codebase for the 2023 GTC Project on Earthquake Predictability