https://github.com/ai4bharat/chitralekha
Chitralekha - A video transcreation platform for Indic languages, supporting transcription, translation and voice-over
Science Score: 13.0%
This score indicates how likely this project is to be science-related based on various indicators:
- ○ CITATION.cff file
- ✓ codemeta.json file: found codemeta.json file
- ○ .zenodo.json file
- ○ DOI references
- ○ Academic publication links
- ○ Academic email domains
- ○ Institutional organization owner
- ○ JOSS paper metadata
- ○ Scientific vocabulary similarity: low similarity (13.6%) to scientific vocabulary
Repository
Basic Info
- Host: GitHub
- Owner: AI4Bharat
- License: MIT
- Default Branch: master
- Homepage: https://chitralekha.ai4bharat.org/
- Size: 3.84 MB
Statistics
- Stars: 94
- Watchers: 11
- Forks: 20
- Open Issues: 3
- Releases: 12
Metadata Files
README.md
An open source video transcreation platform for Indic languages using ML models
Chitralekha is an open source platform for video transcreation across various Indic languages, backed by ML models (ASR for transcription, NMT for translation and TTS for voice-over).
Chitralekha supports multiple input sources (e.g. YouTube, local files), several ways of generating transcriptions (models, source captions, custom subtitle files, manual creation), and both model-based and manual generation of translations and voice-overs. Currently, Chitralekha supports voice-over only for single-speaker videos. Support for multi-speaker videos is under development.
Qualities of a good Subtitling Ecosystem
Challenges faced by Transcriptionists
Why Chitralekha?
Numerous informative videos are available online today, but most of them are available in only a few languages. This content becomes more useful when subtitles and voice-overs are created for it across various Indic languages. With millions of hours of video content, creating multilingual subtitles manually becomes impractical. This is where Chitralekha comes to the rescue.
Existing state-of-the-art ASR, translation and TTS models power the Chitralekha tool, which provides a platform for transcriptionists and translators to create multilingual subtitles at scale with high accuracy.
Goals
- Support all possible video sources and languages
- Build a reliable & scalable platform beneath Chitralekha
- Keep the UI simple and intuitive
Features of Chitralekha
Import from YouTube
Chitralekha supports importing videos and optional subtitles from YouTube. It also enables export of the subtitles in standard formats which can be used to update videos on YouTube.
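As an illustration of what exporting subtitles in a standard format can involve, here is a minimal sketch that serializes timestamped cards to SubRip (SRT) text. It assumes SRT is one of the export formats, which the text above does not confirm. The `Card` class and the `srt_timestamp` and `to_srt` helpers are hypothetical names for this example and are not part of Chitralekha's code.

```python
from dataclasses import dataclass


@dataclass
class Card:
    """Hypothetical subtitle card: start/end in seconds plus the text."""
    start: float
    end: float
    text: str


def srt_timestamp(seconds: float) -> str:
    """Format a time in seconds as the SRT timestamp HH:MM:SS,mmm."""
    total_ms = round(seconds * 1000)
    hours, rest = divmod(total_ms, 3_600_000)
    minutes, rest = divmod(rest, 60_000)
    secs, millis = divmod(rest, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{millis:03d}"


def to_srt(cards: list[Card]) -> str:
    """Serialize cards as numbered SRT blocks separated by blank lines."""
    blocks = []
    for index, card in enumerate(cards, start=1):
        blocks.append(
            f"{index}\n"
            f"{srt_timestamp(card.start)} --> {srt_timestamp(card.end)}\n"
            f"{card.text}\n"
        )
    return "\n".join(blocks)


if __name__ == "__main__":
    print(to_srt([Card(0.0, 1.5, "Hello"), Card(1.5, 3.0, "World")]))
```

A file written this way can be uploaded alongside a video on platforms that accept SRT captions.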
Translation Support
Chitralekha supports translating the transcription into English and the 12 Indian languages supported by the IndicTrans model. The plan is for this to eventually become a plug-and-play feature.
Transcription Support
Chitralekha supports transcribing the input video with IndicASR for English and 9 Indian languages. This automatically creates timestamped transcription cards, which can then be edited. The plan is for this to eventually become a plug-and-play feature.
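To make the idea of editable timestamped cards concrete, the following is a small sketch of how a card covering a given moment could be looked up and corrected. The `Card` structure and the `edit_card_at` helper are assumptions made for illustration; the actual data model used by Chitralekha is not described here.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class Card:
    """Hypothetical transcription card: start/end in seconds plus the text."""
    start: float
    end: float
    text: str


def edit_card_at(cards: list[Card], timestamp: float, new_text: str) -> list[Card]:
    """Return a new list in which the first card covering `timestamp` has `new_text`.

    A card covers a timestamp when start <= timestamp < end.
    Raises LookupError if no card covers the timestamp.
    """
    updated = []
    found = False
    for card in cards:
        if not found and card.start <= timestamp < card.end:
            updated.append(replace(card, text=new_text))
            found = True
        else:
            updated.append(card)
    if not found:
        raise LookupError(f"no card covers {timestamp} s")
    return updated


if __name__ == "__main__":
    cards = [Card(0.0, 2.0, "helo world"), Card(2.0, 4.5, "second line")]
    print(edit_card_at(cards, 1.0, "hello world"))
```

Returning a new list instead of mutating the cards in place keeps the original machine output available, for example to compare it with the edited version.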
Transliteration Support
Chitralekha supports editing the transcriptions both in the source and target languages in Roman characters with IndicXlit support.
Voice-over Support
Chitralekha supports voice-over generation for the translated subtitles of the input video, using IndicTTS for Indian languages. This automatically creates timestamped voice-over audio files, which can be changed by editing the subtitle text for the corresponding timestamp.
High Level Architecture
Overview and Demo Video
Cloning this master repo
git clone --recurse-submodules https://github.com/AI4Bharat/Chitralekha
Communication Forum
For information, help or discussion, use the following link:
https://github.com/AI4Bharat/Chitralekha/discussions
Code of Conduct
This project adheres to the Contributor Covenant code of conduct. By participating, you are expected to uphold this code. Please report unacceptable behavior to opensource@ai4bharat.org.
Owner
- Name: AI4Bhārat
- Login: AI4Bharat
- Kind: organization
- Email: opensource@ai4bharat.org
- Location: India
- Website: https://ai4bharat.org
- Twitter: AI4Bharat
- Repositories: 37
- Profile: https://github.com/AI4Bharat
Artificial-Intelligence-For-Bhārat : Building open-source AI solutions for India!
GitHub Events
Total
- Create event: 1
- Issues event: 4
- Release event: 1
- Watch event: 12
- Push event: 1
- Pull request event: 1
- Fork event: 6
Last Year
- Create event: 1
- Issues event: 4
- Release event: 1
- Watch event: 12
- Push event: 1
- Pull request event: 1
- Fork event: 6
Issues and Pull Requests
Last synced: 6 months ago
All Time
- Total issues: 7
- Total pull requests: 1
- Average time to close issues: N/A
- Average time to close pull requests: 4 months
- Total issue authors: 7
- Total pull request authors: 1
- Average comments per issue: 0.0
- Average comments per pull request: 0.0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 5
- Pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Issue authors: 5
- Pull request authors: 0
- Average comments per issue: 0.0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
- TheChesireCat (1)
- sandeshmore821 (1)
- NC969 (1)
- Artripti (1)
- kranti-up (1)
- Ramyadeviresearch (1)
- sravaniboyapati423-wq (1)
Pull Request Authors
- progwithraj (1)

