generativeai_and_llm_odia
Science Score: 44.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
○DOI references
-
○Academic publication links
-
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (11.7%) to scientific vocabulary
Repository
Basic Info
- Host: GitHub
- Owner: OdiaGenAI
- License: other
- Language: Jupyter Notebook
- Default Branch: main
- Size: 2.29 MB
Statistics
- Stars: 29
- Watchers: 3
- Forks: 8
- Open Issues: 5
- Releases: 0
Metadata Files
README.md
Generative AI and LLM Initiative for the Odia Language
Table of contents
Latest Updates
- [12thApril2023] We released our first experiment Odia LLM odiagenAI-model-v0. Please go through our Blog for more details.
About
The Odia Generative AI (in short, OdiaGenAI) is an initiative to research Generative AI and Large Language Models (LLMs) for the low-resource Odia language.
Objective
The OdiaGenAI aims to
- Build pre-trained Odia LLM,
- Fine-tuned Odia LLM, and
- Instruct LLM (Odia).
The data, code, and models will be available to the public for research and non-commercial purposes.
Why OdiaGenAI
- First: Though many LLMs support multilingual, including Odia language, the performance for various tasks (e.g., content generation, question-answering) is limited due to the amount of ingested data for Odia.
Second: There is subscription or fees associated with the high-performing LLMs.
Third: The usage (privacy) and bias of data input to these LLMs are in question.
What are the focus research areas of OdiaGenAI
We have divided the primary focus areas into three parts.
1. Literature Survey: Investigate the latest developments in Generative AI and LLMs and analyze current methods to support the Odia language for different tasks.
2. Development: Developing pre-trained and fine-tuned Odia LLM, which includes dataset preparation, model training, evaluation, prompt engineering, and API development.
3. Deployment: Deploy the Odia LLM models for public access for research and non-commercial purposes.
Who can use OdiaGenAI LLMs
The models (pre-trained/fine-tuned) will be available through Hugging Face for research and non-commercial purposes. Feel free to contact us for a domain-specific application or particular use cases.
What are the use cases of OdiaGenAI LLMs
There are several use cases of OdiaGenAI LLMs. Three primary domains relating to Odisha which we are focusing to use the developed LLM are:
- Education
- Healthcare
- Governance
- Tourism
- Agriculture
- Industrial Application
Apps
Contributors
- Shantipriya Parida
- Sambit Sekhar
- Subhadarshi Panda
- Soumendra Kumar Sahoo
- Swateek Jena
- Abhijeet Parida
- Arghyadeep Sen
- Dr. Satya Ranjan Dash
- Deepak Kumar Pradhan
About our logo: The critically endangered Olive Ridley sea turtle is the world's smallest and most prevalent marine turtle. Travel thousands of kilometers in the ocean for nesting. The Gahirmatha Marine Sanctuary in Odisha is the largest known mass nesting rookery for olive ridley sea turtles worldwide.
Contact
Please contact Shantipriya Parida (shantipriya.parida@gmail.com) for any contribution/support/usage.
Supporters
Citation
If you find this repository useful, please consider giving ⭐ and citing:
@misc{OdiaGenAI,
author = {Shantipriya Parida and Sambit Sekhar and Subhadarshi Panda and Soumendra Kumar Sahoo and Swateek Jena and Abhijeet Parida and Arghyadeep Sen and Satya Ranjan Dash and Deepak Kumar Pradhan},
title = {OdiaGenAI: Generative AI and LLM Initiative for the Odia Language},
year = {2023},
publisher = {GitHub},
journal = {GitHub repository},
howpublished = {\url{https://github.com/shantipriyap/OdiaGenAI}},
}
License
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
Owner
- Name: OdiaGenAI
- Login: OdiaGenAI
- Kind: organization
- Email: odiagen.ai@gmail.com
- Location: India
- Website: https://www.odiagenai.org/
- Twitter: OdiaGenAI
- Repositories: 1
- Profile: https://github.com/OdiaGenAI
Generative AI and LLM Research for Odia and Indic Languages
Citation (CITATION.cff)
# This CITATION.cff file was generated with [cffinit](https://bit.ly/cffinit)
cff-version: 1.2.0
title: "OdiaGenAI: Generative AI and LLM Initiative for the Odia Language"
message: If you use this software, please cite it using these metadata.
type: generic
authors:
- given-names: Shantipriya
family-names: Parida
- given-names: Sambit
family-names: Sekhar
- given-names: Subhadarshi
family-names: Panda
- given-names: Soumendra Kumar
family-names: Sahoo
- given-names: Swateek
family-names: Jena
- given-names: Abhijeet
family-names: Parida
- given-names: Arghyadeep
family-names: Sen
- given-names: Satya Ranjan
family-names: Dash
- given-names: Deepak Kumar
family-names: Pradhan
- {}
identifiers:
- type: url
value: 'https://github.com/shantipriyap/OdiaGenAI'
repository-code: 'https://github.com/shantipriyap/OdiaGenAI'
license: CC-BY-NC-SA-4.0
GitHub Events
Total
- Watch event: 4
Last Year
- Watch event: 4
