https://github.com/beomi/easy-lm-trainer
π€ μ΅μνμ μΈν μΌλ‘ LMμ νμ΅νκΈ° μν μνμ½λ
Science Score: 26.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
βCITATION.cff file
-
βcodemeta.json file
Found codemeta.json file -
β.zenodo.json file
Found .zenodo.json file -
βDOI references
-
βAcademic publication links
-
βCommitters with academic emails
-
βInstitutional organization owner
-
βJOSS paper metadata
-
βScientific vocabulary similarity
Low similarity (1.9%) to scientific vocabulary
Keywords
boilerplate
huggingface
language-model
transformers
Keywords from Contributors
interactive
projection
generic
sequences
archival
genomics
observability
autograding
hacking
shellcodes
Last synced: 5 months ago
·
JSON representation
Repository
π€ μ΅μνμ μΈν μΌλ‘ LMμ νμ΅νκΈ° μν μνμ½λ
Statistics
- Stars: 58
- Watchers: 2
- Forks: 8
- Open Issues: 2
- Releases: 0
Topics
boilerplate
huggingface
language-model
transformers
Created over 3 years ago
· Last pushed over 2 years ago
Metadata Files
Readme
README.md
Easy LM Trainer
Huggingface Transformersλ₯Ό μ¬μ©ν΄ LMμ νμ΅ν λ, λ¨μ LM(CLM) νμ΅ μμ μ λ³΄λ€ μ½κ² μμνκΈ° μν Boilerplate νλ‘μ νΈ.
νκ²½
- CPython 3.10+
- PyTorchλ CUDA νκ²½μ λ§κ² μ€μΉνκΈ° (1.12.1 μ΄μ)
bash
pip install -r requirements.txt
pip install -U deepspeed # Deepspeed branch νμ
μ€ν
bash
./train.sh
- μμ νκ²½μ RAM 1TB, GPU A100 40GB x4μ₯ νκ²½μμ μ€ν
- CUDA 11.6/11.7
- PyTorch 2.0
- KoAlpaca Datasetμ νμ΅ λ°μ΄ν°λ‘ μ¬μ©ν¨
- DeepSpeed ZeRO3, Optimizerμ Parameter λͺ¨λ CPU Offload
- Seq len 1024
- Max batch size 1 (per GPU)
- GPUλΉ μ½ 27GB vram μ¬μ© (= V100 32Gμμλ μ¬μ© κ°λ₯ν κ²μΌλ‘ μμ)
Owner
- Name: Junbum Lee
- Login: Beomi
- Kind: user
- Location: Seoul, South Korea
- Website: https://junbuml.ee
- Twitter: __Beomi__
- Repositories: 110
- Profile: https://github.com/Beomi
AI/ML GDE @ml-gde. Korean AI/NLP Researcher and creator of multiple Korean PLMs. Focused on advancing Open LLMs.
GitHub Events
Total
- Watch event: 1
Last Year
- Watch event: 1
Committers
Last synced: about 1 year ago
Top Committers
| Name | Commits | |
|---|---|---|
| Junbum Lee | j****n@b****t | 22 |
| dependabot[bot] | 4****] | 1 |
Committer Domains (Top 20 + Academic)
beomi.net: 1
Issues and Pull Requests
Last synced: 10 months ago
All Time
- Total issues: 3
- Total pull requests: 2
- Average time to close issues: 4 days
- Average time to close pull requests: 1 minute
- Total issue authors: 2
- Total pull request authors: 2
- Average comments per issue: 2.0
- Average comments per pull request: 0.0
- Merged pull requests: 2
- Bot issues: 0
- Bot pull requests: 1
Past Year
- Issues: 0
- Pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Issue authors: 0
- Pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
- park1200656 (2)
- sjhyeon2 (1)
Pull Request Authors
- dependabot[bot] (1)
- Beomi (1)
Top Labels
Issue Labels
Pull Request Labels
dependencies (1)