llm-from-scratch-main
Science Score: 44.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
○DOI references
-
○Academic publication links
-
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (0.2%) to scientific vocabulary
Last synced: 9 months ago
·
JSON representation
·
Repository
Basic Info
- Host: GitHub
- Owner: nishahariii
- License: mit
- Language: Jupyter Notebook
- Default Branch: main
- Size: 1.65 MB
Statistics
- Stars: 0
- Watchers: 0
- Forks: 0
- Open Issues: 0
- Releases: 0
Created 11 months ago
· Last pushed 11 months ago
Metadata Files
Readme
License
Citation
README.md
LLM-from-scratch-main
Owner
- Login: nishahariii
- Kind: user
- Repositories: 1
- Profile: https://github.com/nishahariii
Citation (CITATION.cff)
cff-version: 1.2.0
message: "If you use this book or its accompanying code, please cite it as follows."
title: "Build A Large Language Model (From Scratch), Published by Manning, ISBN 978-1633437166"
abstract: "This book provides a comprehensive, step-by-step guide to implementing a ChatGPT-like large language model from scratch in PyTorch."
date-released: 2024-09-12
authors:
- family-names: "Raschka"
given-names: "Sebastian"
license: "Apache-2.0"
url: "https://www.manning.com/books/build-a-large-language-model-from-scratch"
repository-code: "https://github.com/rasbt/LLMs-from-scratch"
keywords:
- large language models
- natural language processing
- artificial intelligence
- PyTorch
- machine learning
- deep learning
GitHub Events
Total
- Push event: 5
- Create event: 2
Last Year
- Push event: 5
- Create event: 2
Dependencies
setup/03_optional-docker-environment/.devcontainer/Dockerfile
docker
- pytorch/pytorch 2.5.0-cuda12.4-cudnn9-runtime build
ch02/02_bonus_bytepair-encoder/requirements-extra.txt
pypi
- requests *
- tqdm *
- transformers >=4.33.2
ch04/02_performance-analysis/requirements-extra.txt
pypi
- thop *
ch05/06_user_interface/requirements-extra.txt
pypi
- chainlit >=1.2.0
ch05/07_gpt_to_llama/requirements-extra.txt
pypi
- blobfile >=3.0.0
- huggingface_hub >=0.24.7
- ipywidgets >=8.1.2
- safetensors >=0.4.4
- sentencepiece >=0.1.99
ch05/07_gpt_to_llama/tests/test-requirements-extra.txt
pypi
- pytest >=8.1.1 test
- transformers >=4.44.2 test
ch06/03_bonus_imdb-classification/requirements-extra.txt
pypi
- scikit-learn >=1.3.0
- transformers >=4.33.2
ch06/04_user_interface/requirements-extra.txt
pypi
- chainlit >=1.2.0
ch07/02_dataset-utilities/requirements-extra.txt
pypi
- openai >=1.30.3
- scikit-learn >=1.3.1
- tqdm >=4.65.0
ch07/03_model-evaluation/requirements-extra.txt
pypi
- openai >=1.30.3
- tqdm >=4.65.0
ch07/05_dataset-generation/requirements-extra.txt
pypi
- openai >=1.30.3
- tqdm >=4.65.0
ch07/06_user_interface/requirements-extra.txt
pypi
- chainlit >=1.2.0
pyproject.toml
pypi
- jupyterlab >=4.0
- matplotlib >=3.7.1
- numpy >=1.26,<2.1
- pandas >=2.2.1
- pip >=25.0.1
- pytest >=8.3.5
- tensorflow >=2.18.0
- tiktoken >=0.5.1
- torch >=2.3.0
- tqdm >=4.66.1
requirements.txt
pypi
- jupyterlab >=4.0
- matplotlib >=3.7.1
- numpy >=1.26,<2.1
- pandas >=2.2.1
- psutil >=5.9.5
- tensorflow >=2.18.0
- tiktoken >=0.5.1
- torch >=2.3.0
- tqdm >=4.66.1