cocktails_translating
Science Score: 57.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
✓DOI references
Found 7 DOI reference(s) in README -
○Academic publication links
-
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (6.5%) to scientific vocabulary
Last synced: 10 months ago
·
JSON representation
·
Repository
Basic Info
- Host: GitHub
- Owner: ormeilu
- Language: Python
- Default Branch: master
- Size: 105 KB
Statistics
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
- Releases: 1
Created about 1 year ago
· Last pushed about 1 year ago
Metadata Files
Readme
Citation
README.md
Translating erwanlc/cocktails_recipe
This project translates the erwanlc/cocktails_recipe dataset into Russian, including converting measurements to metric units and ensuring high-quality translations for cocktail names, ingredients, and instructions.
Datasets
Final Dataset: toiletsandpaper/cocktailsreciperu
- Description: Full translation of the original dataset (except 2 null rows) using hand-written measurement conversions and LLM translations.
- LLM Used: gemma-3-27b-it (vLLM, bfloat16)
- Inference Engine: GPUStack with vLLM backend
- Hardware: 4 H100 80GB GPUs (4 LLM replicas)
- Size: 6,954 rows, 1.05 MB
- Format: Parquet
- DOI: 10.57967/hf/5395
- License: MIT
- Xet-Enabled: This dataset is Xet-enabled on Hugging Face, allowing efficient version control and collaboration.
Small Dataset: toiletsandpaper/cocktailsreciperu_small
- Description: Translation of the first 1,000 rows of the original dataset using hand-written conversions and LLM translations.
- LLM Used: gemma-3-27b-it-qat
- Inference Engine: LM Studio
- Hardware: 1 RTX 3090 GPU
- Size: 1,000 rows, 98 kB
- Format: Parquet
- DOI: 10.57967/hf/5375
- License: MIT
- Xet-Enabled: This dataset is Xet-enabled on Hugging Face, allowing efficient version control and collaboration.
Citation
If you use this project, please cite it as follows:
yaml
@software{Lubenets_Translating_Cocktails_to_2025,
author = {Lubenets, Ilya and Gunko, Maxim},
doi = {10.5281/zenodo.15367996},
license = {MIT},
month = may,
title = {{Translating Cocktails to Russian using LLMs}},
url = {https://github.com/toiletsandpaper/cocktails_translating},
version = {1.0.0},
year = {2025}
}
Owner
- Name: Ilya Lubenets
- Login: ormeilu
- Kind: user
- Location: Rostov-on-Don
- Repositories: 3
- Profile: https://github.com/ormeilu
Citation (CITATION.cff)
cff-version: 1.2.0 message: "If you use this software, please cite it as below." authors: - family-names: "Lubenets" given-names: "Ilya" orcid: "https://orcid.org/0009-0005-9507-6669" - family-names: "Gunko" given-names: "Maxim" orcid: "https://orcid.org/0009-0009-4812-0291" title: "Translating Cocktails to Russian using LLMs" version: 1.0.0 doi: 10.5281/zenodo.15367996 date-released: 2025-05-08 url: "https://github.com/toiletsandpaper/cocktails_translating" license: MIT
GitHub Events
Total
Last Year
Issues and Pull Requests
Last synced: 10 months ago
All Time
- Total issues: 0
- Total pull requests: 5
- Average time to close issues: N/A
- Average time to close pull requests: 1 day
- Total issue authors: 0
- Total pull request authors: 2
- Average comments per issue: 0
- Average comments per pull request: 0.6
- Merged pull requests: 5
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 0
- Pull requests: 5
- Average time to close issues: N/A
- Average time to close pull requests: 1 day
- Issue authors: 0
- Pull request authors: 2
- Average comments per issue: 0
- Average comments per pull request: 0.6
- Merged pull requests: 5
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
Pull Request Authors
- ormeilu (4)
- oceanofurtears (1)
Top Labels
Issue Labels
Pull Request Labels
Dependencies
pyproject.toml
pypi
- datasets >=3.5.1
- duckdb >=1.2.2
- huggingface-hub >=0.30.2
- langfuse >=2.60.3
- openai >=1.77.0
- polars >=1.29.0
- pydantic >=2.11.4
- pydantic-settings >=2.9.1
- structlog >=25.3.0
- transformers >=4.51.3
uv.lock
pypi
- aiohappyeyeballs 2.6.1
- aiohttp 3.11.18
- aiosignal 1.3.2
- annotated-types 0.7.0
- anyio 4.9.0
- attrs 25.3.0
- backoff 2.2.1
- barometer-chat 0.1.0
- cattrs 24.1.3
- certifi 2025.4.26
- charset-normalizer 3.4.2
- click 8.1.8
- colorama 0.4.6
- datasets 3.6.0
- dill 0.3.8
- distro 1.9.0
- docstring-to-markdown 0.17
- docutils 0.21.2
- duckdb 1.2.2
- filelock 3.18.0
- frozenlist 1.6.0
- fsspec 2025.3.0
- h11 0.16.0
- hf-xet 1.1.0
- httpcore 1.0.9
- httpx 0.28.1
- huggingface-hub 0.31.1
- idna 3.10
- importlib-metadata 8.7.0
- itsdangerous 2.2.0
- jedi 0.19.2
- jiter 0.9.0
- langfuse 2.60.4
- lsprotocol 2023.0.1
- marimo 0.13.6
- markdown 3.8
- markdown-it-py 3.0.0
- mdurl 0.1.2
- multidict 6.4.3
- multiprocess 0.70.16
- narwhals 1.38.1
- numpy 2.2.5
- openai 1.77.0
- packaging 24.2
- pandas 2.2.3
- parso 0.8.4
- pluggy 1.5.0
- polars 1.29.0
- propcache 0.3.1
- psutil 7.0.0
- pyarrow 20.0.0
- pycrdt 0.11.1
- pydantic 2.11.4
- pydantic-core 2.33.2
- pydantic-settings 2.9.1
- pygments 2.19.1
- pymdown-extensions 10.15
- python-dateutil 2.9.0.post0
- python-dotenv 1.1.0
- python-lsp-jsonrpc 1.1.2
- python-lsp-ruff 2.2.2
- python-lsp-server 1.12.2
- pytz 2025.2
- pyyaml 6.0.2
- regex 2024.11.6
- requests 2.32.3
- rich 14.0.0
- ruff 0.11.8
- safetensors 0.5.3
- six 1.17.0
- sniffio 1.3.1
- sqlglot 26.16.4
- starlette 0.46.2
- structlog 25.3.0
- tokenizers 0.21.1
- tomlkit 0.13.2
- tqdm 4.67.1
- transformers 4.51.3
- typing-extensions 4.13.2
- typing-inspection 0.4.0
- tzdata 2025.2
- ujson 5.10.0
- urllib3 2.4.0
- uvicorn 0.34.2
- websockets 15.0.1
- wrapt 1.17.2
- xxhash 3.5.0
- yarl 1.20.0
- zipp 3.21.0