Science Score: 57.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 7 DOI reference(s) in README
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (6.5%) to scientific vocabulary
Last synced: 10 months ago · JSON representation ·

Repository

Basic Info
  • Host: GitHub
  • Owner: ormeilu
  • Language: Python
  • Default Branch: master
  • Size: 105 KB
Statistics
  • Stars: 0
  • Watchers: 1
  • Forks: 0
  • Open Issues: 0
  • Releases: 1
Created about 1 year ago · Last pushed about 1 year ago
Metadata Files
Readme Citation

README.md

Translating erwanlc/cocktails_recipe

This project translates the erwanlc/cocktails_recipe dataset into Russian, including converting measurements to metric units and ensuring high-quality translations for cocktail names, ingredients, and instructions.

Datasets

Final Dataset: toiletsandpaper/cocktailsreciperu

  • Description: Full translation of the original dataset (except 2 null rows) using hand-written measurement conversions and LLM translations.
  • LLM Used: gemma-3-27b-it (vLLM, bfloat16)
  • Inference Engine: GPUStack with vLLM backend
  • Hardware: 4 H100 80GB GPUs (4 LLM replicas)
  • Size: 6,954 rows, 1.05 MB
  • Format: Parquet
  • DOI: 10.57967/hf/5395
  • License: MIT
  • Xet-Enabled: This dataset is Xet-enabled on Hugging Face, allowing efficient version control and collaboration.

Small Dataset: toiletsandpaper/cocktailsreciperu_small

  • Description: Translation of the first 1,000 rows of the original dataset using hand-written conversions and LLM translations.
  • LLM Used: gemma-3-27b-it-qat
  • Inference Engine: LM Studio
  • Hardware: 1 RTX 3090 GPU
  • Size: 1,000 rows, 98 kB
  • Format: Parquet
  • DOI: 10.57967/hf/5375
  • License: MIT
  • Xet-Enabled: This dataset is Xet-enabled on Hugging Face, allowing efficient version control and collaboration.

Citation

If you use this project, please cite it as follows:

yaml @software{Lubenets_Translating_Cocktails_to_2025, author = {Lubenets, Ilya and Gunko, Maxim}, doi = {10.5281/zenodo.15367996}, license = {MIT}, month = may, title = {{Translating Cocktails to Russian using LLMs}}, url = {https://github.com/toiletsandpaper/cocktails_translating}, version = {1.0.0}, year = {2025} }

Owner

  • Name: Ilya Lubenets
  • Login: ormeilu
  • Kind: user
  • Location: Rostov-on-Don

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If you use this software, please cite it as below."
authors:
- family-names: "Lubenets"
  given-names: "Ilya"
  orcid: "https://orcid.org/0009-0005-9507-6669"
- family-names: "Gunko"
  given-names: "Maxim"
  orcid: "https://orcid.org/0009-0009-4812-0291"
title: "Translating Cocktails to Russian using LLMs"
version: 1.0.0
doi: 10.5281/zenodo.15367996
date-released: 2025-05-08
url: "https://github.com/toiletsandpaper/cocktails_translating"
license: MIT

GitHub Events

Total
Last Year

Issues and Pull Requests

Last synced: 10 months ago

All Time
  • Total issues: 0
  • Total pull requests: 5
  • Average time to close issues: N/A
  • Average time to close pull requests: 1 day
  • Total issue authors: 0
  • Total pull request authors: 2
  • Average comments per issue: 0
  • Average comments per pull request: 0.6
  • Merged pull requests: 5
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 0
  • Pull requests: 5
  • Average time to close issues: N/A
  • Average time to close pull requests: 1 day
  • Issue authors: 0
  • Pull request authors: 2
  • Average comments per issue: 0
  • Average comments per pull request: 0.6
  • Merged pull requests: 5
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
Pull Request Authors
  • ormeilu (4)
  • oceanofurtears (1)
Top Labels
Issue Labels
Pull Request Labels

Dependencies

pyproject.toml pypi
  • datasets >=3.5.1
  • duckdb >=1.2.2
  • huggingface-hub >=0.30.2
  • langfuse >=2.60.3
  • openai >=1.77.0
  • polars >=1.29.0
  • pydantic >=2.11.4
  • pydantic-settings >=2.9.1
  • structlog >=25.3.0
  • transformers >=4.51.3
uv.lock pypi
  • aiohappyeyeballs 2.6.1
  • aiohttp 3.11.18
  • aiosignal 1.3.2
  • annotated-types 0.7.0
  • anyio 4.9.0
  • attrs 25.3.0
  • backoff 2.2.1
  • barometer-chat 0.1.0
  • cattrs 24.1.3
  • certifi 2025.4.26
  • charset-normalizer 3.4.2
  • click 8.1.8
  • colorama 0.4.6
  • datasets 3.6.0
  • dill 0.3.8
  • distro 1.9.0
  • docstring-to-markdown 0.17
  • docutils 0.21.2
  • duckdb 1.2.2
  • filelock 3.18.0
  • frozenlist 1.6.0
  • fsspec 2025.3.0
  • h11 0.16.0
  • hf-xet 1.1.0
  • httpcore 1.0.9
  • httpx 0.28.1
  • huggingface-hub 0.31.1
  • idna 3.10
  • importlib-metadata 8.7.0
  • itsdangerous 2.2.0
  • jedi 0.19.2
  • jiter 0.9.0
  • langfuse 2.60.4
  • lsprotocol 2023.0.1
  • marimo 0.13.6
  • markdown 3.8
  • markdown-it-py 3.0.0
  • mdurl 0.1.2
  • multidict 6.4.3
  • multiprocess 0.70.16
  • narwhals 1.38.1
  • numpy 2.2.5
  • openai 1.77.0
  • packaging 24.2
  • pandas 2.2.3
  • parso 0.8.4
  • pluggy 1.5.0
  • polars 1.29.0
  • propcache 0.3.1
  • psutil 7.0.0
  • pyarrow 20.0.0
  • pycrdt 0.11.1
  • pydantic 2.11.4
  • pydantic-core 2.33.2
  • pydantic-settings 2.9.1
  • pygments 2.19.1
  • pymdown-extensions 10.15
  • python-dateutil 2.9.0.post0
  • python-dotenv 1.1.0
  • python-lsp-jsonrpc 1.1.2
  • python-lsp-ruff 2.2.2
  • python-lsp-server 1.12.2
  • pytz 2025.2
  • pyyaml 6.0.2
  • regex 2024.11.6
  • requests 2.32.3
  • rich 14.0.0
  • ruff 0.11.8
  • safetensors 0.5.3
  • six 1.17.0
  • sniffio 1.3.1
  • sqlglot 26.16.4
  • starlette 0.46.2
  • structlog 25.3.0
  • tokenizers 0.21.1
  • tomlkit 0.13.2
  • tqdm 4.67.1
  • transformers 4.51.3
  • typing-extensions 4.13.2
  • typing-inspection 0.4.0
  • tzdata 2025.2
  • ujson 5.10.0
  • urllib3 2.4.0
  • uvicorn 0.34.2
  • websockets 15.0.1
  • wrapt 1.17.2
  • xxhash 3.5.0
  • yarl 1.20.0
  • zipp 3.21.0