https://github.com/bytedance/cure

Science Score: 36.0%

This score indicates how likely this project is to be science-related based on various indicators:

○
CITATION.cff file
✓
codemeta.json file
Found codemeta.json file
✓
.zenodo.json file
Found .zenodo.json file
○
DOI references
✓
Academic publication links
Links to: arxiv.org
○
Academic email domains
○
Institutional organization owner
○
JOSS paper metadata
○
Scientific vocabulary similarity
Low similarity (13.9%) to scientific vocabulary

Last synced: 9 months ago · JSON representation

Repository

Basic Info

Host: GitHub
Owner: bytedance
License: apache-2.0
Language: Python
Default Branch: main
Size: 1.83 MB

Statistics

Stars: 0
Watchers: 0
Forks: 0
Open Issues: 0
Releases: 0

Created 10 months ago · Last pushed 10 months ago

Metadata Files

Readme License

CURE: Critical-Token-Guided Re-concatenation for Entropy-collapse Prevention

If you find this project useful, please give us a star 🌟.

🔥 News

2024.8.14: We’re excited to announce the release of CURE’s paper.
2024.8.22: We’re excited to announce the release of CURE’s model.

📚 Algorithm Overview

In Stage 1, given an input query $q$, the policy model produces a pool of candidate responses. We compute token-level entropy to identify critical tokens (high entropy), extract the clauses immediately preceding those tokens, append them to $q$ to form refined prompts, and query the model again. The newly generated responses are aggregated with the original ones and jointly optimized within a single group. In Stage 2, we continue training to translate the exploration bonus into realized performance.

📚 Experimental Results

Comparison of Entropy comparison of CURE-First-Stage and other methods at temperature 1.0.

CURE performs competitively compared with other algorithms. We report avg@32 for AIME24, AIME25, and AMC23 and avg@1 for others.

⚙️ Installation

Our code has been incorporated into VERL as a plugin, located in recipe-CURE

1. Prepare the environment

Exactly the same as the environment setup in verl, no additional configuration is required. In our actual workflow, we executed the following operations directly within the released image. bash cd CURE pip install --no-deps -e . pip install --no-deps git+https://github.com/=hiyouga/MathRuler.git pip install math_verify

2. Prepare the dataset and model

For training, simply download the dataset from DAPO-Math-17K and set its path as TRAIN_FILE in the startup script. We use Qwen-2.5-Math-Base as the training baseline; download it and set its path as CKPTS_DIR in the same script.

We also recommend downloading AIME-2024 and setting its path as TEST_FILE in the same script.

🚀 Train

1. First Stage Training

bash cd CURE sh ./recipe/CURE_First_Stage/run_cure_stage_1.sh

2. Second Stage Training

bash cd CURE sh ./recipe/CURE_Second_Stage/run_cure_stage_2.sh

💓 Acknowledgement

This project has been developed partially based on the following pioneering works on GitHub repositories. We express our profound gratitude for these foundational resources: - https://github.com/volcengine/verl - https://github.com/NVlabs/NFT - https://github.com/huggingface/Math-Verify

We would like to extend our special thanks to the following contributors @Qingbin Li, @Rongkun Xue, for their valuable contributions and support to this algorithm library.

🌏 Citation

bibtex @misc{li2025curecriticaltokenguidedreconcatenationentropycollapse, title={CURE: Critical-Token-Guided Re-concatenation for Entropy-collapse Prevention}, author={Qingbin Li and Rongkun Xue and Jie Wang and Ming Zhou and Zhi Li and Xiaofeng Ji and Yongqi Wang and Miao Liu and Zheming Yang and Minghui Qiu and Jing Yang}, year={2025}, eprint={2508.11016}, archivePrefix={arXiv}, primaryClass={cs.LG}, url={https://arxiv.org/abs/2508.11016}, }

🏷️ License

All code within this repository is under Apache License 2.0.

Owner

Name: Bytedance Inc.
Login: bytedance
Kind: organization
Location: Singapore

Website: https://opensource.bytedance.com
Twitter: ByteDanceOSS
Repositories: 255
Profile: https://github.com/bytedance

GitHub Events

Total

Watch event: 1
Public event: 1
Push event: 1

Last Year

Watch event: 1
Public event: 1
Push event: 1

Dependencies

docs/requirements-docs.txt pypi

myst_parser *
recommonmark *
sphinx-markdown-tables *
sphinx-rtd-theme *
tokenizers ==0.21

pyproject.toml pypi

requirements-npu.txt pypi

accelerate *
codetiming *
datasets *
dill *
einops *
hydra-core *
mathruler *
numpy *
pandas *
peft *
pyarrow >=15.0.0
pybind11 *
pylatexenc *
qwen_vl_utils *
ray ==2.46.0
tensordict <=0.6.2
torchdata *
torchvision ==0.20.1
transformers ==4.52.4
wandb *

requirements.txt pypi

accelerate *
codetiming *
datasets *
dill *
fastapi *
flash-attn *
hydra-core *
latex2sympy2_extended *
liger-kernel *
math_verify *
numpy *
packaging >=20.0
pandas *
peft *
pre-commit *
pyarrow >=19.0.0
pybind11 *
pylatexenc *
ray *
tensordict <=0.6.2
torchdata *
transformers *
uvicorn *
wandb *

requirements_sglang.txt pypi

accelerate *
codetiming *
datasets *
dill *
flash-attn *
huggingface_hub *
hydra-core *
numpy *
pandas *
peft *
pyarrow >=19.0.0
pybind11 *
pylatexenc *
ray >=2.10
sglang ==0.4.6.post5
tensordict <=0.6.2
torch-memory-saver >=0.0.5
torchdata *
torchvision *
transformers *
wandb *

setup.py pypi

accelerate *
codetiming *
datasets *
dill *
hydra-core *
numpy *
pandas *
peft *
pyarrow >=19.0.0
pybind11 *
pylatexenc *
ray *

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Open Source Science