https://github.com/baohaoliao/frac-cot

Science Score: 36.0%

This score indicates how likely this project is to be science-related based on various indicators:

○
CITATION.cff file
✓
codemeta.json file
Found codemeta.json file
✓
.zenodo.json file
Found .zenodo.json file
○
DOI references
✓
Academic publication links
Links to: arxiv.org
○
Academic email domains
○
Institutional organization owner
○
JOSS paper metadata
○
Scientific vocabulary similarity
Low similarity (12.3%) to scientific vocabulary

Last synced: 10 months ago · JSON representation

Repository

Basic Info

Host: GitHub
Owner: BaohaoLiao
License: apache-2.0
Language: Python
Default Branch: main
Size: 229 KB

Statistics

Stars: 0
Watchers: 1
Forks: 0
Open Issues: 0
Releases: 0

Created about 1 year ago · Last pushed about 1 year ago

Metadata Files

Readme License

# ⛓️ Fractured Chain-of-Thought Reasoning ⛓️ [![Paper](https://img.shields.io/badge/paper-A42C25?style=for-the-badge&logo=arxiv&logoColor=white)](https://arxiv.org/abs/2505.12992) [![Github](https://img.shields.io/badge/Fractured%20CoT-000000?style=for-the-badge&logo=github&logoColor=000&logoColor=white)](https://github.com/BaohaoLiao/frac-cot) [![Website](https://img.shields.io/badge/Blog-%23000000.svg?style=for-the-badge&logo=semanticweb&logoColor=white)](https://huggingface.co/spaces/Salesforce/Efficient-Reasoning)

🎉 News • 📢 Introduction • 🤝 Support

👷 Installation • 🔎 Evaluation • 📝 Citation

🎉News

[2025.05.21] We release the first version that mainly supports the pass@k experiments.

📢Introduction

We introduce Fractured Sampling, a unified inference-time strategy that interpolates between full CoT and solution-only sampling along three orthogonal axes: (1) the number of reasoning trajectories (n dimension), (2) the number of final solutions per trajectory (m dimension), and (3) the depth at which reasoning traces are truncated (H dimension).

Main Takeaways 1. 📈 The long-reasoning LLM is able to use truncated CoT to derive a correct solution. 2. 🌟 We can sample in 3D: the full-CoT (n dimensin), the solution (m dimension) and the truncated CoT (H dimension). The H dimension shows the steepest log-linear scaling gains in Pass@k. 3. 🚀 Sampling over all 3 dimensions offers the highest Pass@k. 4. 📊 We can use a process reward model to select the best solution among all 3D samplings, a ~10% accuracy improvement compared to only sampling in the n dimension. 5. 🧠 We can use the self-consistency property within the H dimension to early stop the genration, saving 20% tokens without sacrifying accuracy.

🤝Support

[x] Pass@k for single dimension and multiple dimensions.
[ ] Best-of-N accross multiple dimensions.
[ ] Early stopping for efficient generation.

👷Installation

bash conda create -n frac_cot python=3.10 conda activate frac_cot pip install -r requirements.txt

🔎Evaluation

Pass@k for single and multiple dimensions. bash # Generation bash ./scripts/gen.sh # Evaluate bash ./scriots/passk.sh

📝Citation

If you find our work useful, please cite as: @article{liao2025fractured, title={Fractured Chain-of-Thought Reasoning}, author={Liao, Baohao and Dong, Hanze and Xu, Yuhui and Sahoo, Doyen and Monz, Christof and Li, Junnan and Xiong, Caiming}, journal={arXiv preprint arXiv:2505.12992}, year={2025} }

Owner

Name: baohao
Login: BaohaoLiao
Kind: user
Location: Netherlands
Company: University of Amsterdam

Website: https://baohaoliao.github.io/
Repositories: 1
Profile: https://github.com/BaohaoLiao

PhD candidate @ltl-uva for NLP

GitHub Events

Total

Watch event: 4
Push event: 33
Create event: 2

Last Year

Watch event: 4
Push event: 33
Create event: 2

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Open Source Science