fuxictr

A configurable, tunable, and reproducible library for CTR prediction https://fuxictr.github.io

https://github.com/reczoo/fuxictr

Science Score: 77.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
    Found 9 DOI reference(s) in README
  • Academic publication links
    Links to: arxiv.org, acm.org
  • Committers with academic emails
    1 of 13 committers (7.7%) from academic institutions
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (10.6%) to scientific vocabulary

Keywords

ctr ctr-prediction cvr pytorch recommender-systems

Keywords from Contributors

interpretability standardization hack
Last synced: 6 months ago

Repository

A configurable, tunable, and reproducible library for CTR prediction https://fuxictr.github.io

Basic Info
  • Host: GitHub
  • Owner: reczoo
  • License: apache-2.0
  • Language: Python
  • Default Branch: main
  • Homepage:
  • Size: 2.31 MB
Statistics
  • Stars: 1,253
  • Watchers: 14
  • Forks: 199
  • Open Issues: 13
  • Releases: 0
Topics
ctr ctr-prediction cvr pytorch recommender-systems
Created over 4 years ago · Last pushed 8 months ago
Metadata Files
Readme Changelog License Citation

README.md


Click-through rate (CTR) prediction is a critical task for many industrial applications such as online advertising, recommender systems, and sponsored search. FuxiCTR provides an open-source library for CTR prediction, with key features in configurability, tunability, and reproducibility. We hope this project can promote reproducible research and benefit both researchers and practitioners in this field.

Key Features

  • Configurable: Both data preprocessing and models are modularized and configurable.

  • Tunable: Models can be automatically tuned through easy configurations.

  • Reproducible: All the benchmarks can be easily reproduced.

  • Extensible: New models can be added easily, with support for both the PyTorch and TensorFlow frameworks.
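To give a feel for the configuration-driven design, the snippet below is a minimal sketch of how an experiment is described in YAML rather than hard-coded in Python. It assumes PyYAML (already in requirements.txt); the keys shown are illustrative only and do not reproduce FuxiCTR's exact config schema.

```
# Minimal sketch of the configuration-driven idea: an experiment is described
# in YAML rather than hard-coded in Python. The keys below are illustrative
# only and are NOT FuxiCTR's exact config schema.
import yaml

model_config_text = """
DCN_test:
  model: DCN
  dataset_id: tiny_parquet
  learning_rate: 1.0e-3
  embedding_dim: 16
  batch_size: 128
  epochs: 1
"""

params = yaml.safe_load(model_config_text)["DCN_test"]
print(f"Running model={params['model']} on dataset={params['dataset_id']} "
      f"with lr={params['learning_rate']}, embedding_dim={params['embedding_dim']}")
```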

Model Zoo

| No | Publication | Model | Paper | Benchmark | Version |
|:---:|:---:|:---:|:---|:---:|:---:|
| :open_file_folder: **Feature Interaction Models** | | | | | |
| 1 | WWW'07 | LR | Predicting Clicks: Estimating the Click-Through Rate for New Ads :triangular_flag_on_post: Microsoft | :arrow_upper_right: | torch |
| 2 | ICDM'10 | FM | Factorization Machines | :arrow_upper_right: | torch |
| 3 | CIKM'13 | DSSM | Learning Deep Structured Semantic Models for Web Search using Clickthrough Data :triangular_flag_on_post: Microsoft | :arrow_upper_right: | torch |
| 4 | CIKM'15 | CCPM | A Convolutional Click Prediction Model | :arrow_upper_right: | torch |
| 5 | RecSys'16 | FFM | Field-aware Factorization Machines for CTR Prediction :triangular_flag_on_post: Criteo | :arrow_upper_right: | torch |
| 6 | RecSys'16 | DNN | Deep Neural Networks for YouTube Recommendations :triangular_flag_on_post: Google | :arrow_upper_right: | torch, tf |
| 7 | DLRS'16 | Wide&Deep | Wide & Deep Learning for Recommender Systems :triangular_flag_on_post: Google | :arrow_upper_right: | torch, tf |
| 8 | ICDM'16 | PNN | Product-based Neural Networks for User Response Prediction | :arrow_upper_right: | torch |
| 9 | KDD'16 | DeepCrossing | Deep Crossing: Web-Scale Modeling without Manually Crafted Combinatorial Features :triangular_flag_on_post: Microsoft | :arrow_upper_right: | torch |
| 10 | NIPS'16 | HOFM | Higher-Order Factorization Machines | :arrow_upper_right: | torch |
| 11 | IJCAI'17 | DeepFM | DeepFM: A Factorization-Machine based Neural Network for CTR Prediction :triangular_flag_on_post: Huawei | :arrow_upper_right: | torch, tf |
| 12 | SIGIR'17 | NFM | Neural Factorization Machines for Sparse Predictive Analytics | :arrow_upper_right: | torch |
| 13 | IJCAI'17 | AFM | Attentional Factorization Machines: Learning the Weight of Feature Interactions via Attention Networks | :arrow_upper_right: | torch |
| 14 | ADKDD'17 | DCN | Deep & Cross Network for Ad Click Predictions :triangular_flag_on_post: Google | :arrow_upper_right: | torch, tf |
| 15 | WWW'18 | FwFM | Field-weighted Factorization Machines for Click-Through Rate Prediction in Display Advertising :triangular_flag_on_post: Oath, TouchPal, LinkedIn, Alibaba | :arrow_upper_right: | torch |
| 16 | KDD'18 | xDeepFM | xDeepFM: Combining Explicit and Implicit Feature Interactions for Recommender Systems :triangular_flag_on_post: Microsoft | :arrow_upper_right: | torch |
| 17 | CIKM'19 | FiGNN | FiGNN: Modeling Feature Interactions via Graph Neural Networks for CTR Prediction | :arrow_upper_right: | torch |
| 18 | CIKM'19 | AutoInt/AutoInt+ | AutoInt: Automatic Feature Interaction Learning via Self-Attentive Neural Networks | :arrow_upper_right: | torch |
| 19 | RecSys'19 | FiBiNET | FiBiNET: Combining Feature Importance and Bilinear feature Interaction for Click-Through Rate Prediction :triangular_flag_on_post: Sina Weibo | :arrow_upper_right: | torch |
| 20 | WWW'19 | FGCNN | Feature Generation by Convolutional Neural Network for Click-Through Rate Prediction :triangular_flag_on_post: Huawei | :arrow_upper_right: | torch |
| 21 | AAAI'19 | HFM/HFM+ | Holographic Factorization Machines for Recommendation | :arrow_upper_right: | torch |
| 22 | Arxiv'19 | DLRM | Deep Learning Recommendation Model for Personalization and Recommendation Systems :triangular_flag_on_post: Facebook | :arrow_upper_right: | torch |
| 23 | NeuralNetworks'20 | ONN | Operation-aware Neural Networks for User Response Prediction | :arrow_upper_right: | torch, tf |
| 24 | AAAI'20 | AFN/AFN+ | Adaptive Factorization Network: Learning Adaptive-Order Feature Interactions | :arrow_upper_right: | torch |
| 25 | AAAI'20 | LorentzFM | Learning Feature Interactions with Lorentzian Factorization :triangular_flag_on_post: eBay | :arrow_upper_right: | torch |
| 26 | WSDM'20 | InterHAt | Interpretable Click-through Rate Prediction through Hierarchical Attention :triangular_flag_on_post: NEC Labs, Google | :arrow_upper_right: | torch |
| 27 | DLP-KDD'20 | FLEN | FLEN: Leveraging Field for Scalable CTR Prediction :triangular_flag_on_post: Tencent | :arrow_upper_right: | torch |
| 28 | CIKM'20 | DeepIM | Deep Interaction Machine: A Simple but Effective Model for High-order Feature Interactions :triangular_flag_on_post: Alibaba, RealAI | :arrow_upper_right: | torch |
| 29 | WWW'21 | FmFM | FM^2: Field-matrixed Factorization Machines for Recommender Systems :triangular_flag_on_post: Yahoo | :arrow_upper_right: | torch |
| 30 | WWW'21 | DCN-V2 | DCN V2: Improved Deep & Cross Network and Practical Lessons for Web-scale Learning to Rank Systems :triangular_flag_on_post: Google | :arrow_upper_right: | torch |
| 31 | CIKM'21 | DESTINE | Disentangled Self-Attentive Neural Networks for Click-Through Rate Prediction :triangular_flag_on_post: Alibaba | :arrow_upper_right: | torch |
| 32 | CIKM'21 | EDCN | Enhancing Explicit and Implicit Feature Interactions via Information Sharing for Parallel Deep CTR Models :triangular_flag_on_post: Huawei | :arrow_upper_right: | torch |
| 33 | DLP-KDD'21 | MaskNet | MaskNet: Introducing Feature-Wise Multiplication to CTR Ranking Models by Instance-Guided Mask :triangular_flag_on_post: Sina Weibo | :arrow_upper_right: | torch |
| 34 | SIGIR'21 | SAM | Looking at CTR Prediction Again: Is Attention All You Need? :triangular_flag_on_post: BOSS Zhipin | :arrow_upper_right: | torch |
| 35 | KDD'21 | AOANet | Architecture and Operation Adaptive Network for Online Recommendations :triangular_flag_on_post: Didi Chuxing | :arrow_upper_right: | torch |
| 36 | AAAI'23 | FinalMLP | FinalMLP: An Enhanced Two-Stream MLP Model for CTR Prediction :triangular_flag_on_post: Huawei | :arrow_upper_right: | torch |
| 37 | SIGIR'23 | FinalNet | FINAL: Factorized Interaction Layer for CTR Prediction :triangular_flag_on_post: Huawei | :arrow_upper_right: | torch |
| 38 | SIGIR'23 | EulerNet | EulerNet: Adaptive Feature Interaction Learning via Euler's Formula for CTR Prediction :triangular_flag_on_post: Huawei | :arrow_upper_right: | torch |
| 39 | CIKM'23 | GDCN | Towards Deeper, Lighter and Interpretable Cross Network for CTR Prediction :triangular_flag_on_post: Microsoft | | torch |
| 40 | ICML'24 | WuKong | Wukong: Towards a Scaling Law for Large-Scale Recommendation :triangular_flag_on_post: Meta | :arrow_upper_right: | torch |
| :open_file_folder: **Behavior Sequence Modeling** | | | | | |
| 42 | KDD'18 | DIN | Deep Interest Network for Click-Through Rate Prediction :triangular_flag_on_post: Alibaba | :arrow_upper_right: | torch |
| 43 | AAAI'19 | DIEN | Deep Interest Evolution Network for Click-Through Rate Prediction :triangular_flag_on_post: Alibaba | :arrow_upper_right: | torch |
| 44 | DLP-KDD'19 | BST | Behavior Sequence Transformer for E-commerce Recommendation in Alibaba :triangular_flag_on_post: Alibaba | :arrow_upper_right: | torch |
| 45 | CIKM'20 | DMIN | Deep Multi-Interest Network for Click-through Rate Prediction :triangular_flag_on_post: Alibaba | :arrow_upper_right: | torch |
| 46 | AAAI'20 | DMR | Deep Match to Rank Model for Personalized Click-Through Rate Prediction :triangular_flag_on_post: Alibaba | :arrow_upper_right: | torch |
| 47 | KDD'23 | TransAct | TransAct: Transformer-based Realtime User Action Model for Recommendation at Pinterest :triangular_flag_on_post: Pinterest | :arrow_upper_right: | torch |
| :open_file_folder: **Long Sequence Modeling** | | | | | |
| 48 | CIKM'20 | SIM | Search-based User Interest Modeling with Lifelong Sequential Behavior Data for Click-Through Rate Prediction :triangular_flag_on_post: Alibaba | | torch |
| 49 | DLP-KDD'22 | ETA | Efficient Long Sequential User Data Modeling for Click-Through Rate Prediction :triangular_flag_on_post: Alibaba | | torch |
| 50 | CIKM'22 | SDIM | Sampling Is All You Need on Modeling Long-Term User Behaviors for CTR Prediction :triangular_flag_on_post: Meituan | | torch |
| 51 | KDD'23 | TWIN | TWIN: TWo-stage Interest Network for Lifelong User Behavior Modeling in CTR Prediction at Kuaishou :triangular_flag_on_post: KuaiShou | | torch |
| 52 | KDD'25 | MIRRN | Multi-granularity Interest Retrieval and Refinement Network for Long-Term User Behavior Modeling in CTR Prediction :triangular_flag_on_post: Huawei | | torch |
| :open_file_folder: **Dynamic Weight Network** | | | | | |
| 53 | NeurIPS'22 | APG | APG: Adaptive Parameter Generation Network for Click-Through Rate Prediction :triangular_flag_on_post: Alibaba | :arrow_upper_right: | torch |
| 54 | KDD'23 | PPNet | PEPNet: Parameter and Embedding Personalized Network for Infusing with Personalized Prior Information :triangular_flag_on_post: KuaiShou | :arrow_upper_right: | torch |
| :open_file_folder: **Multi-Task Modeling** | | | | | |
| 55 | Arxiv'17 | ShareBottom | An Overview of Multi-Task Learning in Deep Neural Networks | | torch |
| 56 | KDD'18 | MMoE | Modeling Task Relationships in Multi-task Learning with Multi-Gate Mixture-of-Experts :triangular_flag_on_post: Google | | torch |
| 57 | RecSys'20 | PLE | Progressive Layered Extraction (PLE): A Novel Multi-Task Learning (MTL) Model for Personalized Recommendations :triangular_flag_on_post: Tencent | | torch |

Benchmarking

We have benchmarked FuxiCTR models on a set of open datasets; the datasets, running steps, and results are available in the BARS benchmark (https://github.com/reczoo/BARS).

Dependencies

FuxiCTR has the following dependencies:

  • python 3.9+
  • pytorch 1.10.0–2.1.2 (required for PyTorch models)
  • tensorflow 2.1 (required for TensorFlow models)

Please install the other required packages via `pip install -r requirements.txt`.

Quick Start

  1. Run the demo examples

    Examples are provided in the demo directory to show the basic usage of FuxiCTR. Users can run them to get started quickly and to understand the workflow; a small sketch of the underlying data flow follows the commands below.

```
cd demo
python example1_build_dataset_to_parquet.py
python example2_DeepFM_with_parquet_input.py
```
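For orientation, the toy snippet below mimics the Parquet-based data flow of the two demo scripts at a very small scale: write a tiny tabular CTR dataset to Parquet with pandas and read it back. The column names are made up for illustration; the real demo data and its feature schema ship with the repository. Parquet I/O with pandas requires pyarrow (or fastparquet) to be installed.

```
# Toy illustration of the Parquet-based data flow used by the demo scripts:
# write a tiny tabular CTR dataset to Parquet and read it back with pandas.
# Column names are hypothetical; requires pyarrow (or fastparquet).
import pandas as pd

df = pd.DataFrame({
    "label":   [1, 0, 0, 1],
    "user_id": ["u1", "u2", "u1", "u3"],
    "item_id": ["i10", "i11", "i12", "i10"],
    "price":   [9.9, 15.0, 7.5, 9.9],
})
df.to_parquet("tiny_ctr.parquet", index=False)
print(pd.read_parquet("tiny_ctr.parquet").head())
```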

  2. Run a model on tiny data

    Users can easily run each model in the model zoo following the commands below, which demonstrate running DCN. In addition, users can modify the dataset config and model config files to run on their own datasets or with new hyper-parameters. More details can be found in the model's README. A sketch for scripting several such runs follows the commands.

```
cd model_zoo/DCN/DCN_torch
python run_expid.py --expid DCN_test --gpu 0

# Change MODEL according to the target model name
cd model_zoo/MODEL
python run_expid.py --expid MODEL_test --gpu 0
```
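If you prefer to script several such runs instead of typing the commands one by one, the sketch below loops over a few model folders and invokes run_expid.py via subprocess. The folder names and experiment ids are illustrative only (some models, such as DCN, keep their torch version in a MODEL_torch subfolder); adjust them to the models you actually want to run.

```
# Sketch of scripting several runs: loop over a few model folders and launch
# run_expid.py in each one. Folder names are illustrative; adjust them to the
# model zoo layout (some models use a MODEL_torch subfolder).
import subprocess

for folder, expid in [("DCN/DCN_torch", "DCN_test"), ("DeepFM", "DeepFM_test")]:
    subprocess.run(
        ["python", "run_expid.py", "--expid", expid, "--gpu", "0"],
        cwd=f"model_zoo/{folder}",
        check=True,
    )
```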

  3. Run a model on benchmark datasets (e.g., Criteo)

Users can follow the benchmark section to get the benchmark datasets and the running steps for reproducing the existing results. Please see an example here: https://github.com/reczoo/BARS/tree/main/ranking/ctr/DCNv2/DCNv2_criteo_x1

  4. Implement a new model

The FuxiCTR library is designed to be modular, so that every component can be overridden by users according to their needs. In many cases, only the model class needs to be implemented for a new customized model. If the data preprocessing or data loader is not directly applicable, one can also plug in a new one through the core APIs. A concrete example is our model FinalMLP, published at AAAI 2023; a simplified sketch of the general pattern is shown below.
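To make the idea concrete, here is a deliberately simplified sketch of a custom CTR model written in plain PyTorch. It is not FuxiCTR's actual BaseModel API; the class name, the single shared embedding table, and the hyper-parameters are all illustrative stand-ins for the kind of model class you would implement.

```
# Simplified stand-in for a custom CTR model class (plain PyTorch, not the
# actual FuxiCTR BaseModel API). Names and hyper-parameters are illustrative.
import torch
import torch.nn as nn

class TinyCTRModel(nn.Module):
    def __init__(self, num_fields, vocab_size, embedding_dim=16, hidden_units=(64, 32)):
        super().__init__()
        # One shared embedding table for all categorical fields (a simplification).
        self.embedding = nn.Embedding(vocab_size, embedding_dim)
        layers, input_dim = [], num_fields * embedding_dim
        for units in hidden_units:
            layers += [nn.Linear(input_dim, units), nn.ReLU()]
            input_dim = units
        layers.append(nn.Linear(input_dim, 1))
        self.mlp = nn.Sequential(*layers)

    def forward(self, x):               # x: (batch, num_fields) of integer feature ids
        emb = self.embedding(x)         # (batch, num_fields, embedding_dim)
        logit = self.mlp(emb.flatten(start_dim=1))
        return torch.sigmoid(logit)     # predicted click probability

model = TinyCTRModel(num_fields=3, vocab_size=1000)
y_pred = model(torch.randint(0, 1000, (4, 3)))
loss = nn.functional.binary_cross_entropy(y_pred, torch.rand(4, 1))
print(y_pred.shape, loss.item())
```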

  5. Tune hyper-parameters of a model

FuxiCTR currently supports fast grid search of model hyper-parameters using multiple GPUs. The following example runs a grid search of 8 experiments on 4 GPUs; a sketch of the scheduling idea follows the command.

```
cd experiment
python run_param_tuner.py --config config/DCN_tiny_parquet_tuner_config.yaml --gpu 0 1 2 3 0 1 2 3
```
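The scheduling idea is simple: the hyper-parameter grid is expanded into individual experiments, and each worker process borrows one of the listed GPU ids while it runs an experiment. The sketch below illustrates that idea with the standard library only; it is not the run_param_tuner implementation, and the parameter grid is made up.

```
# Illustration of the multi-GPU grid-search scheduling idea (not FuxiCTR's
# actual run_param_tuner code): 8 hyper-parameter combinations are consumed by
# 4 workers, each borrowing a GPU id from a shared queue while it runs.
import itertools
import multiprocessing as mp

def run_experiment(args):
    params, gpu_queue = args
    gpu_id = gpu_queue.get()           # borrow a free GPU for this experiment
    try:
        print(f"[gpu {gpu_id}] training with {params}")
        # ... launch the real training here, e.g. a subprocess with --gpu gpu_id ...
    finally:
        gpu_queue.put(gpu_id)          # hand the GPU back to the pool

if __name__ == "__main__":
    grid = [{"lr": lr, "embedding_dim": d}
            for lr, d in itertools.product([1e-3, 5e-4], [16, 32, 64, 128])]   # 8 runs
    manager = mp.Manager()
    gpu_queue = manager.Queue()
    for gpu_id in (0, 1, 2, 3):
        gpu_queue.put(gpu_id)
    with mp.Pool(processes=4) as pool:
        pool.map(run_experiment, [(p, gpu_queue) for p in grid])
```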

🔥 Citation

If you use our code or benchmarks in your research, please cite the following two papers (see the Citation section below).

🙋 Discussion

You are welcome to join our WeChat group for questions and discussion about research and practice in recommender systems.

Scan QR code

Owner

  • Name: RECZOO
  • Login: reczoo
  • Kind: organization

Open Science by XUEPAI

Citation (CITATION)

@incollection{FuxiCTR,
  author    = {Jieming Zhu and
               Jinyang Liu and
               Shuai Yang and
               Qi Zhang and
               Xiuqiang He},
  title     = {Open Benchmarking for Click-Through Rate Prediction},
  booktitle = {The 30th {ACM} International Conference on Information
               and Knowledge Management (CIKM'21)},
  pages     = {2759--2769},
  year      = {2021}
}

@incollection{BARS,
  author    = {Jieming Zhu and
               Quanyu Dai and
               Liangcai Su and
               Rong Ma and
               Jinyang Liu and
               Guohao Cai and
               Xi Xiao and
               Rui Zhang},
  title     = {BARS: Towards Open Benchmarking for Recommender Systems},
  booktitle = {The 45th International ACM SIGIR Conference on Research 
               and Development in Information Retrieval (SIGIR'22)},
  year      = {2022}
}

GitHub Events

Total
  • Issues event: 60
  • Watch event: 298
  • Issue comment event: 39
  • Push event: 24
  • Pull request event: 7
  • Fork event: 43
  • Create event: 5
Last Year
  • Issues event: 60
  • Watch event: 298
  • Issue comment event: 39
  • Push event: 24
  • Pull request event: 7
  • Fork event: 43
  • Create event: 5

Committers

Last synced: 9 months ago

All Time
  • Total Commits: 126
  • Total Committers: 13
  • Avg Commits per committer: 9.692
  • Development Distribution Score (DDS): 0.302
Past Year
  • Commits: 31
  • Committers: 6
  • Avg Commits per committer: 5.167
  • Development Distribution Score (DDS): 0.258
Top Committers
Name Email Commits
xpai x****i 88
zhujiem z****m 22
LiangcaiSu 3****u 3
github-actions[bot] 4****] 2
XiaoLongtaoo 9****o 2
Honghao Li 7****2 2
乳酸君、 r****j@p****g 1
lu-minous 4****s 1
leesoojin l****2@n****m 1
ccfco 5****o 1
Tian Zhen 6****Z 1
Serdarcan Dilbaz s****z@g****m 1
Dansheng 3****g 1

Issues and Pull Requests

Last synced: 6 months ago

All Time
  • Total issues: 55
  • Total pull requests: 14
  • Average time to close issues: 7 days
  • Average time to close pull requests: 19 days
  • Total issue authors: 33
  • Total pull request authors: 11
  • Average comments per issue: 1.33
  • Average comments per pull request: 1.43
  • Merged pull requests: 7
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 38
  • Pull requests: 6
  • Average time to close issues: 5 days
  • Average time to close pull requests: 2 days
  • Issue authors: 24
  • Pull request authors: 4
  • Average comments per issue: 0.87
  • Average comments per pull request: 0.5
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • zhujiem (16)
  • Sharan123 (5)
  • ywangwxd (3)
  • clementineyyy (2)
  • byby221b (2)
  • xpai (1)
  • Lcy1-1 (1)
  • AIyumeng (1)
  • XudxDon (1)
  • AdamLTy (1)
  • ChengQianlong (1)
  • quency711 (1)
  • rsliu94 (1)
  • Hizkai (1)
  • houWenK (1)
Pull Request Authors
  • salmon1802 (4)
  • sdilbaz (2)
  • ccfco (2)
  • LiangcaiSu (2)
  • XiaoLongtaoo (2)
  • nilozdemir (2)
  • lu-minous (2)
  • rsliu94 (2)
  • Ethan-TZ (2)
  • w32zhong (1)
  • samins (1)
  • FuatOgme (1)
  • milein4u (1)
Top Labels
Issue Labels
bug (1)
Pull Request Labels
bug (2)

Packages

  • Total packages: 1
  • Total downloads:
    • pypi: 646 last month
  • Total dependent packages: 0
  • Total dependent repositories: 2
  • Total versions: 31
  • Total maintainers: 1
pypi.org: fuxictr

A configurable, tunable, and reproducible library for CTR prediction

  • Versions: 31
  • Dependent Packages: 0
  • Dependent Repositories: 2
  • Downloads: 646 Last month
Rankings
Stargazers count: 2.4%
Forks count: 4.3%
Average: 8.7%
Dependent packages count: 10.1%
Dependent repos count: 11.6%
Downloads: 15.1%
Maintainers (1)
Last synced: 6 months ago

Dependencies

requirements.txt pypi
  • PyYAML *
  • h5py *
  • keras_preprocessing *
  • numpy *
  • pandas *
  • scikit-learn *
  • torch *
  • tqdm *
setup.py pypi
  • pandas *
.github/workflows/jupyter-book.yml actions
.github/workflows/pypi.yml actions
  • pypa/gh-action-pypi-publish release/v1 composite