https://github.com/deeprec-ai/deeprec

DeepRec is a high-performance recommendation deep learning framework based on TensorFlow. It is hosted in incubation in LF AI & Data Foundation.

Science Score: 23.0%

This score indicates how likely this project is to be science-related based on various indicators:

○
CITATION.cff file
✓
codemeta.json file
Found codemeta.json file
○
.zenodo.json file
○
DOI references
○
Academic publication links
✓
Committers with academic emails
84 of 2395 committers (3.5%) from academic institutions
○
Institutional organization owner
○
JOSS paper metadata
○
Scientific vocabulary similarity
Low similarity (9.9%) to scientific vocabulary

Keywords

advertising deep-learning distributed-training machine-learning python recommendation-engine scalability search-engine

Keywords from Contributors

deep-neural-networks distributed jax tensor autograd reinforcement-learning mxnet keras transformer research

Last synced: 4 months ago · JSON representation

Repository

DeepRec is a high-performance recommendation deep learning framework based on TensorFlow. It is hosted in incubation in LF AI & Data Foundation.

Basic Info

Host: GitHub
Owner: DeepRec-AI
License: apache-2.0
Language: C++
Default Branch: main
Homepage:
Size: 764 MB

Statistics

Stars: 1,099
Watchers: 35
Forks: 362
Open Issues: 91
Releases: 14

Topics

advertising deep-learning distributed-training machine-learning python recommendation-engine scalability search-engine

Created about 4 years ago · Last pushed about 1 year ago

Metadata Files

Readme Contributing License Code of conduct Governance Authors

Introduction

DeepRec is a high-performance recommendation deep learning framework based on TensorFlow 1.15, Intel-TensorFlow and NVIDIA-TensorFlow. It is hosted in incubation in LF AI & Data Foundation.

Background

Recommendation models have huge commercial values for areas such as retailing, media, advertisements, social networks and search engines. Unlike other kinds of models, recommendation models have large amount of non-numeric features such as id, tag, text and so on which lead to huge parameters.

DeepRec has been developed since 2016, which supports core businesses such as Taobao Search, recommendation and advertising. It precipitates a list of features on basic frameworks and has excellent performance in recommendation models training and inference. So far, in addition to Alibaba Group, dozens of companies have used DeepRec in their business scenarios.

Key Features

DeepRec has super large-scale distributed training capability, supporting recommendation model training of trillion samples and over ten trillion parameters. For recommendation models, in-depth performance optimization has been conducted across CPU and GPU platform. It contains list of features to improve usability and performance for super-scale scenarios.

Embedding & Optimizer

Embedding Variable.
Dynamic Dimension Embedding Variable.
Adaptive Embedding Variable.
Multiple Hash Embedding Variable.
Multi-tier Hybrid Embedding Storage.
Group Embedding.
AdamAsync Optimizer.
AdagradDecay Optimizer.

Training

Asynchronous Distributed Training Framework (Parameter Server), such as grpc+seastar, FuseRecv, StarServer etc.
Synchronous Distributed Training Framework (Collective), such as HybridBackend, Sparse Operation Kits (SOK) etc.
Runtime Optimization, such as Graph Aware Memory Allocator (GAMMA), Critical-path based Executor etc.
Runtime Optimization (GPU), GPU Multi-Stream Engine which support multiple CUDA compute stream and CUDA Graph.
Operator level optimization, such as BF16 mixed precision optimization, embedding operator optimization and EmbeddingVariable on PMEM and GPU, new hardware feature enabling, etc.
Graph level optimization, such as AutoGraphFusion, SmartStage, AutoPipeline, Graph Template Engine, Sample-awared Graph Compression, MicroBatch etc.
Compilation optimization, support BladeDISC, XLA etc.

Deploy and Serving

Delta checkpoint loading and exporting.
Super-scale recommendation model distributed serving.
Multi-tier hybrid storage and multi backend supported.
Online deep learning with low latency.
High performance inference framework SessionGroup (share-nothing), with multiple threadpool and multiple CUDA stream supported.
Model Quantization.

Installation

Prepare for installation

CPU Platform

alideeprec/deeprec-build:deeprec-dev-cpu-py38-ubuntu20.04

GPU Platform

alideeprec/deeprec-build:deeprec-dev-gpu-py38-cu116-ubuntu20.04

How to Build

Configure $ ./configure Compile for CPU and GPU defaultly $ bazel build -c opt --config=opt //tensorflow/tools/pip_package:build_pip_package Compile for CPU and GPU: ABI=0 $ bazel build --cxxopt="-D_GLIBCXX_USE_CXX11_ABI=0" --host_cxxopt="-D_GLIBCXX_USE_CXX11_ABI=0" -c opt --config=opt //tensorflow/tools/pip_package:build_pip_package Compile for CPU optimization: oneDNN + Unified Eigen Thread pool $ bazel build -c opt --config=opt --config=mkl_threadpool //tensorflow/tools/pip_package:build_pip_package Compile for CPU optimization and ABI=0 $ bazel build --cxxopt="-D_GLIBCXX_USE_CXX11_ABI=0" --host_cxxopt="-D_GLIBCXX_USE_CXX11_ABI=0" -c opt --config=opt --config=mkl_threadpool //tensorflow/tools/pip_package:build_pip_package

Create whl package

$ ./bazel-bin/tensorflow/tools/pip_package/build_pip_package /tmp/tensorflow_pkg

Install whl package

$ pip3 install /tmp/tensorflow_pkg/tensorflow-1.15.5+${version}-cp38-cp38m-linux_x86_64.whl

Latest Release Images

Image for CPU

alideeprec/deeprec-release:deeprec2402-cpu-py38-ubuntu20.04

Image for GPU CUDA11.6

alideeprec/deeprec-release:deeprec2402-gpu-py38-cu116-ubuntu20.04

Continuous Build Status

Official Build

| Build Type | Status | | ------------- | ------------------------------------------------------------ | | Linux CPU | | | Linux GPU | | | Linux CPU Serving | | | Linux GPU Serving | |

Official Unit Tests

| Unit Test Type | Status | | -------------- | ------ | | Linux CPU C | | | Linux CPU CC | | | Linux CPU Contrib | | | Linux CPU Core | | | Linux CPU Examples | | | Linux CPU Java | | | Linux CPU JS | | | Linux CPU Python | | | Linux CPU Stream Executor | | | Linux GPU C | | | Linux GPU CC | | | Linux GPU Contrib | | | Linux GPU Core | | | Linux GPU Examples | | | Linux GPU Java | | | Linux GPU JS | | | Linux GPU Python | | | Linux GPU Stream Executor | | | Linux CPU Serving UT | | | Linux GPU Serving UT | |

User Document

Chinese: https://deeprec.readthedocs.io/zh/latest/

English: https://deeprec.readthedocs.io/en/latest/

Contact Us

Join the Official Discussion Group on DingTalk

Join the Official Discussion Group on WeChat

License

Apache License 2.0

Owner

Name: DeepRec-AI
Login: DeepRec-AI
Kind: organization

Repositories: 1
Profile: https://github.com/DeepRec-AI

GitHub Events

Total

Commit comment event: 1
Issues event: 5
Watch event: 90
Issue comment event: 7
Pull request review event: 6
Pull request event: 6
Fork event: 12

Last Year

Commit comment event: 1
Issues event: 5
Watch event: 90
Issue comment event: 7
Pull request review event: 6
Pull request event: 6
Fork event: 12

Committers

Last synced: 11 months ago

All Time

Total Commits: 59,124
Total Committers: 2,395
Avg Commits per committer: 24.686
Development Distribution Score (DDS): 0.725

Past Year

Commits: 9
Committers: 5
Avg Commits per committer: 1.8
Development Distribution Score (DDS): 0.444

Top Committers

Name	Email	Commits
A. Unique TensorFlower	g**r@t**g	16,283
A. Unique TensorFlower	n**y@t**g	1,340
Yong Tang	y**b@o**m	1,039
Derek Murray	m**y@g**m	866
Benoit Steiner	b**r@g**m	839
Gunhan Gulsoy	g**n@g**m	792
Sanjoy Das	s**y@g**m	712
Justin Lebar	j**r@g**m	654
Peter Hawkins	p**s@g**m	647
Shanqing Cai	c**s@g**m	645
Alexandre Passos	a**s@g**m	587
Eugene Brevdo	e**o@g**m	554
Vijay Vasudevan	v**v@g**m	539
Allen Lavoie	a**l@g**m	485
Asim Shankar	a**r@g**m	444
Anna R	a**v@g**m	425
Martin Wicke	w**e@g**m	413
Dan Moldovan	m**n@g**m	383
Guangda Lai	l**d@g**m	366
Yifei Feng	y**f@g**m	343
Mark Daoust	m**t@g**m	341
Skye Wanderman-Milne	s**m@g**m	332
Jiri Simsa	j**a@g**m	331
Illia Polosukhin	i**n@g**m	306
Amit Patankar	a**r@g**m	304
Suharsh Sivakumar	s**s@g**m	302
Mihai Maruseac	m**c@g**m	290
Akshay Modi	n**i@g**m	283
Tongxuan Liu	t**x@a**m	281
terrytangyuan	t**n@g**m	281
and 2,365 more...

Committer Domains (Top 20 + Academic)

google.com: 423 intel.com: 61 nvidia.com: 45 qq.com: 28 us.ibm.com: 18 alibaba-inc.com: 15 163.com: 12 microsoft.com: 11 huawei.com: 9 ibm.com: 8 me.com: 8 126.com: 7 amd.com: 6 naver.com: 6 gmx.de: 6 mit.edu: 5 arm.com: 4 kth.se: 4 gatech.edu: 4 tensorflow.org: 4 pku.edu.cn: 3 uw.edu: 2 sjtu.edu.cn: 2 stanford.edu: 2 epfl.ch: 2 ntu.edu.tw: 2 student.ethz.ch: 2 utexas.edu: 2 ieee.org: 2 alum.mit.edu: 1 buaa.edu.cn: 1 bt.iitr.ac.in: 1 iisc.ac.in: 1 purdue.edu: 1 cern.ch: 1 stud.ntnu.no: 1 surrey.ac.uk: 1 goa.bits-pilani.ac.in: 1 cs.cornell.edu: 1 ucsd.edu: 1 mail.ustc.edu.cn: 1 duke.edu: 1 ucl.ac.uk: 1 uni-mainz.de: 1 tu-dortmund.de: 1 up.edu.ph: 1 bjfu.edu.cn: 1 cs.utexas.edu: 1 rutgers.edu: 1 njit.edu: 1 student.kit.edu: 1 trinity.edu: 1 gold.ac.uk: 1 ais.uni-bonn.de: 1 eecs.berkeley.edu: 1 gmu.edu: 1 alumni.harvard.edu: 1 columbia.edu: 1 drexel.edu: 1 alumni.stanford.edu: 1 nyu.edu: 1 cse.iitm.ac.in: 1 cs.cmu.edu: 1 uni-duesseldorf.de: 1 xs.ustb.edu.cn: 1 cornell.edu: 1 iith.ac.in: 1 ualberta.ca: 1 u.rochester.edu: 1 wisc.edu: 1 student.tue.nl: 1 appstate.edu: 1 cs.washington.edu: 1 klis.tsukuba.ac.jp: 1 bu.edu: 1 ee.columbia.edu: 1 imperial.ac.uk: 1 vt.edu: 1 iastate.edu: 1 uci.edu: 1 ku.edu.tr: 1

Issues and Pull Requests

Last synced: 6 months ago

All Time

Total issues: 17
Total pull requests: 101
Average time to close issues: 17 days
Average time to close pull requests: 17 days
Total issue authors: 12
Total pull request authors: 19
Average comments per issue: 1.35
Average comments per pull request: 0.33
Merged pull requests: 73
Bot issues: 0
Bot pull requests: 0

Past Year

Issues: 5
Pull requests: 8
Average time to close issues: about 2 hours
Average time to close pull requests: 4 days
Issue authors: 4
Pull request authors: 2
Average comments per issue: 0.4
Average comments per pull request: 0.25
Merged pull requests: 2
Bot issues: 0
Bot pull requests: 0

View more stats

Top Authors

Issue Authors

welsonzhang (5)
Lihengwannafly (4)
fuhailin (2)
liunianxuxie (2)
HH-66 (1)
haolujun (1)
Lunewcome (1)
WSX1211 (1)
kangna-qi (1)
zER0pAGe-1 (1)
linjiuning (1)
changqi1 (1)
gl-001 (1)
sysofai (1)
houjincheng1992 (1)

Pull Request Authors

JackMoriarty (21)
liutongxuan (19)
candyzone (17)
lixy9474 (16)
nvzhou (11)
Mesilenceki (8)
npt-1707 (6)
zonghua94 (5)
fuhailin (3)
LightWang4 (2)
yitongh (2)
Lyaction (2)
changqi1 (2)
Duyi-Wang (2)
hxbai (1)

Top Labels

Issue Labels

documentation (1) bug (1)

Pull Request Labels

bug (21) enhancement (12) documentation (12) performance (7) refactoring (5) cibuild (3) benchmark (2) demo (1) unittest (1)

Dependencies

.github/workflows/ubuntu18.04-py3.6-cibuild-build-serving-gpu.yaml actions

actions/checkout v2 composite

.github/workflows/ubuntu18.04-py3.6-cibuild-build-serving.yaml actions

actions/checkout v2 composite

.github/workflows/ubuntu18.04-py3.6-cibuild-build-wheel.yaml actions

actions/checkout v2 composite

.github/workflows/ubuntu18.04-py3.6-cibuild-c-unit-test.yaml actions

actions/checkout v2 composite

.github/workflows/ubuntu18.04-py3.6-cibuild-cc-unit-test.yaml actions

actions/checkout v2 composite

.github/workflows/ubuntu18.04-py3.6-cibuild-contrib-unit-test.yaml actions

actions/checkout v2 composite

.github/workflows/ubuntu18.04-py3.6-cibuild-core-unit-test.yaml actions

actions/checkout v2 composite

.github/workflows/ubuntu18.04-py3.6-cibuild-examples-unit-test.yaml actions

actions/checkout v2 composite

.github/workflows/ubuntu18.04-py3.6-cibuild-java-unit-test.yaml actions

actions/checkout v2 composite

.github/workflows/ubuntu18.04-py3.6-cibuild-js-unit-test.yaml actions

actions/checkout v2 composite

.github/workflows/ubuntu18.04-py3.6-cibuild-python-unit-test.yaml actions

actions/checkout v2 composite

.github/workflows/ubuntu18.04-py3.6-cibuild-serving-unit-test-gpu.yaml actions

actions/checkout v2 composite

.github/workflows/ubuntu18.04-py3.6-cibuild-serving-unit-test.yaml actions

actions/checkout v2 composite

.github/workflows/ubuntu18.04-py3.6-cibuild-stream_executor-unit-test.yaml actions

actions/checkout v2 composite

.github/workflows/ubuntu18.04-py3.6-cibuild-unit-test.yaml actions

actions/checkout v2 composite
aliyun/ack-set-context v1 composite

.github/workflows/ubuntu18.04-py3.6-cuda11.2-cibuild-build-wheel.yaml actions

actions/checkout v2 composite

.github/workflows/ubuntu18.04-py3.6-cuda11.2-cibuild-c-unit-test.yaml actions

actions/checkout v2 composite

.github/workflows/ubuntu18.04-py3.6-cuda11.2-cibuild-cc-unit-test.yaml actions

actions/checkout v2 composite

.github/workflows/ubuntu18.04-py3.6-cuda11.2-cibuild-contrib-unit-test.yaml actions

actions/checkout v2 composite

.github/workflows/ubuntu18.04-py3.6-cuda11.2-cibuild-core-unit-test.yaml actions

actions/checkout v2 composite

.github/workflows/ubuntu18.04-py3.6-cuda11.2-cibuild-examples-unit-test.yaml actions

actions/checkout v2 composite

.github/workflows/ubuntu18.04-py3.6-cuda11.2-cibuild-java-unit-test.yaml actions

actions/checkout v2 composite

.github/workflows/ubuntu18.04-py3.6-cuda11.2-cibuild-js-unit-test.yaml actions

actions/checkout v2 composite

.github/workflows/ubuntu18.04-py3.6-cuda11.2-cibuild-python-unit-test.yaml actions

actions/checkout v2 composite

.github/workflows/ubuntu18.04-py3.6-cuda11.2-cibuild-stream_executor-unit-test.yaml actions

actions/checkout v2 composite

.github/workflows/ubuntu18.04-py3.6-cuda11.2-cibuild-unit-test.yaml actions

actions/checkout v2 composite
aliyun/ack-set-context v1 composite

.github/workflows/ubuntu18.04-py3.6-modeltest.yaml actions

actions/checkout v2 composite
aliyun/ack-set-context v1 composite

tensorflow/examples/ios/benchmark/Podfile cocoapods

TensorFlow-experimental >= 0

tensorflow/examples/ios/camera/Podfile cocoapods

TensorFlow-experimental >= 0

tensorflow/examples/ios/simple/Podfile cocoapods

TensorFlow-experimental >= 0

tensorflow/lite/examples/ios/camera/Podfile cocoapods

TensorFlowLite = 1.13.1

tensorflow/lite/examples/ios/simple/Podfile cocoapods

TensorFlowLite = 1.13.1

tensorflow/lite/experimental/objc/apps/TestApp/Podfile cocoapods

TensorFlowLiteObjC >= 0

tensorflow/lite/experimental/swift/TestApp/Podfile cocoapods

TensorFlowLiteSwift >= 0

modelzoo/features/adagraddecay_optimizer/wide_and_deep/Dockerfile docker

ubuntu 18.04 build

modelzoo/features/embedding_variable/deepfm/Dockerfile docker

ubuntu 18.04 build

modelzoo/features/embedding_variable/wide_and_deep/Dockerfile docker

ubuntu 18.04 build

modelzoo/features/runtime/deepfm/Dockerfile docker

ubuntu 18.04 build

modelzoo/features/work_queue/wide_and_deep/Dockerfile docker

ubuntu 18.04 build

tensorflow/contrib/makefile/Dockerfile docker

ubuntu 16.04 build

tensorflow/java/maven/proto/pom.xml maven

com.google.protobuf:protobuf-java 3.16.3

tensorflow/java/maven/spark-tensorflow-connector/pom.xml maven

org.apache.hadoop:hadoop-yarn-api 2.7.3 provided
org.apache.spark:spark-core_2.11 2.4.5 provided
org.apache.spark:spark-mllib_2.11 2.4.5 provided
org.apache.spark:spark-sql_2.11 2.4.5 provided
org.tensorflow:tensorflow-hadoop 1.14.0
junit:junit 4.13.1 test
org.apache.spark:spark-mllib_2.11 2.4.5 test

tensorflow/java/maven/tensorflow/pom.xml maven

${project.groupId}:libtensorflow ${project.version}
${project.groupId}:libtensorflow_jni ${project.version}

tensorflow/java/maven/tensorflow-hadoop/pom.xml maven

com.google.protobuf:protobuf-java 3.16.3
org.apache.hadoop:hadoop-common 3.2.4
org.apache.hadoop:hadoop-mapreduce-client-core 3.2.4
org.tensorflow:proto 1.14.0
junit:junit 4.13.1 test
org.apache.hadoop:hadoop-mapreduce-client-jobclient 3.2.4 test

tensorflow/lite/java/demo/app/build.gradle maven

com.android.support.constraint:constraint-layout 1.0.2 implementation
com.android.support:appcompat-v7 25.2.0 implementation
com.android.support:design 25.2.0 implementation
com.android.support:support-annotations 25.3.1 implementation
com.android.support:support-v13 25.2.0 implementation
org.tensorflow:tensorflow-lite 0.0.0-nightly implementation
org.tensorflow:tensorflow-lite-gpu 0.0.0-nightly implementation
org.tensorflow:tensorflow-lite-local 0.0.0 implementation

tensorflow/lite/java/ovic/demo/app/build.gradle maven

com.android.support.constraint:constraint-layout 1.0.2 implementation
com.android.support:appcompat-v7 25.2.0 implementation
com.android.support:design 25.2.0 implementation
com.android.support:support-annotations 25.3.1 implementation
com.android.support:support-v13 25.2.0 implementation

docs/docs_en/requirements.txt pypi

docutils ==0.16
myst-parser *
sphinx *
sphinx_rtd_theme *

docs/docs_zh/requirements.txt pypi

docutils ==0.16
myst-parser *
sphinx *
sphinx_rtd_theme *

https://github.com/deeprec-ai/deeprec

Science Score: 23.0%

Keywords

Keywords from Contributors

Repository

Basic Info

Statistics

Topics

Metadata Files

README.md

Introduction

Background

Key Features

Embedding & Optimizer

Training

Deploy and Serving

Installation

Prepare for installation

How to Build

Create whl package

Install whl package

Latest Release Images

Image for CPU

Image for GPU CUDA11.6

Continuous Build Status

Official Build

Official Unit Tests

User Document

Contact Us

License

Owner

GitHub Events

Total

Last Year

Committers

All Time

Past Year

Top Committers

Committer Domains (Top 20 + Academic)

Issues and Pull Requests

All Time

Past Year

Top Authors

Issue Authors

Pull Request Authors

Top Labels

Issue Labels

Pull Request Labels

Dependencies