Updated 9 months ago
https://github.com/alpa-projects/mms
AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving (OSDI 23)