Updated 9 months ago

https://github.com/alpa-projects/mms • Science 13%

AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving (OSDI 23)