Krnjaic, A., Steleac, R., Thomas, J., Papoudakis, G., Schäfer, L., Wing Keung To, A., Lao, K., Cubuktepe, M., Haley, M., Börsting, P., & Albrecht, S. Scalable Multi-Agent Reinforcement Learning for Warehouse Logistics with Robotic and Human Co-Workers [Conference paper]. https://arxiv.org/abs/2212.11498