Jiawei, Z. GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection (Version 1.0.0) [Computer software]. https://arxiv.org/abs/2403.03507