https://github.com/aim-uofa/active-o3

ACTIVE-O3: Empowering Multimodal Large Language Models with Active Perception via GRPO