Updated 5 months ago
https://github.com/aim-uofa/active-o3
ACTIVE-O3: Empowering Multimodal Large Language Models with Active Perception via GRPO
ACTIVE-O3: Empowering Multimodal Large Language Models with Active Perception via GRPO