VLAA-Thinking
Python
★ 149
updated 8mo ago
[TMLR 25] SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models
No plain-English explanation yet — one is being written right now. Check back in a minute.