gitmyhub

VLAA-Thinking

Python ★ 149 updated 8mo ago

[TMLR 25] SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models

No plain-English explanation yet — one is being written right now. Check back in a minute.