gitmyhub

SeeAct

Python · 0 stars · updated 1 year ago · fork

[ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large multimodal models (LMMs) such as GPT-4V(ision).
