gitmyhub

BrokenHill

Python ★ 168 updated 1y ago

A productionized greedy coordinate gradient (GCG) attack tool for large language models (LLMs)
