dodrio
Svelte
★ 372
updated 2y ago
Exploring attention weights in transformer-based models with linguistic knowledge.
No plain-English explanation yet — one is being written right now. Check back in a minute.