gitmyhub

mixture-of-experts

Python ★ 862 updated 2y ago

A Pytorch implementation of Sparsely-Gated Mixture of Experts, for massively increasing the parameter count of language models

No plain-English explanation yet — one is being written right now. Check back in a minute.