gitmyhub

InfMoE

C++ ★ 41 updated 5y ago

Inference framework for MoE layers based on TensorRT with Python binding

No plain-English explanation yet — one is being written right now. Check back in a minute.