13lack13ox logo
13lack13oxLabs
  • Home
  • Projects
  • Blog
  • Contact
<cd ../feed
how-to-train-really-large-models-on-many-gpus.log
Sep 23, 2021|src: lilianweng.github.io

How to Train Really Large Models on Many GPUs?

[Updated on 2022-03-13: add expert choice routing.]
[Updated on 2022-06-10]: Greg and I wrote a shorted and upgraded version of this post, published on OpenAI Blog: “Techniques for Training Large Neural Networks”

>open_source--originlilianweng.github.io13lack13ox Labs