Decoupled DiLoCo: Resilient, Distributed AI Training at Scale

(deepmind.google)

16 points | by metadat | 2 hours ago

3 comments