Decoupled DiLoCo: Resilient, Distributed AI Training at Scale

(deepmind.google)

48 points | by metadat 3 days ago

6 comments