ZeRO-Infinity: Breaking the GPU Memory Wall for Extreme Scale Deep Learning
https://arxiv.org/pdf/2104.07857.pdf
PreviousZeRO-Infinity and DeepSpeed: Unlocking unprecedented model scale for deep learning trainingNextKungFu: Making Training inDistributed Machine Learning Adaptive
Was this helpful?