Wavelet: Efficient DNN Training with Tick-Tock Scheduling

https://mlsys.org/virtual/2021/oral/1586

  1. All-reduce

  2. Parameter server
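The two schemes above can be sketched in plain Python (my illustration, not code from the talk): `ring_all_reduce` follows the standard reduce-scatter + all-gather ring, while `parameter_server` does the same reduction through a central node:

```python
def ring_all_reduce(grads):
    """Ring all-reduce: reduce-scatter, then all-gather.
    grads: one gradient list per worker; for simplicity each of the
    n chunks is a single scalar, one chunk per worker."""
    n = len(grads)
    buf = [list(g) for g in grads]
    # Reduce-scatter: after n-1 steps, worker i holds the full sum of chunk (i+1) % n.
    for s in range(n - 1):
        for i in range(n):
            c = (i - s) % n
            buf[(i + 1) % n][c] += buf[i][c]
    # All-gather: circulate each completed chunk once around the ring.
    for s in range(n - 1):
        for i in range(n):
            c = (i + 1 - s) % n
            buf[(i + 1) % n][c] = buf[i][c]
    return buf

def parameter_server(grads):
    """Workers push gradients to one server; the server broadcasts the sum."""
    total = [sum(col) for col in zip(*grads)]
    return [list(total) for _ in grads]

grads = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]  # three workers
print(ring_all_reduce(grads))   # every worker ends with [12, 15, 18]
print(parameter_server(grads))  # same result, different traffic pattern
```

The ring keeps per-link traffic roughly constant as workers are added, which is why all-reduce usually scales better than funneling everything through one server.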

  • Why a new approach?

    • Prior schedulers work at the cluster level

    • They can introduce more resource fragmentation

    • They say nothing about utilization within a single task

  • A task does not use all of its GPU resources all the time

Gandiva:

  • Operates at the cluster level

  • Does not improve a single job's performance

  • A single job still takes the same time

  • Wavelet's idea instead: increase inter-batch parallelism
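A toy sketch of the interleaving idea (my illustration, made-up numbers): a task's GPU demand dips during gradient communication, so a second wave of tasks phase-shifted by half an iteration can fill the valley on the same GPU:

```python
ITER = 10  # timesteps per training iteration (hypothetical)

def mem_usage(t):
    """Toy per-task GPU memory profile: high during forward/backward,
    low while gradients are being communicated."""
    return 8 if (t % ITER) < ITER // 2 else 2

# One wave alone vs. two waves shifted by half an iteration ("tick" and "tock").
single_peak = max(mem_usage(t) for t in range(ITER))
ticktock_peak = max(mem_usage(t) + mem_usage(t + ITER // 2) for t in range(ITER))
print(single_peak, ticktock_peak)  # 8 10
```

The combined peak (10) is well under twice the single-wave peak (16) because the two waves' peaks never coincide; that headroom is what lets a second wave run without doubling the resource requirement.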

Gandiva:

  • Multi-job and single-job settings

PipeDream: focuses on minimizing the communication (only the activations and gradients at stage boundaries cross workers)
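A back-of-envelope sketch of that communication argument (hypothetical sizes, not numbers from the talk): data parallelism all-reduces every gradient each step, while pipeline parallelism only ships the activations crossing stage boundaries:

```python
# Hypothetical 4-layer model, one layer per pipeline stage.
layer_params = [1_000_000, 1_000_000, 1_000_000, 1_000_000]
boundary_activations = 4_096   # activation size at each stage boundary
batch = 32

# Data parallelism: every worker exchanges gradients for all parameters.
data_parallel_comm = sum(layer_params)

# Pipeline parallelism: only the 3 stage boundaries carry traffic,
# one activation tensor per sample in the batch.
pipeline_comm = boundary_activations * batch * 3

print(data_parallel_comm, pipeline_comm)
```

With these (made-up) sizes the pipeline moves roughly an order of magnitude less data per step; the actual ratio depends on model and batch shape.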

Question: which version of the model weights is read by an in-flight minibatch?
