Planetballs S2 - Search News

S2 Reducer: High-Performance Sparse Communication to Accelerate Distributed Deep Learning

Abstract: Distributed stochastic gradient descent (SGD) approach has been widely used in large-scale deep learning, and the gradient collective method is vital to ensure the training scalability of ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

S2 Reducer: High-Performance Sparse Communication to Accelerate Distributed Deep Learning

Trending now