Research Article

Pipelined Training with Stale Weights in Deep Convolutional Neural Networks

Algorithm 1

Pipelined stochastic gradient descent (PPL-SGD).
 Initialize weights
 Given learning rate sequence
fortodo
fortoin parallel do
  Compute gradient using stale weights:
   
  Update weights:
   
end for
end for