Research Article

High Performance Implementation of 3D Convolutional Neural Networks on a GPU

Algorithm 1

winograd transformation.
Input:
Temp array: ,
Output:
for   to   do
for   to   do
end for
end for
for   to   do
for   to   do
end for
end for
for   to   do
for   to   do
end for
end for