Research Article

Efficient CSR-Based Sparse Matrix-Vector Multiplication on GPU

Algorithm 7

IPCSR with the adaptive number of threads per block.
Input: , , , , , ;
Output: ;
()    _;
()    _;
()    ; ;
()    ; ;
()    ; ;
    //Assemble into shared memory
()    for to with   +=   do
()      ;
()    done
()    ();
() ;
() if then
()   //Omitted: Perform a scalar-style reduction
() else
     //Omitted: Perform a multiple scalar-style reduction
() end