Implementation Details
Here is a list of hotspots ported to CUDA:
1. hypre_BoomerAMGBuildInterp in par_interp.c

coarseshiftinterp contains the CUDA calls.
2. GenerateLaplacian27pt in file par_laplacian_27pt.

The function BinarySearchOnCudaCallerDif contains the CUDA calls.
3. hypre_SeqVectorInnerProd

We found that this function was called repeatedly by two other hotspots, hypre_BoomerAMGCycle and Hypre_ParCSRMatrix, so we ported it to CUDA.
4. hypre_BoomerAMGRelax

This is another function from the file seq_mv/vector.c, ported to CUDA because of the large number of times it is called from the source of the hotspot.
5. hypre_ParCSRMatrixMatvec

This function was a hotspot in the 27pt run, and porting it to CUDA has reduced its cost.
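
To illustrate the kind of port described for hypre_SeqVectorInnerProd in item 3, here is a minimal sketch of a dot product on the GPU. It is not the code used in this project. The names dot_partial and inner_prod_cuda, the block size of 256, and the cap of 64 blocks are all assumptions made for the example, and error checking is left out for brevity.

```cuda
#include <cstdio>
#include <cstdlib>
#include <cuda_runtime.h>

#define BLOCK_SIZE 256  /* threads per block; a power of two for the tree reduction */
#define MAX_BLOCKS 64   /* assumed cap on the grid size for this sketch */

/* Each block reduces its share of x[i]*y[i] into one partial sum. */
__global__ void dot_partial(const double *x, const double *y,
                            double *partial, int n)
{
    __shared__ double cache[BLOCK_SIZE];
    int tid = threadIdx.x;
    double sum = 0.0;

    /* Grid-stride loop, so any n works with a fixed grid size. */
    for (int i = blockIdx.x * blockDim.x + tid; i < n;
         i += blockDim.x * gridDim.x)
        sum += x[i] * y[i];

    cache[tid] = sum;
    __syncthreads();

    /* Tree reduction in shared memory. */
    for (int s = blockDim.x / 2; s > 0; s >>= 1) {
        if (tid < s)
            cache[tid] += cache[tid + s];
        __syncthreads();
    }
    if (tid == 0)
        partial[blockIdx.x] = cache[0];
}

/* Hypothetical host wrapper (not hypre's API): copies the inputs to the
   device, launches the kernel, and sums the per-block results on the host. */
double inner_prod_cuda(const double *x, const double *y, int n)
{
    if (n <= 0)
        return 0.0;

    int blocks = (n + BLOCK_SIZE - 1) / BLOCK_SIZE;
    if (blocks > MAX_BLOCKS)
        blocks = MAX_BLOCKS;

    size_t bytes = (size_t)n * sizeof(double);
    double *dx, *dy, *dpartial;
    double hpartial[MAX_BLOCKS];

    cudaMalloc((void **)&dx, bytes);
    cudaMalloc((void **)&dy, bytes);
    cudaMalloc((void **)&dpartial, blocks * sizeof(double));
    cudaMemcpy(dx, x, bytes, cudaMemcpyHostToDevice);
    cudaMemcpy(dy, y, bytes, cudaMemcpyHostToDevice);

    dot_partial<<<blocks, BLOCK_SIZE>>>(dx, dy, dpartial, n);

    /* cudaMemcpy blocks until the kernel above has finished. */
    cudaMemcpy(hpartial, dpartial, blocks * sizeof(double),
               cudaMemcpyDeviceToHost);

    double result = 0.0;
    for (int b = 0; b < blocks; ++b)
        result += hpartial[b];

    cudaFree(dx);
    cudaFree(dy);
    cudaFree(dpartial);
    return result;
}
```

Calling inner_prod_cuda(x, y, n) on two host arrays of length n returns their dot product. In this sketch every call copies both vectors to the device, which would likely dominate the run time for a function that is called as often as the one in item 3. A port aimed at performance would probably keep the vectors resident on the GPU between calls.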