cudaMalloc and cudaMallocManaged? When will you pick one over another?
cudaMalloc
cudaMallocManaged
nvprof. Show the difference with and without shared memory. In addition, measure the global memory load throughput (gld_throughput) and instructions per cycle (ipc), and explain your observations.
grading policy