A comparative study between Nvidia Fermi and Kepler architectures are being undertaken. The key aspects being targeted are performance scaling for computational kernels like tiled matrix multiplication, memory transfer behavior, gains using streaming and performance difference observed when using THRUST library. Variation of execution configuration, working data set and higher occupancy on individual architectures would be exercised.
Contributors: Arindam Sinha and Dan Negrut