On a C2070M Tesla card, this code takes 37.68 ms to perform the multiplication. On an
Intel Xeon E31245 at 3.30 GHz, it takes 2465 ms without any parallelization (using
only one core). Consequently, the speedup between the CPU and GPU versions is