In Figure~\ref{fig:time_bbs_gpu} we highlight the performance of the optimized
BBS-based PRNG on GPU. On the Tesla C1060 we
-obtain approximately 1.8GSample/s and on the GTX 280 about 1.6GSample/s, which is
+obtain approximately 700MSample/s and on the GTX 280 about 670MSample/s, which is
obviously slower than the xorlike-based PRNG on GPU. However, we will show in the
next sections that
this new PRNG has a strong level of security, which is necessarily paid for by a speed
#threads bbs (c1060) opti rand/s (gtx285) opti
-10240 1627217759.55 1498102683.67
-20480 1749246235.44 1607349227.62
-30720 1833097798.03 1685516555.22
-51200 1798823386.81 1653380954.14
-76800 1802172135.34 1657005808.07
-102400 1810622021.15 1664347848.70
-153600 1831246356.28 1685425622.09
-512000 1826609454.30 1682096838.84
-768000 1825429813.30 1682980716.62
-1048576 1823084118.32 1679854343.07
-2097152 1818671555.41 1676695462.37
-5242880 1805432088.56 1667271144.46
+10240 622497604.46 597300046.29
+20480 677745855.23 650787308.05
+30720 701376668.60 672256823.11
+51200 692118233.26 663780345.74
+76800 701740920.54 660640087.24
+102400 693025472.40 664058098.59
+153600 701565276.17 672620334.61
+512000 700157216.29 671244011.43
+768000 701007355.78 671801398.71
+1048576 699292505.32 669783366.17
+2097152 698125940.68 668241397.05
+5242880 675888442.85 665832927.73