Benoit Steiner
4670d7d5ce
Improved the performance of full reductions on GPU:
Before:
BM_fullReduction/10 200000 11751 8.51 MFlops/s
BM_fullReduction/80 5000 523385 12.23 MFlops/s
BM_fullReduction/640 50 36179326 11.32 MFlops/s
BM_fullReduction/4K 1 2173517195 11.50 MFlops/s
After:
BM_fullReduction/10 500000 5987 16.70 MFlops/s
BM_fullReduction/80 200000 10636 601.73 MFlops/s
BM_fullReduction/640 50000 58428 7010.31 MFlops/s
BM_fullReduction/4K 1000 2006106 12461.95 MFlops/s
2016-05-09 17:09:54 -07:00
..
2016-05-09 17:09:54 -07:00
2016-05-05 10:02:26 -07:00
2015-11-30 16:00:22 +01:00
2016-01-27 18:34:42 +01:00
2013-06-18 09:44:40 +02:00
2013-01-11 10:40:35 +01:00
2013-01-11 10:40:35 +01:00
2015-10-16 18:21:02 -07:00
2013-01-11 10:40:35 +01:00
2015-12-07 12:23:22 +01:00
2013-01-11 10:40:35 +01:00
2013-02-20 14:10:14 +01:00
2014-09-06 14:59:44 +01:00
2013-01-11 10:40:35 +01:00
2015-02-28 16:41:00 +01:00
2013-06-03 23:09:33 +02:00
2013-01-11 10:40:35 +01:00
2016-03-23 15:37:45 +01:00
2013-01-11 10:40:35 +01:00
2013-01-11 10:40:35 +01:00
2014-09-18 15:15:27 +02:00
2013-01-11 10:40:35 +01:00