Benoit Steiner
|
ac5d706a94
|
Added support for simple coefficient-wise tensor expressions using half floats on CUDA devices
|
2016-02-19 08:19:12 +00:00 |
|
Benoit Steiner
|
0606a0a39b
|
FP16 on CUDA is only available starting with CUDA 7.5. Disable it when using an older version of CUDA
|
2016-02-18 23:15:23 -08:00 |
|
Benoit Steiner
|
17b9fbed34
|
Added preliminary support for half floats on CUDA GPUs. For now we can simply convert floats into half floats and vice versa
|
2016-02-19 06:16:07 +00:00 |
|
Eugene Brevdo
|
f7362772e3
|
Add digamma for CPU + CUDA. Includes tests.
|
2015-12-24 21:15:38 -08:00 |
|
Benoit Steiner
|
a6c243617b
|
Fixed a typo in previous change.
|
2015-12-21 09:05:45 -08:00 |
|
Benoit Steiner
|
51be91f15e
|
Added support for CUDA architectures that don't support 3.5 capabilities
|
2015-12-21 08:42:58 -08:00 |
|
Eugene Brevdo
|
fa4f933c0f
|
Add special functions to Eigen: lgamma, erf, erfc.
Includes CUDA support and unit tests.
|
2015-12-07 15:24:49 -08:00 |
|
Benoit Steiner
|
36cd6daaae
|
Made the CUDA implementation of ploadt_ro compatible with CUDA implementations older than 3.5
|
2015-11-03 16:36:30 -08:00 |
|
Benoit Steiner
|
98f8f0db9a
|
Added support for predux_mul for CUDA devices
|
2015-09-08 15:37:25 -07:00 |
|
Gael Guennebaud
|
6245591349
|
Fix prototype of plset and generalize linspace functor.
|
2015-08-07 19:27:59 +02:00 |
|
Gael Guennebaud
|
ce57dbd937
|
Let unpacket_traits<> expose the required alignment and make use of it everywhere
|
2015-08-07 10:44:01 +02:00 |
|
Benoit Steiner
|
abdbe8562e
|
Fixed the CUDA packet primitives
|
2015-03-24 10:45:46 -07:00 |
|
Benoit Steiner
|
7765039f1c
|
Marked the CUDA packet primitives as EIGEN_DEVICE_FUNC since they'll end up being executed on the GPU device.
|
2015-02-19 21:22:51 -08:00 |
|
Gael Guennebaud
|
6f4adc9e94
|
Add missing install directives for arch/CUDA
|
2015-02-18 11:40:06 +01:00 |
|
Benoit Steiner
|
509e4ddc02
|
Added reduction packet primitives for CUDA
|
2014-11-19 10:34:11 -08:00 |
|
Benoit Steiner
|
1946cc4478
|
Added missing packet primitives for CUDA.
|
2014-10-30 17:52:32 -07:00 |
|
Benoit Steiner
|
95a430a2ca
|
Vector primitives for CUDA
|
2014-10-03 19:45:19 -07:00 |
|