eigen

mirror of https://gitlab.com/libeigen/eigen.git synced 2026-04-10 11:34:33 +08:00

Author	SHA1	Message	Date
Benoit Steiner	1da10a7358	Enable the conversion between floats and half floats on older GPUs that support it.	2016-03-03 10:33:20 -08:00
Benoit Steiner	2de8cc9122	Merged in ebrevdo/eigen (pull request PR-167) Add infinity() support to numext::numeric_limits, use it in lgamma. I tested the code on my gtx-titan-black gpu, and it appears to work as expected.	2016-03-03 09:42:12 -08:00
Eugene Brevdo	ab3dc0b0fe	Small bugfix to numeric_limits for CUDA.	2016-03-02 21:48:46 -08:00
Eugene Brevdo	6afea46838	Add infinity() support to numext::numeric_limits, use it in lgamma. This makes the infinity access a __device__ function, removing nvcc warnings.	2016-03-02 21:35:48 -08:00
Gael Guennebaud	e9bea614ec	Fix shortcoming in fixed-value deduction of startRow/startCol	2016-02-29 10:31:27 +01:00
Gael Guennebaud	91e1375ba9	merge	2016-02-23 11:09:05 +01:00
Gael Guennebaud	055000a424	Fix startRow()/startCol() for dense Block with direct access: the initial implementation failed for empty rows/columns, for which they are ambiguous.	2016-02-23 11:07:59 +01:00
Benoit Steiner	6270d851e3	Declare the half float type as arithmetic.	2016-02-22 13:59:33 -08:00
Benoit Steiner	584832cb3c	Implemented the ptranspose function on half floats	2016-02-21 12:44:53 -08:00
Benoit Steiner	95fceb6452	Added the ability to compute the absolute value of a half float	2016-02-21 20:24:11 +00:00
Benoit Steiner	9ff269a1d3	Moved some of the fp16 operators outside the Eigen namespace to workaround some nvcc limitations.	2016-02-20 07:47:23 +00:00
Gael Guennebaud	d90a2dac5e	merge	2016-02-19 23:01:27 +01:00
Gael Guennebaud	6fa35bbd28	bug #1170 : skip calls to memcpy/memmove for empty input.	2016-02-19 22:58:52 +01:00
Gael Guennebaud	6f0992c05b	Fix nesting type and complete reflection methods of Block expressions.	2016-02-19 22:21:02 +01:00
Benoit Steiner	180156ba1a	Added support for tensor reductions on half floats	2016-02-19 10:05:59 -08:00
Benoit Steiner	5c4901b83a	Implemented the scalar division of 2 half floats	2016-02-19 10:03:19 -08:00
Benoit Steiner	f7cb755299	Added support for operators +=, -=, *= and /= on CUDA half floats	2016-02-19 15:57:26 +00:00
Benoit Steiner	dc26459b99	Implemented protate() for CUDA	2016-02-19 15:16:54 +00:00
Benoit Steiner	ac5d706a94	Added support for simple coefficient wise tensor expression using half floats on CUDA devices	2016-02-19 08:19:12 +00:00
Benoit Steiner	0606a0a39b	FP16 on CUDA are only available starting with cuda 7.5. Disable them when using an older version of CUDA	2016-02-18 23:15:23 -08:00
Benoit Steiner	17b9fbed34	Added preliminary support for half floats on CUDA GPU. For now we can simply convert floats into half floats and vice versa	2016-02-19 06:16:07 +00:00
Benoit Steiner	8ce46f9d89	Improved implementation of ptanh for SSE and AVX	2016-02-18 13:24:34 -08:00
Eugene Brevdo	832380c455	Merged eigen/eigen into default	2016-02-17 14:44:06 -08:00
Eugene Brevdo	06a2bc7c9c	Tiny bugfix in SpecialFunctions: some compilers don't like doubles implicitly downcast to floats in an array constructor.	2016-02-17 14:41:59 -08:00
Gael Guennebaud	f6f057bb7d	bug #1166 : fix shortcoming in gemv when the destination is not a vector at compile-time.	2016-02-15 21:43:07 +01:00
Benoit Steiner	3628f7655d	Made it possible to run the scalar_binary_pow_op functor on GPU	2016-02-11 15:05:03 -08:00
Benoit Steiner	c569cfe12a	Inline the +=, -=, *= and /= operators consistently between DenseBase.h and SelfCwiseBinaryOp.h	2016-02-11 09:33:32 -08:00
Gael Guennebaud	2d35c0cb5f	Merged in rmlarsen/eigen (pull request PR-163) Implement complete orthogonal decomposition in Eigen.	2016-02-11 15:12:34 +01:00
Benoit Steiner	33e2373f01	Merged in nnyby/eigen/nnyby/doc-grammar-fix-linearly-space-linearly-1443742971203 (pull request PR-138) [doc] grammar fix: "linearly space" -> "linearly spaced"	2016-02-10 23:29:59 -08:00
Benoit Steiner	6d8b1dce06	Avoid implicit cast from double to float.	2016-02-10 18:07:11 -08:00
Benoit Jacob	9d6f1ad398	I'm told to use __EMSCRIPTEN__ by an Emscripten dev.	2016-02-10 12:48:34 -05:00
Benoit Steiner	bfb3fcd94f	Optimized implementation of the tanh function for SSE	2016-02-10 08:52:30 -08:00
Benoit Steiner	2d523332b3	Optimized implementation of the hyperbolic tangent function for AVX	2016-02-10 08:48:05 -08:00
Benoit Jacob	e6ee18d6b4	Make the GCC workaround for sqrt GCC-only; detect Emscripten as non-GCC	2016-02-10 11:11:49 -05:00
Benoit Jacob	964a95bf5e	Work around Emscripten bug - https://github.com/kripken/emscripten/issues/4088	2016-02-10 10:37:22 -05:00
Benoit Steiner	970751ece3	Disabling the nvcc warnings in addition to the clang warnings when clang is used as a frontend for nvcc	2016-02-09 20:55:50 -08:00
Benoit Steiner	5cc0dd5f44	Fixed the code that disables the use of variadic templates when compiling with nvcc on ARM devices.	2016-02-09 10:32:01 -08:00
Benoit Steiner	24d291cf16	Worked around nvcc crash when compiling Eigen on Tegra X1	2016-02-09 02:34:02 +00:00
Gael Guennebaud	c2bf2f56ef	Remove custom unaligned loads for SSE. They were only useful for core2 CPU.	2016-02-08 14:29:12 +01:00
Gael Guennebaud	a4c76f8d34	Improve inlining	2016-02-08 11:33:02 +01:00
Rasmus Munk Larsen	86d6201d7b	Merge.	2016-02-06 16:36:56 -08:00
Rasmus Munk Larsen	d904c8ac8f	Implement complete orthogonal decomposition in Eigen.	2016-02-06 16:32:00 -08:00
Gael Guennebaud	c6a12d1dc6	Fix warning with gcc < 4.8	2016-02-06 18:06:51 +01:00
Gael Guennebaud	5b2d287878	bug #779 : allow non aligned buffers for buffers smaller than the requested alignment.	2016-02-05 21:46:39 +01:00
Gael Guennebaud	e8e1d504d6	Add an explicit assertion on the alignment of the pointer returned by std::malloc	2016-02-05 21:38:16 +01:00
Gael Guennebaud	62a1c911cd	Remove posix_memalign, _mm_malloc, and _aligned_malloc special paths.	2016-02-05 21:24:35 +01:00
Benoit Steiner	bcdcdace48	Pulled latest updates from trunk	2016-02-04 08:56:49 -08:00
Gael Guennebaud	659fc9c159	Remove dead code	2016-02-04 09:55:09 +01:00
Gael Guennebaud	d5d7798b9d	Improve heuristics for switching between coeff-based and general matrix product implementation.	2016-02-04 09:53:47 +01:00
Benoit Steiner	f535378995	Added support for vectorized type casting of int to char.	2016-02-03 18:58:29 -08:00

2812 Commits