Commit Graph

  • e644f60907 Pulled latest updates from trunk Benoit Steiner 2016-02-21 20:24:59 +00:00
  • 95fceb6452 Added the ability to compute the absolute value of a half float Benoit Steiner 2016-02-21 20:24:11 +00:00
  • ed69cbeef0 Added some debugging information to the test to figure out why it fails sometimes Benoit Steiner 2016-02-21 11:20:20 -08:00
  • 96a24b05cc Optimized casting of tensors in the case where the casting happens to be a no-op Benoit Steiner 2016-02-21 11:16:15 -08:00
  • 203490017f Prevent unecessary Index to int conversions Benoit Steiner 2016-02-21 08:49:36 -08:00
  • 9ff269a1d3 Moved some of the fp16 operators outside the Eigen namespace to workaround some nvcc limitations. Benoit Steiner 2016-02-20 07:47:23 +00:00
  • 1e6fe6f046 Fixed the float16 tensor test. Benoit Steiner 2016-02-20 07:44:17 +00:00
  • 8eb127022b Get rid of duplicate code. Rasmus Munk Larsen 2016-02-19 16:33:30 -08:00
  • d5e2ec7447 Speed up tensor FFT by up ~25-50%. Rasmus Munk Larsen 2016-02-19 16:29:23 -08:00
  • d90a2dac5e merge Gael Guennebaud 2016-02-19 23:01:27 +01:00
  • 485823b5f5 Add COD and BDCSVD in list of benched solvers. Gael Guennebaud 2016-02-19 23:00:33 +01:00
  • 2af04f1a57 Extend unit test to stress smart_copy with empty input/output. Gael Guennebaud 2016-02-19 22:59:28 +01:00
  • 6fa35bbd28 bug #1170: skip calls to memcpy/memmove for empty imput. Gael Guennebaud 2016-02-19 22:58:52 +01:00
  • 46fc23f91c Print an error message to stderr when the initialization of the CUDA runtime fails. This helps debugging setup issues. Benoit Steiner 2016-02-19 13:44:22 -08:00
  • 6f0992c05b Fix nesting type and complete reflection methods of Block expressions. Gael Guennebaud 2016-02-19 22:21:02 +01:00
  • f3643eec57 Add typedefs for the return type of all block methods. Gael Guennebaud 2016-02-19 22:15:01 +01:00
  • 670db7988d Updated the contraction code to make it compatible with half floats. Benoit Steiner 2016-02-19 13:03:26 -08:00
  • 180156ba1a Added support for tensor reductions on half floats Benoit Steiner 2016-02-19 10:05:59 -08:00
  • 5c4901b83a Implemented the scalar division of 2 half floats Benoit Steiner 2016-02-19 10:03:19 -08:00
  • f268db1c4b Added the ability to query the minor version of a cuda device Benoit Steiner 2016-02-19 16:31:04 +00:00
  • a08d2ff0c9 Started to work on contractions and reductions using half floats Benoit Steiner 2016-02-19 15:59:59 +00:00
  • f3352e0fb0 Don't make the array constructors explicit Benoit Steiner 2016-02-19 15:58:57 +00:00
  • f7cb755299 Added support for operators +=, -=, *= and /= on CUDA half floats Benoit Steiner 2016-02-19 15:57:26 +00:00
  • dc26459b99 Implemented protate() for CUDA Benoit Steiner 2016-02-19 15:16:54 +00:00
  • cd042dbbfd Fixed a bug in the tensor type converter Benoit Steiner 2016-02-19 15:03:26 +00:00
  • ac5d706a94 Added support for simple coefficient wise tensor expression using half floats on CUDA devices Benoit Steiner 2016-02-19 08:19:12 +00:00
  • 0606a0a39b FP16 on CUDA are only available starting with cuda 7.5. Disable them when using an older version of CUDA Benoit Steiner 2016-02-18 23:15:23 -08:00
  • f36c0c2c65 Added regression test for float16 Benoit Steiner 2016-02-19 06:23:28 +00:00
  • 7151bd8768 Reverted unintended changes introduced by a bad merge Benoit Steiner 2016-02-19 06:20:50 +00:00
  • 1304e1fb5e Pulled latest updates from trunk Benoit Steiner 2016-02-19 06:17:02 +00:00
  • 17b9fbed34 Added preliminary support for half floats on CUDA GPU. For now we can simply convert floats into half floats and vice versa Benoit Steiner 2016-02-19 06:16:07 +00:00
  • 8ce46f9d89 Improved implementation of ptanh for SSE and AVX Benoit Steiner 2016-02-18 13:24:34 -08:00
  • 832380c455 Merged eigen/eigen into default Eugene Brevdo 2016-02-17 14:44:06 -08:00
  • 06a2bc7c9c Tiny bugfix in SpecialFunctions: some compilers don't like doubles implicitly downcast to floats in an array constructor. Eugene Brevdo 2016-02-17 14:41:59 -08:00
  • f6f057bb7d bug #1166: fix shortcomming in gemv when the destination is not a vector at compile-time. Gael Guennebaud 2016-02-15 21:43:07 +01:00
  • 8e1f1ba6a6 Import wiki's paragraph: "I disabled vectorization, but I'm still getting annoyed about alignment issues" Gael Guennebaud 2016-02-12 22:16:59 +01:00
  • c8b4c4b48a bug #795: mention allocate_shared as a condidate for aligned_allocator. Gael Guennebaud 2016-02-12 22:09:16 +01:00
  • 6eff3e5185 Fix triangularView versus triangularPart. Gael Guennebaud 2016-02-12 17:09:28 +01:00
  • 4252af6897 Remove dead code. Gael Guennebaud 2016-02-12 16:13:35 +01:00
  • 2f5f56a820 Fix usage of evaluator in sparse * permutation products. Gael Guennebaud 2016-02-12 16:13:16 +01:00
  • 0a537cb2d8 bug #901: fix triangular-view with unit diagonal of sparse rectangular matrices. Gael Guennebaud 2016-02-12 15:58:31 +01:00
  • b35d1a122e Fix unit test: accessing elements in a deque by offsetting a pointer to another element causes undefined behavior. Gael Guennebaud 2016-02-12 15:31:16 +01:00
  • 9e3f3a2d27 Deleted outdated comment Benoit Steiner 2016-02-11 17:27:35 -08:00
  • de345eff2e Added a method to conjugate the content of a tensor or the result of a tensor expression. Benoit Steiner 2016-02-11 16:34:07 -08:00
  • 17e93ba148 Pulled latest updates from trunk Benoit Steiner 2016-02-11 15:05:38 -08:00
  • 3628f7655d Made it possible to run the scalar_binary_pow_op functor on GPU Benoit Steiner 2016-02-11 15:05:03 -08:00
  • eeac46f980 bug #774: re-added comment referencing equations in the original paper Hauke Heibel 2016-02-11 19:38:37 +01:00
  • c569cfe12a Inline the +=, -=, *= and /= operators consistently between DenseBase.h and SelfCwiseBinaryOp.h Benoit Steiner 2016-02-11 09:33:32 -08:00
  • 8cc9232b9a bug #774: fix a numerical issue producing unwanted reflections. Gael Guennebaud 2016-02-11 15:32:56 +01:00
  • 2d35c0cb5f Merged in rmlarsen/eigen (pull request PR-163) Gael Guennebaud 2016-02-11 15:12:34 +01:00
  • 33e2373f01 Merged in nnyby/eigen/nnyby/doc-grammar-fix-linearly-space-linearly-1443742971203 (pull request PR-138) Benoit Steiner 2016-02-10 23:29:59 -08:00
  • 6d8b1dce06 Avoid implicit cast from double to float. Benoit Steiner 2016-02-10 18:07:11 -08:00
  • 1dfaafe28a Added a regression test for tanh Benoit Steiner 2016-02-10 17:41:47 -08:00
  • b6fdf7468c Rename inverse -> pseudoInverse. Rasmus Munk Larsen 2016-02-10 13:03:07 -08:00
  • 9d6f1ad398 I'm told to use __EMSCRIPTEN__ by an Emscripten dev. Benoit Jacob 2016-02-10 12:48:34 -05:00
  • bfb3fcd94f Optimized implementation of the tanh function for SSE Benoit Steiner 2016-02-10 08:52:30 -08:00
  • 2d523332b3 Optimized implementation of the hyperbolic tangent function for AVX Benoit Steiner 2016-02-10 08:48:05 -08:00
  • e6ee18d6b4 Make the GCC workaround for sqrt GCC-only; detect Emscripten as non-GCC Benoit Jacob 2016-02-10 11:11:49 -05:00
  • 2ac59e5d36 Pulled latest updates from trunk Benoit Steiner 2016-02-10 08:03:02 -08:00
  • 9a21b38ccc Worked around a few clang compilation warnings Benoit Steiner 2016-02-10 08:02:04 -08:00
  • 964a95bf5e Work around Emscripten bug - https://github.com/kripken/emscripten/issues/4088 Benoit Jacob 2016-02-10 10:37:22 -05:00
  • 72ab7879f7 Fixed clang comilation warnings Benoit Steiner 2016-02-10 06:48:28 -08:00
  • e88535634d Fixed some clang compilation warnings Benoit Steiner 2016-02-09 23:32:41 -08:00
  • 970751ece3 Disabling the nvcc warnings in addition to the clang warnings when clang is used as a frontend for nvcc Benoit Steiner 2016-02-09 20:55:50 -08:00
  • 6323851ea9 Fixed compilation warning Benoit Steiner 2016-02-09 20:43:41 -08:00
  • bb8811c655 Enable inverse() method for computing pseudo-inverse. Rasmus Munk Larsen 2016-02-09 20:35:20 -08:00
  • 5cc0dd5f44 Fixed the code that disables the use of variadic templates when compiling with nvcc on ARM devices. Benoit Steiner 2016-02-09 10:32:01 -08:00
  • a9cc6a06b9 Fixed compilation warning in the splines test Benoit Steiner 2016-02-09 05:10:06 +00:00
  • d69946183d Updated the TensorIntDivisor code to work properly on LLP64 systems Benoit Steiner 2016-02-08 21:03:59 -08:00
  • 24d291cf16 Worked around nvcc crash when compiling Eigen on Tegra X1 Benoit Steiner 2016-02-09 02:34:02 +00:00
  • 53f60e0afc Make applyZAdjointOnTheLeftInPlace protected. Rasmus Munk Larsen 2016-02-08 09:01:43 -08:00
  • 414efa47d3 Add missing calls to tests of COD. Fix a few mistakes in 3.2 -> 3.3 port. Rasmus Munk Larsen 2016-02-08 08:50:34 -08:00
  • c2bf2f56ef Remove custom unaligned loads for SSE. They were only useful for core2 CPU. Gael Guennebaud 2016-02-08 14:29:12 +01:00
  • a4c76f8d34 Improve inlining Gael Guennebaud 2016-02-08 11:33:02 +01:00
  • 16ec450ca1 Nevermind. Rasmus Munk Larsen 2016-02-06 17:54:01 -08:00
  • 019fff9a00 Add my name to copyright notice in ColPivHouseholder.h, mostly for previous work on stable norm downdate formula. Rasmus Munk Larsen 2016-02-06 17:48:42 -08:00
  • 86d6201d7b Merge. Rasmus Munk Larsen 2016-02-06 16:36:56 -08:00
  • d904c8ac8f Implement complete orthogonal decomposition in Eigen. Rasmus Munk Larsen 2016-02-06 16:32:00 -08:00
  • 010afe1619 Add exemples for reshaping/slicing with Map. Gael Guennebaud 2016-02-06 22:49:18 +01:00
  • 8e599bc098 Fix warning in unit test Gael Guennebaud 2016-02-06 20:26:59 +01:00
  • c6a12d1dc6 Fix warning with gcc < 4.8 Gael Guennebaud 2016-02-06 18:06:51 +01:00
  • 4d4211c04e Avoid unecessary type conversions Benoit Steiner 2016-02-05 18:19:41 -08:00
  • d2cba52015 Only enable the cxx11_tensor_uint128 test on 64 bit machines since 32 bit systems don't support the __uin128_t type Benoit Steiner 2016-02-05 18:14:23 -08:00
  • fb00a4af2b Made the tensor fft test compile on tegra x1 Benoit Steiner 2016-02-06 01:42:14 +00:00
  • 5b2d287878 bug #779: allow non aligned buffers for buffers smaller than the requested alignment. Gael Guennebaud 2016-02-05 21:46:39 +01:00
  • e8e1d504d6 Add an explicit assersion on the alignment of the pointer returned by std::malloc Gael Guennebaud 2016-02-05 21:38:16 +01:00
  • 62a1c911cd Remove posix_memalign, _mm_malloc, and _aligned_malloc special paths. Gael Guennebaud 2016-02-05 21:24:35 +01:00
  • 093f2b3c01 Merge. Rasmus Munk Larsen 2016-02-04 14:32:19 -08:00
  • 3ca1ae2bb7 Commented out the version of pexp<Packet8d> since it fails to compile with gcc 5.3 Benoit Steiner 2016-02-04 13:49:06 -08:00
  • 2e39cc40a4 Fix condition that made the unit test spam stdout with bogus error messages. Rasmus Munk Larsen 2016-02-04 12:56:14 -08:00
  • 23f69ab936 Added implementations of pexp, plog, psqrt, and prsqrt optimized for AVX512 Benoit Steiner 2016-02-04 10:36:36 -08:00
  • 6c9cf117c1 Fixed indentation Benoit Steiner 2016-02-04 10:34:10 -08:00
  • bcdcdace48 Pulled latest updates from trunk Benoit Steiner 2016-02-04 08:56:49 -08:00
  • 659fc9c159 Remove dead code Gael Guennebaud 2016-02-04 09:55:09 +01:00
  • d5d7798b9d Improve heuritics for switching between coeff-based and general matrix product implementation. Gael Guennebaud 2016-02-04 09:53:47 +01:00
  • f535378995 Added support for vectorized type casting of int to char. Benoit Steiner 2016-02-03 18:58:29 -08:00
  • 4ab63a3f6f Fixed the initialization of the dummy member of the array class to make it compatible with pairs of element. Benoit Steiner 2016-02-03 17:23:07 -08:00
  • 727ff26960 Disable 2 more nvcc warning messages Benoit Steiner 2016-02-03 16:01:37 -08:00
  • 1cbb79cdfd Made sure the dummy element of size 0 array is always intialized to silence some compiler warnings Benoit Steiner 2016-02-03 15:58:26 -08:00
  • bcbde37a11 Made sure the code compiles when EIGEN_HAS_C99_MATH isn't defined Benoit Steiner 2016-02-03 14:53:08 -08:00