Commit Graph

  • cfdabbcc8f removing the *Hip files from the unsupported/Eigen/CXX11/src/Tensor and unsupported/test directories Deven Desai 2018-06-20 12:57:02 -04:00
  • 7e41c8f1a9 renaming *Cuda files to *Gpu in the unsupported/Eigen/CXX11/src/Tensor and unsupported/test directories Deven Desai 2018-06-20 12:52:30 -04:00
  • ee73ae0a80 Merged eigen/eigen into default Deven Desai 2018-06-20 12:37:11 -04:00
  • 90a53ca6fd Fix the Packet16h version of ptranspose Mark D Ryan 2018-06-16 15:13:06 -07:00
  • 1f54164eca Fix a few issues with Packet16h Gael Guennebaud 2018-07-07 00:15:07 +02:00
  • f2dc048df9 complete implementation of Packet16h (AVX512) Gael Guennebaud 2018-07-06 17:43:11 +02:00
  • a937c50208 palign is not used anymore, so let's relax the unit test Gael Guennebaud 2018-07-06 17:41:52 +02:00
  • 56a33ae57d test product kernel with half-floats. Gael Guennebaud 2018-07-06 17:14:04 +02:00
  • f4d623ffa7 Complete Packet8h implementation and test it in packetmath unit test Gael Guennebaud 2018-07-06 17:13:36 +02:00
  • a8ab6060df Add unitests for inverse and selfadjoint-eigenvalues on CUDA Gael Guennebaud 2018-07-06 09:58:45 +02:00
  • b6cc0961b1 updates based on PR feedback Deven Desai 2018-06-14 10:21:54 -04:00
  • ba972fb6b4 moving Half headers from CUDA dir to GPU dir, removing the HIP versions Deven Desai 2018-06-13 12:26:18 -04:00
  • d1d22ef0f4 syncing this fork with upstream Deven Desai 2018-06-13 12:09:52 -04:00
  • f7124b3e46 Extend CUDA support to matrix inversion and selfadjointeigensolver Andrea Bocci 2018-06-11 18:33:24 +02:00
  • 0537123953 bug #1565: help MSVC to generatenot too bad ASM in reductions. Gael Guennebaud 2018-07-05 09:21:26 +02:00
  • 6a241bd8ee Implement custom inplace triangular product to avoid a temporary Gael Guennebaud 2018-07-03 14:02:46 +02:00
  • 3ae2083e23 Make is_same_dense compatible with different scalar types. Gael Guennebaud 2018-07-03 13:21:43 +02:00
  • 67ec37f7b0 Activate dgmres unit test Gael Guennebaud 2018-07-02 12:54:14 +02:00
  • 047677a08d Fix regression in changeset f05dea6b23 : computeFromHessenberg can take any expression for matrixQ, not only an HouseholderSequence. Gael Guennebaud 2018-07-02 12:18:25 +02:00
  • d625564936 Simplify redux_evaluator using inheritance, and properly rename parameters in reducers. Gael Guennebaud 2018-07-02 11:50:41 +02:00
  • d428a199ab bug #1562: optimize evaluation of small products of the form s*A*B by rewriting them as: s*(A.lazyProduct(B)) to save a costly temporary. Measured speedup from 2x to 5x... Gael Guennebaud 2018-07-02 11:41:09 +02:00
  • a7b313a16c Fix unit test Gael Guennebaud 2018-07-01 22:45:47 +02:00
  • 0cdacf3fa4 update comment Gael Guennebaud 2018-06-29 11:28:36 +02:00
  • 54f6eeda90 Merged in net147/eigen (pull request PR-411) Gael Guennebaud 2018-06-28 21:01:04 +00:00
  • 9a81de1d35 Fix order of EIGEN_DEVICE_FUNC and returned type Gael Guennebaud 2018-06-28 00:20:59 +02:00
  • b7689bded9 Use std::complex constructor instead of assignment from scalar Jonathan Liu 2018-06-28 00:32:37 +10:00
  • f9d337780d First step towards a generic vectorised quaternion product Gael Guennebaud 2018-06-25 14:26:51 +02:00
  • ee5864f72e bug #1560 fix product with a 1x1 diagonal matrix Gael Guennebaud 2018-06-25 10:30:12 +02:00
  • 2f62cc68cd merge Rasmus Munk Larsen 2018-06-22 15:09:44 -07:00
  • bda71ad394 Fix typo in pbend for AltiVec. Rasmus Munk Larsen 2018-06-22 15:04:35 -07:00
  • b6ffcd22e3 Merged in rmlarsen/eigen2 (pull request PR-409) Benoit Steiner 2018-06-21 18:34:57 +00:00
  • 4cc32d80fd bug #1555: compilation fix with XLC Gael Guennebaud 2018-06-21 10:28:38 +02:00
  • 5418154a45 Fix oversharding bug in parallelFor. Rasmus Munk Larsen 2018-06-20 17:51:48 -07:00
  • b8271bb368 fix md5sum of lapack_addons Gael Guennebaud 2018-06-15 14:21:29 +02:00
  • d3a380af4d Merged in mfigurnov/eigen/gamma-der-a (pull request PR-403) Benoit Steiner 2018-06-11 17:57:47 +00:00
  • cb4c9a6a94 bug #1531: make dedicatd unit testing for NumDimensions Gael Guennebaud 2018-06-08 17:11:45 +02:00
  • d6813fb1c5 bug #1531: expose NumDimensions for solve and sparse expressions. Gael Guennebaud 2018-06-08 16:55:10 +02:00
  • 89d65bb9d6 bug #1531: expose NumDimensions for compatibility with Tensor Gael Guennebaud 2018-06-08 16:50:17 +02:00
  • f05dea6b23 bug #1550: prevent avoidable memory allocation in RealSchur Gael Guennebaud 2018-06-08 10:14:57 +02:00
  • 7933267c67 fix prototype Gael Guennebaud 2018-06-08 09:56:01 +02:00
  • f4d1461874 Fix the way matrix folder is passed to the tests. Gael Guennebaud 2018-06-08 09:55:46 +02:00
  • 522d3ca54d Don't use std::equal_to inside cuda kernels since it's not supported. Benoit Steiner 2018-06-07 13:02:07 -07:00
  • 7d7bb91537 Missing line during manual rebase of PR-374 Christoph Hertzberg 2018-06-07 20:30:09 +02:00
  • 30fa3d0454 Merge from eigen/eigen Michael Figurnov 2018-06-07 17:57:56 +01:00
  • 5172a32849 Updated the stopping criteria in igammac_cf_impl. Michael Figurnov 2018-06-07 12:03:58 +01:00
  • 4bd158fa37 Derivative of the incomplete Gamma function and the sample of a Gamma random variable. Michael Figurnov 2018-06-06 18:49:26 +01:00
  • 8fbd47052b Adding support for using Eigen in HIP kernels. Deven Desai 2018-06-06 10:12:58 -04:00
  • b2053990d0 Adding EIGEN_DEVICE_FUNC to Products, especially Dense2Dense Assignment specializations. Otherwise causes problems with small fixed size matrix multiplication (call to 0x00 in call_assignment_no_alias in debug mode or trap in release with CUDA 9.1). Robert Lukierski 2018-03-14 16:19:43 +00:00
  • d2b0a4a59b Merged in mfigurnov/eigen/fix-bessel (pull request PR-404) Benoit Steiner 2018-06-07 16:12:42 +00:00
  • 6c71c7d360 Merge from eigen/eigen. Michael Figurnov 2018-06-07 15:54:18 +01:00
  • c25034710e Fiw some warnings in dox examples Gael Guennebaud 2018-06-07 16:09:22 +02:00
  • 37348d03ae Fix int versus Index Gael Guennebaud 2018-06-07 15:56:43 +02:00
  • c723ffd763 Fix warning Gael Guennebaud 2018-06-07 15:56:20 +02:00
  • af7c83b9a2 Fix warning Gael Guennebaud 2018-06-07 15:45:24 +02:00
  • 7fe29aceeb Fix MSVC warning C4290: C++ exception specification ignored except to indicate a function is not __declspec(nothrow) Gael Guennebaud 2018-06-07 15:36:20 +02:00
  • aa813d417b Fix compilation of special functions without C99 math. Michael Figurnov 2018-06-07 14:35:07 +01:00
  • 55774b48e4 Fix short vs long Gael Guennebaud 2018-06-07 15:26:25 +02:00
  • e5f9f4768f Avoid unnecessary C++11 dependency Christoph Hertzberg 2018-06-07 15:03:50 +02:00
  • b3fd93207b Fix typos found using codespell Gael Guennebaud 2018-06-07 14:43:02 +02:00
  • 405859f18d Set EIGEN_IDEAL_MAX_ALIGN_BYTES correctly for AVX512 builds Mark D Ryan 2018-05-17 17:04:00 +01:00
  • 6293ad3f39 Performance improvements to tensor broadcast operation 1. Added new packet functions using SIMD for NByOne, OneByN cases 2. Modified existing packet functions to reduce index calculations when input stride is non-SIMD 3. Added 4 test cases to cover the new packet functions Vamsi Sripathi 2018-05-23 14:02:05 -07:00
  • 7134fa7a2e Fix compilation with MSVC by reverting to char* for _mm_prefetch except for PGI (the later being the one that has the wrong prototype). Gael Guennebaud 2018-06-07 09:33:10 +02:00
  • e206f8d4a4 Merged in mfigurnov/eigen (pull request PR-400) Benoit Steiner 2018-06-05 17:05:21 +00:00
  • e2ed0cf8ab Add a ThreadPoolInterface* getter for ThreadPoolDevice. Penporn Koanantakool 2018-06-02 12:07:49 -07:00
  • 84868da904 Don't run hg on non mercurial clone Gael Guennebaud 2018-05-31 21:21:57 +02:00
  • f216854453 Exponentially scaled modified Bessel functions of order zero and one. Michael Figurnov 2018-05-31 15:34:53 +01:00
  • 6af1433cb5 Doc: add aliasing in common pitfaffs. Gael Guennebaud 2018-05-29 22:37:47 +02:00
  • ea94543190 Hyperlink DOIs against preferred resolver Katrin Leinweber 2018-05-24 18:55:40 +02:00
  • 999b552c16 Search for sequential Pastix. Gael Guennebaud 2018-05-29 20:49:25 +02:00
  • eef4b7bd87 Fix handling of path names containing spaces and the likes. Gael Guennebaud 2018-05-29 20:49:06 +02:00
  • 647b724a36 Define pcast<> for SSE types even when AVX is enabled. (otherwise float are silently reinterpreted as int instead of being converted) Gael Guennebaud 2018-05-29 20:46:46 +02:00
  • 49262dfee6 Fix compilation and SSE support with PGI compiler Gael Guennebaud 2018-05-29 15:09:31 +02:00
  • 750af06362 Add an option to test with external BLAS library Christoph Hertzberg 2018-05-22 21:04:32 +02:00
  • d06a753d10 Make qr_fullpivoting unit test run for fixed-sized matrices Christoph Hertzberg 2018-05-22 20:29:17 +02:00
  • f0862b062f Fix internal::is_integral<size_t/ptrdiff_t> with MSVC 2013 and older. Gael Guennebaud 2018-05-22 19:29:51 +02:00
  • 36e413a534 Workaround a MSVC 2013 compilation issue with MatrixBase(Index,int) Gael Guennebaud 2018-05-22 18:51:35 +02:00
  • 725bd92903 fix stupid typo Gael Guennebaud 2018-05-18 17:46:43 +02:00
  • a382bc9364 is_convertible<T,Index> does not seems to work well with MSVC 2013, so let's rather use __is_enum(T) for old MSVC versions Gael Guennebaud 2018-05-18 17:02:27 +02:00
  • 4dd767f455 add some internal checks Gael Guennebaud 2018-05-18 13:59:55 +02:00
  • 345c0ab450 check that all integer types are properly handled by mat(i,j) Gael Guennebaud 2018-05-18 13:46:46 +02:00
  • e7147f69ae Add tests for sparseQR results (value and size) covering bugs #1522 and #1544 Jeff Trull 2018-04-21 10:26:30 -07:00
  • 9f0c5c3669 Make sparse QR result sizes consistent with dense QR, with the following rules: Jeff Trull 2018-02-15 15:00:31 -08:00
  • d655900953 bug #1544: Generate correct Q matrix in complex case. Original patch was by Jeff Trull in PR-386. Christoph Hertzberg 2018-05-17 19:17:01 +02:00
  • 0371380d5b Merged in rmlarsen/eigen2 (pull request PR-393) Benoit Steiner 2018-05-16 21:45:42 +00:00
  • b8d36774fa Rename clip2 to clamp. Rasmus Munk Larsen 2018-05-16 14:04:48 -07:00
  • 812480baa3 Rename scalar_clip_op to scalar_clip2_op to prevent collision with existing functor in TensorFlow. Rasmus Munk Larsen 2018-05-16 09:49:24 -07:00
  • 1403c2c15b Merged in didierjansen/eigen (pull request PR-360) Benoit Steiner 2018-05-16 01:16:36 +00:00
  • ad355b3f05 Merged in rmlarsen/eigen2 (pull request PR-392) Benoit Steiner 2018-05-16 01:15:56 +00:00
  • 0272f2451a Fix "suggest parentheses around comparison" warning Christoph Hertzberg 2018-05-15 19:35:53 +02:00
  • afec3021f7 Use numext::maxi & numext::mini. Rasmus Munk Larsen 2018-05-14 16:35:39 -07:00
  • b8c8e5f436 Add vectorized clip functor for Eigen Tensors. Rasmus Munk Larsen 2018-05-14 16:07:13 -07:00
  • 6118c6ff4f Enable RawAccess to tensor slices whenever possinle. Avoid 32-bit integer overflow in TensorSlicingOp Benoit Steiner 2018-04-30 11:28:12 -07:00
  • 6e7118265d Fix compilation with NEON+MSVC Gael Guennebaud 2018-04-26 10:50:41 +02:00
  • 097dd4616d Fix unit test for SIMD engine not supporting sqrt Gael Guennebaud 2018-04-26 10:47:39 +02:00
  • 8810baaed4 Add multi-threading for sparse-row-major * dense-row-major Gael Guennebaud 2018-04-25 10:14:48 +02:00
  • 2f3287da7d Fix "used uninitialized" warnings Gael Guennebaud 2018-04-24 17:17:25 +02:00
  • 3ffd449ef5 Workaround warning Gael Guennebaud 2018-04-24 17:11:51 +02:00
  • e8ca5166a9 bug #1428: atempt to make NEON vectorization compilable by MSVC. The workaround is to wrap NEON packet types to make them different c++ types. Gael Guennebaud 2018-04-24 11:19:49 +02:00
  • 6f5935421a fix AVX512 plog Benoit Steiner 2018-04-23 15:49:26 +00:00
  • e9da464e20 Add specializations of is_arithmetic for long long in c++11 Gael Guennebaud 2018-04-23 16:26:29 +02:00