Commit Graph

  • ec9c7163a3 Feature/skew symmetric matrix3 Thomas Gloor 2022-09-08 20:44:40 +00:00
  • 311ba66f7c Fix realloc for non-trivial types. Antonio Sánchez 2022-09-08 19:39:36 +00:00
  • 3c37dd2a1d Tweak bound for pow to account for floating-point types. Antonio Sánchez 2022-09-08 17:40:45 +00:00
  • f9dfda28ab Add missing comparison operators for GPU packets. Rasmus Munk Larsen 2022-09-07 14:10:02 -07:00
  • 242325eca7 Remove unused variable. Rasmus Munk Larsen 2022-09-07 20:46:44 +00:00
  • 133498c329 Add constexpr, test for C++14 constexpr. Tobias Schlüter 2022-03-05 19:02:12 +09:00
  • 69f50e3a67 Adjust overflow threshold bound for pow tests. Antonio Sánchez 2022-09-06 19:53:29 +00:00
  • 3e44f960ed Reduce compiler warnings for tests. Antonio Sanchez 2022-09-06 11:19:38 -07:00
  • b7e21d4e38 Call check_that_malloc_is_allowed() in aligned_realloc() Florian Richer 2022-09-05 09:11:08 +02:00
  • 6e83e906c2 fix typo in doc/TutorialSparse.dox Gilles Aouizerate 2022-09-04 21:20:21 +00:00
  • 525f066671 fixed msvc compilation error in GeneralizedEigenSolver.h Michael Palomas 2022-09-04 17:50:43 +00:00
  • f241a2c18a Add asserts for index-out-of-bounds in IndexedView. Antonio Sánchez 2022-09-02 17:28:03 +00:00
  • f5364331eb Fix some cmake issues. Antonio Sánchez 2022-09-02 16:43:14 +00:00
  • d816044b6e Fix mixingtypes tests. Antonio Sánchez 2022-09-02 15:30:13 +00:00
  • 94cc83faa1 2 typos fix in the 3rd table. Gilles Aouizerate 2022-08-31 19:54:42 +00:00
  • 30c42222a6 Fix some test build errors in new unary pow. Antonio Sánchez 2022-08-30 17:24:14 +00:00
  • bd393e15c3 Vectorize acos, asin, and atan for float. Rasmus Munk Larsen 2022-08-29 19:49:33 +00:00
  • e5af9f87f2 Vectorize pow for integer base / exponent types Charles Schlosser 2022-08-29 19:23:54 +00:00
  • 8acbf5c11c re-enable pow for complex types chuckyschluz 2022-08-26 17:29:02 -04:00
  • 7064ed1345 Specialize psign<Packet8i> for AVX2, don't vectorize psign<bool>. Rasmus Munk Larsen 2022-08-26 17:02:37 +00:00
  • 98e51c9e24 Avoid undefined behavior in array_cwise test due to signed integer overflow Rasmus Munk Larsen 2022-08-26 16:19:03 +00:00
  • a7c1cac18b Fix GeneralizedEigenSolver::info() and Asserts Arthur 2022-08-25 22:05:04 +00:00
  • 714678fc6c Add missing ptr in realloc call. Antonio Sanchez 2022-08-24 22:04:04 -07:00
  • b2a13c9dd1 Sparse Core: Replace malloc/free with conditional_aligned Charles Schlosser 2022-08-23 21:44:22 +00:00
  • 6aad0f821b Fix psign for unsigned integer types, such as bool. Rasmus Munk Larsen 2022-08-22 20:19:35 +00:00
  • 1a09defce7 Protect new pblend implementation with EIGEN_VECTORIZE_AVX2 Rasmus Munk Larsen 2022-08-22 18:28:03 +00:00
  • 7c67dc67ae Use proper double word division algorithm for pow<double>. Gives 11-15% speedup. Rasmus Munk Larsen 2022-08-17 18:36:23 +00:00
  • 7a3b667c43 Add support for AVX512-FP16 for vectorizing half precision math Matthew Sterrett 2022-08-17 18:15:21 +00:00
  • 76a669fb45 add fixed power unary operation Charles Schlosser 2022-08-16 21:32:36 +00:00
  • 39fcc89798 Removed unnecessary checks for FP16C Matthew Sterrett 2022-08-16 18:14:41 +00:00
  • 2f7cce2dd5 [SYCL] Fix some SYCL tests Romain Biessy 2022-08-16 17:37:54 +00:00
  • 27367017bd Disable bad "deprecated warning" edge-case in BDCSVD Arthur 2022-08-11 18:43:31 +00:00
  • b8e93bf589 Eliminate bool bitwise warnings. Antonio Sánchez 2022-08-09 22:42:30 +00:00
  • 66ea0c09fd Don't double-define Half functions on aarch64 Lexi Bromfield 2022-08-09 20:00:34 +00:00
  • 97e0784dc6 Vectorize the sign operator in Eigen. Rasmus Munk Larsen 2022-08-09 19:54:57 +00:00
  • be20207d10 Fix vectorized Jacobi Rotation Arthur 2022-08-08 19:29:56 +00:00
  • 7a87ed1b6a Fix code and unit test for a few corner cases in vectorized pow() Rasmus Munk Larsen 2022-08-08 18:48:36 +00:00
  • 9e0afe0f02 Fix non-VSX PowerPC build Chip Kerchner 2022-08-08 18:18:17 +00:00
  • 84a9d6fac9 Fix use of Packet2d type for non-VSX. Chip Kerchner 2022-08-03 20:48:13 +00:00
  • ce60a7be83 Partial Packet support for GEMM real-only (PowerPC). Also fix compilation warnings & errors for some conditions in new API. Chip Kerchner 2022-08-03 18:15:19 +00:00
  • 5a1c7807e6 Fix inner iterator for sparse block. Antonio Sánchez 2022-08-03 17:26:12 +00:00
  • 39d22ef46b Fix flaky packetmath_1 test. Antonio Sánchez 2022-08-02 17:42:45 +00:00
  • 7896c7dc6b Use numext::sqrt in ConjugateGradient. Antonio Sánchez 2022-07-29 20:17:23 +00:00
  • e618c4a5e9 Improve pblend AVX implementation Ilya Tokar 2022-07-29 18:45:33 +00:00
  • ef4654bae7 Add true determinant to QR and its variants sjusju 2022-07-29 18:24:14 +00:00
  • b7668c0371 Avoid including <sstream> with EIGEN_NO_IO Alexander Richardson 2022-07-29 18:02:51 +00:00
  • 7dd3dda3da Updated AccelerateSupport documentation after PR 966. John Mather 2022-07-29 17:42:31 +00:00
  • 69714ff613 Add Sparse Subset of Matrix Inverse Julian Kent 2022-07-28 18:04:35 +00:00
  • 34780d8bd1 Include immintrin.h header for emscripten. Antonio Sánchez 2022-07-22 02:27:42 +00:00
  • 2cf4d18c9c Disable AVX512 GEMM kernels by default. Antonio Sánchez 2022-07-20 21:22:48 +00:00
  • a678a3e052 Fix aligned_realloc to call check_that_malloc_is_allowed() if ptr == 0 Charles Schlosser 2022-07-19 20:59:07 +00:00
  • 4a56359406 Add option to disable avx512 GEBP kernels b-shi 2022-07-18 17:59:09 +00:00
  • 1092574b26 Fix wrong doxygen group usage Mathieu Westphal 2022-07-12 13:22:46 +02:00
  • e1165dbf9a AutoDiff depends on Core, so include appropriate header. Antonio Sánchez 2022-07-09 23:57:09 +00:00
  • bb51d9f4fa Fix ODR violations. Antonio Sánchez 2022-07-09 04:56:36 +00:00
  • 06a458a13d Enable subtests which use device side malloc since this has been fixed in ROCm 5.2. Rohit Santhanam 2022-06-29 17:09:43 +00:00
  • 84cf3ff18d Add pload_partial, pstore_partial (and unaligned versions), pgather_partial, pscatter_partial, loadPacketPartial and storePacketPartial. Chip Kerchner 2022-06-27 19:18:00 +00:00
  • c603275dc9 Better performance for Power10 using more load and store vector pairs for GEMV Chip Kerchner 2022-06-27 18:11:55 +00:00
  • 0e18714167 Fix clang-tidy warnings about function definitions in headers. Antonio Sanchez 2022-06-23 13:47:32 -07:00
  • 8ed3b9dcd6 Skip f16/bf16 bessel specializations on AVX512 if unavailable. Antonio Sánchez 2022-06-24 15:10:36 +00:00
  • bc2ab81634 Eliminate undef warnings when not compiling for AVX512. Antonio Sánchez 2022-06-24 15:10:10 +00:00
  • 0e083b172e Use numext::sqrt in Householder.h. Antonio Sánchez 2022-06-21 16:29:59 +00:00
  • 37673ca1bc AVX512 TRSM kernels use alloca if EIGEN_NO_MALLOC requested b-shi 2022-06-17 18:05:26 +00:00
  • 4d1c16eab8 Fix tanh and erf to use vectorized version for EIGEN_FAST_MATH in VSX. Chip Kerchner 2022-06-15 16:06:43 +00:00
  • 7ea823e824 [SYCL-Spec] According to [SYCL-2020 spec](... Mehdi Goli 2022-06-13 15:52:29 +00:00
  • ba4d7304e2 Document DiagonalBase Arthur 2022-06-08 17:46:32 +00:00
  • 95463b59bc Mark index_remap as EIGEN_DEVICE_FUNC in src/Core/Reshaped.h (Fixes #2493) Binhao Qin 2022-06-07 20:10:47 +00:00
  • 28812d2ebb AVX512 TRSM Kernels respect EIGEN_NO_MALLOC Shi, Brian 2022-06-06 17:03:10 -07:00
  • 9960a30422 Fix row vs column vector typo in Matrix class tutorial sfalmo 2022-06-07 17:28:19 +00:00
  • 8c2e0e3cb8 Fix ambiguous comparisons for c++20 (again again) Antonio Sánchez 2022-06-07 17:06:17 +00:00
  • 14aae29470 Provide DiagonalMatrix Product and Initializers Arthur 2022-06-06 21:43:22 +00:00
  • 76cf6204f3 Revert "Fix c++20 ambiguity of comparisons." Antonio Sánchez 2022-06-04 02:32:10 +00:00
  • 8fbb76a043 Fix build issues with MSVC for AVX512 aaraujom 2022-06-03 14:55:40 +00:00
  • 4f6354128f Fix c++20 ambiguity of comparisons. Antonio Sánchez 2022-06-03 05:11:07 +00:00
  • f542b0a71f Adding an MKL adapter in FFT module. Oleg Shirokobrod 2022-06-02 18:10:43 +00:00
  • d49ede4dc4 Add AVX512 s/dgemm optimizations for compute kernel (2nd try) aaraujom 2022-05-28 02:00:21 +00:00
  • 510f6b9f15 Fix integer shortening warnings in visitor tests. Rasmus Munk Larsen 2022-05-27 18:51:37 +00:00
  • 705ae70646 Add R-Bidiagonalization step to BDCSVD Arthur 2022-05-27 02:00:24 +00:00
  • e99163e732 fix: issue 2481: LDLT produce wrong results with AutoDiffScalar Mario Rincon-Nigro 2022-05-25 15:26:10 +00:00
  • 477eb7f630 Revert "Avoid ambiguous Tensor comparison operators for C++20 compatibility" Antonio Sánchez 2022-05-24 16:09:59 +00:00
  • c5a5ac680c [SYCL] SYCL-2020 range does not have default constructor. Mehdi Goli 2022-05-24 03:11:46 +00:00
  • 5c2179b6c3 Avoid ambiguous Tensor comparison operators for C++20 compatibility Benjamin Kramer 2022-05-23 17:36:03 +00:00
  • aa8b7e2c37 Add subMappers to Power GEMM packing - simplifies the address calculations (10% faster) Chip Kerchner 2022-05-23 15:18:29 +00:00
  • 32348091ba Avoid signed integer overflow in adjoint test. Antonio Sánchez 2022-05-23 14:46:16 +00:00
  • cbe03f3531 [SYCL] Extending SYCL queue interface extension. Mehdi Goli 2022-05-23 14:45:27 +00:00
  • 32a3f9ac33 Improve plogical_shift_* implementations and fix typo in SVE/PacketMath.h Guoqiang QI 2022-05-23 09:32:32 +00:00
  • ac5c83a3f5 unset executable flag Eisuke Kawashima 2022-05-22 22:46:53 +09:00
  • 481a4a8c31 Fix BDCSVD condition for failing with numerical issue. Antonio Sanchez 2022-05-20 08:17:07 -07:00
  • a9868bd5be Add arg() to tensor Tobias Wood 2022-05-17 12:01:39 +01:00
  • e7248b26a1 Prevent BDCSVD crash caused by index out of bounds. Antonio Sánchez 2022-05-19 22:29:48 +00:00
  • 028ab12586 Prevent BDCSVD crash caused by index out of bounds. Antonio Sánchez 2022-05-19 22:29:48 +00:00
  • 798fc1c577 Fix 'Incorrect reference code in STL_interface.hh for ata_product' eigen/isses/2425 Rohan Ghige 2022-05-18 14:42:57 +00:00
  • 9b9496ad98 Revert "Add AVX512 optimizations for matrix multiply" Antonio Sánchez 2022-05-13 18:50:33 +00:00
  • 25db0b4a82 Add AVX512 optimizations for matrix multiply aaraujom 2022-05-12 23:41:19 +00:00
  • 00b75375e7 Adding PocketFFT support in FFT module since kissfft has some flaw in accuracy and performance Guoqiang QI 2022-05-11 17:44:22 +00:00
  • 73d65dbc43 Update README.md. Remove obsolete comment about RowMajor not being fully supported. Rasmus Munk Larsen 2022-05-06 18:19:35 +00:00
  • 68e03ab240 Add uninstall target only if not already defined. Francesco Romano 2022-05-05 17:43:08 +00:00
  • 2c055f8633 make diagonal matrix cols() and rows() methods constexpr Alex_M 2022-05-03 06:32:48 +00:00
  • c2f15edc43 Add load vector_pairs for RHS of GEMM MMA. Improved predux GEMV. Chip Kerchner 2022-04-25 16:23:01 +00:00
  • 9e026e5e28 Removed need to supply the Symmetric flag to UpLo argument for Accelerate LLT and LDLT John Mather 2022-04-21 20:02:10 +00:00