Commit Graph

  • 995714142d Restrict GCC<6.3 maxpd workaround to only gcc. Antonio Sánchez 2022-02-04 22:47:34 +00:00
  • 730a781221 Define EIGEN_HAS_AVX512_MATH in PacketMath. Antonio Sánchez 2022-02-04 22:25:52 +00:00
  • 77b2807322 Fix AVX512 math function consistency, enable for ICC. Antonio Sánchez 2022-02-04 19:35:18 +00:00
  • 52e545324e Fix ODR violations. Antonio Sánchez 2022-02-04 19:01:07 +00:00
  • 63dcb429cd Fix use of arg function in CUDA. Antonio Sánchez 2023-07-07 18:37:14 +00:00
  • 0cd4719f3e Silence some MSVC warnings Stephen Pierce 2022-01-19 21:34:10 +00:00
  • 770ed0794e fix broken asserts Erik Schultheis 2021-12-19 17:21:52 +02:00
  • 8f927fb52e Altivec: fix compilation with C++20 and higher Marcus Comstedt 2023-07-03 13:10:07 +02:00
  • d4b05454a7 Fix argument for _mm256_cvtps_ph imm parameter Kevin Leonardic 2023-07-03 13:44:20 +02:00
  • 15ac3765c4 Fix ivcSize return type in IndexedViewMethods.h Charles Schlosser 2023-07-03 03:49:37 +00:00
  • 3791ac8a1a Fix supportsMMA to obey EIGEN_ALTIVEC_MMA_DYNAMIC_DISPATCH compilation flag and compiler support. Chip Kerchner 2023-06-28 17:57:21 +00:00
  • bc57b926a0 Add Quaternion constructor from real scalar and imaginary vector H S Helson Go 2023-06-06 21:51:22 -04:00
  • 31cd2ad371 Ensure EIGEN_HAS_ARM64_FP16_VECTOR_ARITHMETIC is always defined on arm. Antonio Sánchez 2023-06-26 19:21:54 +00:00
  • 7465b7651e Disable FP16 arithmetic for arm32. Antonio Sánchez 2023-06-26 18:39:42 +00:00
  • b3267f6936 Remove unused variable in test/svd_common.h. Rasmus Munk Larsen 2023-06-23 23:12:19 +00:00
  • 211c5dfc67 Add optional offset parameter to ploadu_partial and pstoreu_partial Chip Kerchner 2023-06-23 19:53:05 +00:00
  • 44c20bbbe3 rint round floor ceil Charles Schlosser 2023-06-23 16:29:16 +00:00
  • 6ee86fd473 delete deprecated function call in svd test Charles Schlosser 2023-06-23 14:17:27 +00:00
  • 387175c258 Fix safe_abs in int_pow Charles Schlosser 2023-06-23 04:12:41 +00:00
  • c6db610bc7 Fix svd test Charles Schlosser 2023-06-22 17:37:24 +00:00
  • 969c31eefc Fix AVX pstore Charles Schlosser 2023-06-15 01:47:38 +00:00
  • 6c1411e521 define a move constructor for Ref<const...> wilfried.karel 2023-06-14 20:10:51 +00:00
  • d8f3eb87bf Compile- and run-time assertions for the construction of Ref<const>. wilfried.karel 2023-06-14 15:49:58 +00:00
  • 59b3ef5409 Partially Vectorize Cast Charles Schlosser 2023-06-09 16:54:31 +00:00
  • 7d7576f326 Avoid underflow in prsqrt. Rasmus Munk Larsen 2023-06-06 14:06:19 -07:00
  • b7151ffaab Fix unary pow error handling and test Charles Schlosser 2023-06-06 18:46:55 +00:00
  • 7ac8897431 Reduce max relative error of prsqrt from 3 to 2 ulps. Rasmus Munk Larsen 2023-06-04 22:25:33 +00:00
  • 1d80e23186 Optimize scalar_unary_pow_op error handling Charles Schlosser 2023-06-02 18:53:06 +00:00
  • 316eab8deb Do not set EIGEN_HAS_ARM64_FP16_SCALAR_ARITHMETIC for cuda compilation Alexander Shaposhnikov 2023-05-31 15:15:06 +00:00
  • 07e4604b19 Replace usage of CudaStreamDevice with GpuStreamDevice in tensor benchmarks GPU Alejandro Acosta 2023-05-29 17:23:58 +01:00
  • 8c43bf2b5b Clean up Redux.h and fix vectorization_logic test after changes to traversal order in Redux. Rasmus Munk Larsen 2023-05-24 20:26:52 +00:00
  • da6a71faf0 Add linear redux evaluators Charles Schlosser 2023-05-24 17:07:25 +00:00
  • 67a1e881d9 Sparse matrix column/row removal Charles Schlosser 2023-05-24 17:04:45 +00:00
  • de1c884687 Add reference to writeup of approach used in canonicalEulerAngles. Rasmus Munk Larsen 2023-05-24 15:52:26 +00:00
  • 307a417e1c Fix unrolled assignment evaluator Charles Schlosser 2023-05-22 16:39:24 +00:00
  • c18f94e3b0 Geometry/EulerAngles: introduce canonicalEulerAngles Juraj Oršulić 2023-05-19 15:42:22 +00:00
  • 7d9bb90f15 SVD: fix numerous compiler warnings / failures Charles Schlosser 2023-05-15 16:56:47 +00:00
  • 2709f4c8fb Use relative path to include EmulateArray.h in CXX11Meta.h, and get rid of redundant meta-programming code, which was moved to Core. Rasmus Munk Larsen 2023-05-09 23:21:35 +00:00
  • 9a02c977ec Use relative paths to include Meta.h and MaxSizeVector.h in Tensor Rasmus Munk Larsen 2023-05-09 22:07:55 +00:00
  • 96c42771d6 Make it possible to override the synchonization primitives used by the threadpool using macros. Rasmus Munk Larsen 2023-05-09 19:36:17 +00:00
  • 1321821e86 Add missing braces in Umeyama.h Rasmus Munk Larsen 2023-05-09 19:10:50 +00:00
  • 524c329ab2 Work around compiler bug in Umeyama.h. Rasmus Munk Larsen 2023-05-09 18:53:56 +00:00
  • fbf7189bd5 Fix cuda compilation Charles Schlosser 2023-05-08 16:15:47 +00:00
  • 0623791930 [SYCL-2020] Enabling USM support for SYCL. SYCL-1.2.1 did not have support for USM. Mehdi Goli 2023-05-05 17:30:36 +00:00
  • 1698c367a0 Use std::shared_ptr for FFTW/IMKL FFT plan implementation; Fixes #2651 Andrzej Ciarkowski 2023-05-05 16:58:23 +00:00
  • 1f79a6078f Return NaN in ndtri for values outside valid input range. Antonio Sánchez 2023-05-05 16:27:26 +00:00
  • 94f57867fe Thread pool Tobias Wood 2023-05-05 16:23:34 +00:00
  • 9eb8e2afba Change array_cwise test name Charles Schlosser 2023-05-05 03:08:43 +00:00
  • 725c11719b Visitor: fix modulo by zero compiler warning Charles Schlosser 2023-05-04 18:21:09 +00:00
  • b8208b363c Specialized loadColData correctly - fix previous BF16 GEMV MR Chip Kerchner 2023-05-04 16:38:17 +00:00
  • 2af03fb685 clean up array_cwise test Charles Schlosser 2023-05-04 16:02:08 +00:00
  • fda1373a15 Fix ColMajor BF16 GEMV for when vector is RowMajor Chip Kerchner 2023-05-03 20:12:50 +00:00
  • fdc749de2a JacobiSVD: set m_nonzeroSingularValues to zero if not finite Charles Schlosser 2023-05-02 17:48:21 +00:00
  • 6418ac0285 Unroll F32 to BF16 loop - 1.8X faster conversions for LLVM. Use vector pairs for GCC. Chip Kerchner 2023-05-01 16:54:16 +00:00
  • 874f5947f4 Add half-Packet operations to StridedLinearBufferCopy. Pedro Gonnet 2023-05-01 16:09:31 +00:00
  • c9a14f48d9 SSE Packet4ui has pcmp, pmin, pmax Charles Schlosser 2023-04-28 20:36:08 +00:00
  • 0b51f763cb Revert "Geometry/EulerAngles: make sure that returned solution has canonical ranges" Rasmus Munk Larsen 2023-04-27 00:06:23 +00:00
  • 2d0c6ad873 Revert "Vectorize cast" Antonio Sánchez 2023-04-26 18:03:36 +00:00
  • 8999525c29 AVX2: Packet4ul has pmul, abs2 Charles Schlosser 2023-04-26 16:22:16 +00:00
  • eb5ff1861a Vectorize cast Charles Schlosser 2023-04-26 02:50:13 +00:00
  • 3918768be1 Fix sparse iterator and tests. Antonio Sánchez 2023-04-25 19:05:49 +00:00
  • 70410310a4 Fix boolean bitwise and warning. Antonio Sanchez 2023-04-25 15:24:49 +00:00
  • f6cf5dca80 Packet4ul does not have Abs2 Charles Schlosser 2023-04-21 19:48:01 +00:00
  • 03f646b7e3 New VSX version of BF16 GEMV (Power) - up to 6.7X faster Chip Kerchner 2023-04-21 17:06:59 +00:00
  • 29c8e3c754 fix pow for uint32_t, disable pmul<Packet4ul> Charles Schlosser 2023-04-21 05:47:56 +00:00
  • 7f06bcae2c Geometry/EulerAngles: make sure that returned solution has canonical ranges Juraj Oršulić 2023-04-19 19:12:24 +00:00
  • a347dbbab2 Delete last few occurences of HasHalfPacket. Rasmus Munk Larsen 2023-04-19 10:36:59 -07:00
  • b378014fef Make sure we return +/-1 above the clamping point for Erf(). Rasmus Munk Larsen 2023-04-18 13:27:47 -07:00
  • e2bbf496f6 Use select ternary op in tensor select evaulator Charles Schlosser 2023-04-18 20:52:16 +00:00
  • 2b954be663 fix typo in sse packetmath Charles Schlosser 2023-04-18 18:17:41 +00:00
  • 25685c90ad Fix incorrect packet type for unsigned int version of pfirst() in MSVC workaround in PacketMath.h. Rasmus Munk Larsen 2023-04-18 17:46:23 +00:00
  • 1e223a956c Add missing 'f' in float literal in SpecialFunctionsImpl.h that triggers implicit conversion warning. Rasmus Munk Larsen 2023-04-18 17:33:29 +00:00
  • 3f3ce214e6 New BF16 pcast functions and move type casting to TypeCasting.h Chip Kerchner 2023-04-18 02:38:38 +00:00
  • 17b5b4de58 Add Packet4ui, Packet8ui, and Packet4ul to the SSE/AVX PacketMath.h headers Pedro Gonnet 2023-04-17 23:33:59 +00:00
  • 87300c93ca Refactor IndexedView Charles Schlosser 2023-04-17 12:32:50 +00:00
  • 1148f0a9ec Add dynamic dispatch to BF16 GEMM (Power) and new VSX version Chip Kerchner 2023-04-14 22:20:42 +00:00
  • 3026fc0d3c Improve accuracy of erf(). Rasmus Munk Larsen 2023-04-14 16:57:56 +00:00
  • 554fe02ae3 Enable new AVX512 GEMM kernel by default. Rasmus Munk Larsen 2023-04-12 13:39:06 -07:00
  • 0d12fcc34e Insert from triplets Charles Schlosser 2023-04-12 20:01:48 +00:00
  • 990a282fc4 exclude Eigen/Core and Eigen/src/Core from being ignored due to core ignore rule Rob Conde 2023-04-12 10:36:08 -04:00
  • b0eded878d DOC: Update documentation for 3.4.x Rohit Goswami 2023-04-06 19:20:41 +00:00
  • b0f877f8e0 Don't crash on empty tensor contraction. Rasmus Munk Larsen 2023-04-05 17:06:14 +00:00
  • 15fbddaf9b ASAN fixes for AVX512 GEMM/TRSM b-shi 2023-03-31 12:58:07 -07:00
  • 178ef8c97f qualify non-const symbolic indexed view with is_lvalue Charles Schlosser 2023-04-04 19:06:32 +00:00
  • df1049ddf4 Small packet math cleanup. Rasmus Munk Larsen 2023-04-04 16:14:32 +00:00
  • 9b48d10215 Guard all malloc, realloc and free() fonctions with check_that_malloc_is_allowed() Antoine Hoarau 2023-04-04 04:24:22 +00:00
  • c730290fa0 Use the correct truncating intrinsic for double->int casting. Rasmus Munk Larsen 2023-04-03 13:56:41 -07:00
  • 766db02020 disable raw array indexed view access for 1d arrays Charles Schlosser 2023-03-29 02:39:45 +00:00
  • bfbc66e078 refactor indexedviewmethods, enable non-const ref access with symbolic indices Charles Schlosser 2023-03-29 01:35:26 +00:00
  • 1a5dfd7c0f Fix incorrect casting in AVX512DQ path. Rasmus Munk Larsen 2023-03-27 09:28:06 -07:00
  • a08649994f Optimize generic_rsqrt_newton_step Charles Schlosser 2023-03-24 22:42:57 +00:00
  • b8b8a26145 Add more missing vectorized casts for int on x86, and remove redundant unit tests Rasmus Munk Larsen 2023-03-24 16:02:00 +00:00
  • 33e206f714 Remove unused declarations of BLAS/LAPACK routines unageek 2023-03-23 21:54:05 +00:00
  • d57a79e512 Optimize float->bool cast for AVX2, based on Charles Schlosser's comments. Rasmus Munk Larsen 2023-03-21 20:59:25 -07:00
  • a5ae832773 Fix reversal of arguments to _mm256_set_m128() in pcast<Packet4d, Packet8f>. Rasmus Munk Larsen 2023-03-22 03:21:44 +00:00
  • 09945f2cc1 Optimize casting for x86_64. Rasmus Munk Larsen 2023-03-21 18:24:16 +00:00
  • 8f9b8e3630 Replaced all instances of internal::(U)IntPtr with std::(u)intptr_t. Remove ICC workaround. Colin Broderick 2023-03-21 16:50:23 +00:00
  • 2c8011c2dd Fix arm builds. Antonio Sánchez 2023-03-20 16:59:38 +00:00
  • fd8f410bbe Fix 2624 2625 Charles Schlosser 2023-03-20 16:30:04 +00:00
  • e887196d9d Undo cmake pools changes Chip Kerchner 2023-03-17 16:06:26 +00:00