Commit Graph

  • 81cb6a51d0 Fix some typos Jonas Schulze 2023-03-16 23:11:43 +00:00
  • 555cec17ed Fix parsing of command-line arguments when already specified as a cmake list. Antonio Sánchez 2023-03-16 22:47:38 +00:00
  • 7db19baabe Remove pools if cmake is less than 3.11 Chip Kerchner 2023-03-16 16:54:45 +00:00
  • 0488b708b4 Vectorize tensor.isnan() by using typed predicates. Rasmus Munk Larsen 2023-03-16 04:04:22 +00:00
  • f02856c640 Use EIGEN_NOT_A_MACRO macro (oh the irony!) to avoid build issue in TensorFlow. Rasmus Munk Larsen 2023-03-15 11:42:57 -07:00
  • 690ae9502f Use C++11 standard features for detecting presence of Inf and NaN Rasmus Munk Larsen 2023-03-15 16:52:44 +00:00
  • d71ac6a755 Fix recent PowerPC warnings and clang warning Chip Kerchner 2023-03-15 16:50:46 +00:00
  • d54d228b49 Limit the number of build jobs to 8 and link jobs to 4 for PowerPC. This should help reduce the OOM build problems. Chip Kerchner 2023-03-15 16:29:41 +00:00
  • 23e1541863 Put deadcode checks back in from previous change. Chip Kerchner 2023-03-14 00:57:16 +00:00
  • 6c58f0fe1f Revert changes that made BF16 GEMM to cause bad register spillage for LLVM (Power) Chip Kerchner 2023-03-13 23:36:06 +00:00
  • 8fe6190001 Add numext::isnan for AnnoyingOrange^H^H^H^H^H^HScalar. Rasmus Munk Larsen 2023-03-13 21:19:35 +00:00
  • 79de101d23 Handle PropagateFast the same way as PropagateNaN in minmax visitor to Rasmus Munk Larsen 2023-03-13 20:47:11 +00:00
  • 9d72412385 Add MMA to BF16 GEMV - 5.0-6.3X faster (for Power) Chip Kerchner 2023-03-13 19:37:13 +00:00
  • 2067b54b13 Fix bug in minmax_coeff_visitor for matrix of all NaNs. Rasmus Munk Larsen 2023-03-13 18:25:22 +00:00
  • ee0ff0ab3a Fix typo in MathFunctions.h Rasmus Munk Larsen 2023-03-13 15:50:40 +00:00
  • 21c49e8f8e Delete mystery character from Eigen/src/Core/arch/NEON/MathFunctions.h Rasmus Munk Larsen 2023-03-10 23:27:24 +00:00
  • 6bb9609bcb Make new Select implementation backwards compatible. Rasmus Munk Larsen 2023-03-10 23:07:47 +00:00
  • 394aabb0a3 Fix failing MSVC tests due to compiler bugs. Antonio Sánchez 2023-03-10 22:36:57 +00:00
  • d6235d76db Clean up generic packetmath specializations for various backends with the help of a macro. Rasmus Munk Larsen 2023-03-10 22:02:23 +00:00
  • e8fdf127c6 Work around compiler bug in Tridiagonalization.h Rasmus Munk Larsen 2023-03-10 21:21:07 +00:00
  • adf26b6840 Add newline to end of file. Rasmus Munk Larsen 2023-03-10 16:53:22 +00:00
  • 3492d9e2e5 s/Lesser/Less/ Rasmus Munk Larsen 2023-03-10 00:28:31 +00:00
  • 2419632cf5 Revert change to allFinite(), since the new version does not work for complex numbers. Rasmus Munk Larsen 2023-03-09 21:50:43 +00:00
  • b1beba8a3e Fix LinAlgSVD example code Zach Davis 2023-03-08 17:04:59 +00:00
  • 7bf2968fed Specify Permutation Index for PartialPivLU and FullPivLU Charles Schlosser 2023-03-07 20:28:05 +00:00
  • eb4dbf6135 Modify failing cwise test to get it to pass. Antonio Sánchez 2023-03-07 19:47:42 +00:00
  • e577f43ab2 Set CMAKE_* cache variables only when Eigen is a top-level project Timofey Pushkin 2023-03-07 14:39:45 +00:00
  • 1ce8b25825 Vectorize any() / all() Charles Schlosser 2023-03-06 23:54:02 +00:00
  • cb8e6d4975 Fix 2240, 2620 Charles Schlosser 2023-03-06 23:11:06 +00:00
  • d670039309 fix tensor comparison test Charles Schlosser 2023-03-06 13:11:14 +00:00
  • 2b513ca2a0 Added partial linear access for LHS & Output - 30% faster for bfloat16 GEMM MMA (Power) Chip Kerchner 2023-03-02 19:22:43 +00:00
  • 0b396c3167 Scalarize comps Charles Schlosser 2023-03-02 17:06:23 +00:00
  • 3abe12472e fix signed shift test Charles Schlosser 2023-03-01 14:31:13 +00:00
  • ba7417f146 Fix gpu conv3d out-of-resources failure. Antonio Sánchez 2023-02-28 21:25:00 +00:00
  • 62d5cfe835 Fix ODR issues with Intel's AVX512 TRSM kernels. Antonio Sánchez 2023-02-27 07:54:52 +00:00
  • 826627f653 vectorize comparisons and select by enabling typed comparisons Charles Schlosser 2023-02-25 20:52:11 +00:00
  • 2e9b945baf Fix bug that disabled vectorization for coeffMin/coeffMax. Rasmus Munk Larsen 2023-02-25 20:03:54 +00:00
  • bc5cdc7a67 Guard use of long double on GPU device. Antonio Sánchez 2023-02-24 21:49:59 +00:00
  • e4598fedbe Fix compiler versions for certain instructions on Power. Chip Kerchner 2023-02-23 23:24:41 +00:00
  • 1c0a6cf228 Get rid of EIGEN_HAS_AVX512_MATH workaround. Rasmus Munk Larsen 2023-02-23 23:16:41 +00:00
  • 00844e3865 Fix a number of MSAN failures in SVD tests. Rasmus Munk Larsen 2023-02-23 18:44:53 +00:00
  • c3f67063ed [SYCL-2020]- null placeholder accessor issue in Reduction SYCL test Mehdi Goli 2023-02-22 17:44:53 +00:00
  • 6bcd941ee3 Use pmsub in twoprod. This speeds up pow() on Skylake by ~1%. Rasmus Munk Larsen 2023-02-21 20:09:29 +00:00
  • ce62177b5b Vectorize atanh & add a missing definition and unit test for atan. Rasmus Munk Larsen 2023-02-21 03:14:05 +00:00
  • 049a144798 Add typed logicals Charles Schlosser 2023-02-18 01:23:47 +00:00
  • e797974689 Add and enable Packet int divide for Power10. Chip Kerchner 2023-02-17 19:04:18 +00:00
  • 54459214a1 Fix epsilon and dummy_precision values in long double for double doubles. Prevented some algorithms from converging on PPC. Chip Kerchner 2023-02-16 23:35:42 +00:00
  • a16fb889dd Guard complex sqrt on old MSVC compilers. Antonio Sánchez 2023-02-16 19:47:00 +00:00
  • 94b19dc5f2 Add CArg Charles Schlosser 2023-02-15 21:33:06 +00:00
  • 71a8e60a7a Tweak pasin_float, fix psqrt_complex Charles Schlosser 2023-02-15 01:01:14 +00:00
  • 384269937f More NEON packetmath fixes. Antonio Sánchez 2023-02-14 21:45:25 +00:00
  • c15b386203 Fix MSVC atan2 test. Antonio Sánchez 2023-02-14 18:30:58 +00:00
  • 2dfbf1b251 Fix NEON make_packet2f. Antonio Sánchez 2023-02-14 16:52:07 +00:00
  • 07aaa62e6f Fix compiler warnings in tests. Rasmus Munk Larsen 2023-02-14 02:29:03 +00:00
  • 4a03409569 Fix problem with array conversions BF16->F32 in Power. Chip Kerchner 2023-02-13 21:30:45 +00:00
  • 77b48c440e Fix compiler warnings. Rasmus Munk Larsen 2023-02-10 12:38:26 -08:00
  • 0ecae61568 Disable array BF16 to F32 conversions in Power Chip Kerchner 2023-02-10 20:06:58 +00:00
  • c999284bad Print diagonal matrix Charles Schlosser 2023-02-10 18:07:29 +00:00
  • fba12e02b3 Fold extra column calculations into an extra MMA accumulator and other bfloat16 MMA GEMM improvements Chip Kerchner 2023-02-10 17:32:06 +00:00
  • 79cfc74f4d Revert ODR changes and make gemm_extra_cols and gemm_complex_extra_cols EIGEN_ALWAYS_INLINE to avoid external functions. Chip Kerchner 2023-02-10 17:05:07 +00:00
  • f9659d91f1 Fix ODR violation with gemm_extra_cols on PPC Alexander Grund 2023-02-09 22:16:06 +00:00
  • 325e3063d9 Optimize psign Charles Schlosser 2023-02-09 22:15:26 +00:00
  • 0e490d452d Update file ColPivHouseholderQR_LAPACKE.h Charles Schlosser 2023-02-09 13:45:56 +00:00
  • 0a5392d606 Fix MSVC arm build. Antonio Sánchez 2023-02-08 21:46:37 +00:00
  • 3f7e775715 Add IWYU export pragmas to top-level headers. Antonio Sánchez 2023-02-08 17:40:31 +00:00
  • e4f58816d9 Get rid of custom implementation of equal_to and not_equal_to. No longer needed with C++14. Rasmus Munk Larsen 2023-02-07 21:36:44 -08:00
  • e256ad1823 Remove LGPL Code and references. Antonio Sánchez 2023-02-08 01:25:06 +00:00
  • e71f88abce Change in Power eigen_asserts to eigen_internal_asserts since it is putting unnecessary error checking and assertions without NDEBUG. Chip Kerchner 2023-02-08 00:57:30 +00:00
  • 232b18fa8a Fixes #2602 Gregory Kramida 2023-02-06 22:52:39 +00:00
  • f6cc359e10 More EIGEN_DEVICE_FUNC fixes for CUDA 10/11/12. Antonio Sánchez 2023-02-03 19:18:45 +00:00
  • 2a90653395 fix lapacke config Charles Schlosser 2023-02-03 16:40:08 +00:00
  • 3460f3558e Use VERIFY_IS_EQUAL to compare to zeros. Rasmus Munk Larsen 2023-02-01 13:49:56 -08:00
  • 13a1f25da9 Revert StlIterators edit from "Fix undefined behavior..." Jeremy Nimmer 2023-02-01 20:01:36 +00:00
  • fd2fd48703 Update file ForwardDeclarations.h Charles Schlosser 2023-02-01 16:52:20 +00:00
  • 37b2e97175 Tweak special case handling in atan2. Rasmus Munk Larsen 2023-01-31 17:48:00 -08:00
  • a1cdcdb038 Fix undefined behavior in Block access Jeremy Nimmer 2023-02-01 00:40:45 +00:00
  • 4a58f30aa0 Fix pre-POWER8_VECTOR bugs in pcmp_lt and pnegate and reactivate psqrt. Chip Kerchner 2023-01-31 19:40:24 +00:00
  • 12ad99ce60 Remove unused variables from GenericPacketMathFunctions.h Rasmus Munk Larsen 2023-01-29 18:10:28 +00:00
  • 6987a200bb Fix stupid sparse bugs with outerSize == 0 Charles Schlosser 2023-01-28 02:03:09 +00:00
  • 0471e61b4c Optimize various mathematical packet ops Charles Schlosser 2023-01-28 01:34:26 +00:00
  • 1aa6dc2007 Fix sparse warnings Charles Schlosser 2023-01-27 22:47:42 +00:00
  • 17ae83a966 Fix bugs exposed by enabling GPU asserts. Antonio Sánchez 2023-01-27 21:43:00 +00:00
  • ab8725d947 Turn off vectorize version of rsqrt - doesn't match generic version Chip Kerchner 2023-01-27 18:28:54 +00:00
  • 6d9f662a70 Tweak atan2 Charles Schlosser 2023-01-26 17:38:21 +00:00
  • 6fc9de7d93 Fix slowdown in bfloat16 MMA when rows is not a multiple of 8 or columns is not a multiple of 4. Chip Kerchner 2023-01-25 18:22:20 +00:00
  • 6d4221af76 Revert qr tests Charles Schlosser 2023-01-23 22:23:08 +00:00
  • 7f58bc98b1 Refactor sparse Charles Schlosser 2023-01-23 17:55:50 +00:00
  • 576448572f More fixes for __GNUC_PATCHLEVEL__. Rasmus Munk Larsen 2023-01-23 17:04:24 +00:00
  • 164ddf75ab Use __GNUC_PATCHLEVEL__ rather than __GNUC_PATCH__, according to the documentation https://gcc.gnu.org/onlinedocs/cpp/Common-Predefined-Macros.html Rasmus Munk Larsen 2023-01-23 16:56:14 +00:00
  • 5a7ca681d5 Fix sparse insert Charles Schlosser 2023-01-20 21:32:32 +00:00
  • 08c961e837 Add custom ODR-safe assert. Antonio Sánchez 2023-01-20 17:38:13 +00:00
  • 3fe8c51104 Replace the Deprecated $<CONFIGURATION> with $<CONFIG> Amir Masoud Abdol 2023-01-17 19:44:32 +00:00
  • d70b4864d9 issue #2581: review and cleanup of compiler version checks Sean McBride 2023-01-17 18:58:34 +00:00
  • b523120687 [SYCL-2020 Support] Enabling Intel DPCPP Compiler support to Eigen Mehdi Goli 2023-01-16 07:04:08 +00:00
  • bae119bb7e Support per-thread is_malloc_allowed() state tttapa 2023-01-16 01:34:56 +00:00
  • fa0bd2c34e improve sparse permutations Charles Schlosser 2023-01-15 03:21:25 +00:00
  • 2e61c0c6b4 Add missing EIGEN_DEVICE_FUNC in a few places when called by asserts. Antonio Sánchez 2023-01-15 02:06:17 +00:00
  • 4aca06f63a avoid move assignment in ColPivHouseholderQR Charles Schlosser 2023-01-15 01:34:10 +00:00
  • 68082b8226 Fix QR, again Charles Schlosser 2023-01-13 03:23:17 +00:00
  • 4d05765345 Altivec fixes for Darwin: do not use unsupported VSX insns Sergey Fedorov 2023-01-12 16:33:33 +00:00