Commit Graph

  • c8d8d5c0fc Merged in rmlarsen/eigen_threadpool (pull request PR-640) Rasmus Larsen 2019-05-13 20:04:35 +00:00
  • 5f32b79edc Collapsed revision from PR-641 * SparseLU.h - corrected example, it didn't compile * Changed encoding back to UTF8 Christoph Hertzberg 2019-05-13 19:02:30 +02:00
  • ad372084f5 Removing unused API to fix compile error in TensorFlow due to AVX512VL, AVX512BW usage Anuj Rawat 2019-05-12 14:43:10 +00:00
  • 4ccd1ece92 bug #1707: Fix deprecation warnings, or disable warnings when testing deprecated functions Christoph Hertzberg 2019-05-10 14:57:05 +02:00
  • d3ef7cf03e Fix build with clang on Windows. Rasmus Munk Larsen 2019-05-09 11:07:04 -07:00
  • e5ac8cbd7a A) fix deadlocks in thread pool caused by EventCount Rasmus Munk Larsen 2019-05-08 10:16:46 -07:00
  • 45b40d91ca Fix AVX512 & GCC 6.3 compilation Eugene Zhulenev 2019-05-07 16:44:55 -07:00
  • e6667a7060 Fix stupid shadow-warnings (with old clang versions) Christoph Hertzberg 2019-05-07 18:32:19 +02:00
  • e54dc24d62 Restore C++03 compatibility Christoph Hertzberg 2019-05-07 18:30:44 +02:00
  • cca76c272c Restore C++03 compatibility Christoph Hertzberg 2019-05-06 16:18:22 +02:00
  • 8e33844fc7 Fix traits for scalar_logistic_op. Rasmus Munk Larsen 2019-05-03 15:49:09 -07:00
  • ff06ef7584 Eigen: Fix MSVC C++17 language standard detection logic To detect C++17 support, use _MSVC_LANG macro instead of _MSC_VER. _MSC_VER can indicate whether the current compiler version could support the C++17 language standard, but not whether that standard is actually selected (i.e. via /std:c++17). See these web pages for more details: https://devblogs.microsoft.com/cppblog/msvc-now-correctly-reports-__cplusplus/ https://docs.microsoft.com/en-us/cpp/preprocessor/predefined-macros Scott Ramsby 2019-05-03 14:14:09 -07:00
  • e9f0eb8a5e Add masked_store_available to unpacket_traits Eugene Zhulenev 2019-05-02 14:52:58 -07:00
  • 96e30e936a Add masked pstoreu for Packet16h Eugene Zhulenev 2019-05-02 14:11:01 -07:00
  • b4010f02f9 Add masked pstoreu to AVX and AVX512 PacketMath Eugene Zhulenev 2019-05-02 13:14:18 -07:00
  • 578407f42f Fix regression in changeset ae33e866c7 Gael Guennebaud 2019-05-02 15:45:21 +02:00
  • ac50afaffa Merged in ezhulenev/eigen-01 (pull request PR-633) Rasmus Larsen 2019-04-29 16:29:35 +00:00
  • d4dcb71bcb Speed up GEMV on AVX-512 builds, just as done for GEBP previously. Gustavo Lima Chaves 2019-04-26 14:12:39 -07:00
  • ae33e866c7 Fix compilation with PGI version 19 Andy May 2019-04-25 21:23:19 +01:00
  • 665ac22cc6 Merged in ezhulenev/eigen-01 (pull request PR-632) Gael Guennebaud 2019-04-25 20:02:20 +00:00
  • 01d7e6ee9b Check if gpu_assert was overridden in TensorGpuHipCudaDefines Eugene Zhulenev 2019-04-25 11:19:17 -07:00
  • 8ead5bb3d8 Fix doxygen warnings to enable statis code analysis Eugene Zhulenev 2019-04-24 12:42:28 -07:00
  • 07355d47c6 Get rid of SequentialLinSpacedReturnType deprecation warnings in DenseBase.h Eugene Zhulenev 2019-04-24 11:01:35 -07:00
  • 144ca33321 Remove deprecation annotation from typedef Eigen::Index Index, as it would generate too many build warnings. Rasmus Munk Larsen 2019-04-24 08:50:07 -07:00
  • a7b7f3ca8a Add missing EIGEN_DEPRECATED annotations to deprecated functions and fix few other doxygen warnings Eugene Zhulenev 2019-04-23 17:23:19 -07:00
  • 68a2a8c445 Use packet ops instead of AVX2 intrinsics Eugene Zhulenev 2019-04-23 11:41:02 -07:00
  • 8c7a6feb8e Adding lowlevel APIs for optimized RHS packet load in TensorFlow SpatialConvolution Anuj Rawat 2019-04-20 06:46:43 +00:00
  • 4270c62812 Split the implementation of i?amax/min into two. Based on PR-627 by Sameer Agarwal. Like the Netlib reference implementation, I*AMAX now uses the L1-norm instead of the L2-norm for each element. Changed I*MIN accordingly. Christoph Hertzberg 2019-04-15 17:18:03 +02:00
  • 039ee52125 Tweak cost model for tensor contraction when parallelizing over the inner dimension. Rasmus Munk Larsen 2019-04-12 13:35:10 -07:00
  • 9a3f06d836 Update TheadPoolDevice example to include ThreadPool creation and passing pointer into constructor. Jonathon Koyle 2019-04-10 10:02:33 -06:00
  • 66a885b61e adding EIGEN_DEVICE_FUNC to the recently added TensorContractionKernel constructor. Not having the EIGEN_DEVICE_FUNC attribute on it was leading to compiler errors when compiling Eigen in the ROCm/HIP path Deven Desai 2019-04-08 13:45:08 +00:00
  • 629ddebd15 Add missing semicolon Eugene Zhulenev 2019-04-02 15:04:26 -07:00
  • 4e2f6de1a8 Add support for custom packed Lhs/Rhs blocks in tensor contractions Eugene Zhulenev 2019-04-01 11:47:31 -07:00
  • 45e65fbb77 bug #1695: fix a numerical robustness issue. Computing the secular equation at the middle range without a shift might give a wrong sign. Gael Guennebaud 2019-03-27 20:16:58 +01:00
  • 8de66719f9 Collapsed revision from PR-619 * Add support for pcmp_eq in AltiVec/Complex.h * Fixed implementation of pcmp_eq for double William D. Irons 2019-03-26 18:14:49 +00:00
  • f11364290e ICC does not support -fno-unsafe-math-optimizations Gael Guennebaud 2019-03-22 09:26:24 +01:00
  • 51e399fc15 updates requested in the PR feedback. Also droping coded within #ifdef EIGEN_HAS_OLD_HIP_FP16 Deven Desai 2019-03-19 21:45:25 +00:00
  • 2dbea5510f Merged eigen/eigen into default Deven Desai 2019-03-19 16:52:38 -04:00
  • 5c93b38c5f Merged in rmlarsen/eigen (pull request PR-618) Rasmus Larsen 2019-03-18 15:51:55 +00:00
  • 48898a988a fix unit test in c++03: c++03 does not allow passing local or anonymous enum as template param Gael Guennebaud 2019-03-18 11:38:36 +01:00
  • cf7e2e277f bug #1692: enable enum as sizes of Matrix and Array Gael Guennebaud 2019-03-17 21:59:30 +01:00
  • e42f9aa68a Make clipping outside [-18:18] consistent for vectorized and non-vectorized paths of scalar_logistic_<float>. Rasmus Munk Larsen 2019-03-15 17:15:14 -07:00
  • 1936aac43f Merged in tellenbach/eigen/sykline_consistent_include_guards (pull request PR-617) Rasmus Larsen 2019-03-15 20:04:56 +00:00
  • bd9c2ae3fd Fix include guard comments David Tellenbach 2019-03-15 15:29:17 +01:00
  • 8450a6d519 Clean up half packet traits and add a few more missing packet ops. Rasmus Munk Larsen 2019-03-14 15:18:06 -07:00
  • b013176e52 Remove undefined std::complex<int> David Tellenbach 2019-03-14 11:40:28 +01:00
  • 97f9a46cb9 PR 593: Add variadtic ctor for DiagonalMatrix with unit tests David Tellenbach 2019-03-14 10:18:24 +01:00
  • 45ab514fe2 revert debug stuff Gael Guennebaud 2019-03-14 10:08:12 +01:00
  • 6a34003141 Remove EIGEN_MPL2_ONLY guard in IncompleteCholesky that is no longer needed after the AMD reordering code was relicensed to MPL2. Rasmus Munk Larsen 2019-03-13 11:52:41 -07:00
  • d7d2f0680e bug #1684: partially workaround clang's 6/7 bug #40815 Gael Guennebaud 2019-03-13 10:40:01 +01:00
  • 690f0795d0 Merged in rmlarsen/eigen (pull request PR-615) Rasmus Larsen 2019-03-12 16:09:48 +00:00
  • ed5cd0a4d1 erm.. use proper id Thomas Capricelli 2019-03-12 13:53:51 +01:00
  • 1901433674 erm.. use proper id Thomas Capricelli 2019-03-12 13:53:38 +01:00
  • 17c2fde66b update tracking code for 3.2 branch Thomas Capricelli 2019-03-12 13:49:49 +01:00
  • 90302aa8c9 update tracking code Thomas Capricelli 2019-03-12 13:47:01 +01:00
  • 77f7d4a894 Clean up PacketMathHalf.h and add a few missing logical packet ops. Rasmus Munk Larsen 2019-03-11 17:51:16 -07:00
  • 001f10e3c9 Fix segfaults with cuda compilation Eugene Zhulenev 2019-03-11 09:43:33 -07:00
  • 899c16fa2c Fix a bug in TensorGenerator for 1d tensors Eugene Zhulenev 2019-03-11 09:42:01 -07:00
  • 0f8bfff23d Fix a data race in NonBlockingThreadPool Eugene Zhulenev 2019-03-11 09:38:44 -07:00
  • 656d9bc66b Apply SSE's pmin/pmax fix for GCC <= 5 to AVX's pmin/pmax Gael Guennebaud 2019-03-10 21:19:18 +01:00
  • 2df4f00246 Change license from LGPL to MPL2 with agreement from David Harmon. Gael Guennebaud 2019-03-07 18:17:10 +01:00
  • 3c3f639fe2 Merge. Rasmus Munk Larsen 2019-03-06 11:54:30 -08:00
  • f4ec8edea8 Add macro EIGEN_AVOID_THREAD_LOCAL to make it possible to manually disable the use of thread_local. Rasmus Munk Larsen 2019-03-06 11:52:04 -08:00
  • 41cdc370d0 Fix placement of "#if defined(EIGEN_GPUCC)" guard region. Rasmus Munk Larsen 2019-03-06 11:42:22 -08:00
  • cc407c9d4d Fix placement of "#if defined(EIGEN_GPUCC)" guard region. Rasmus Munk Larsen 2019-03-06 11:40:06 -08:00
  • 1bc2a0a57c Add missing return to NonBlockingThreadPool::LocalSteal Eugene Zhulenev 2019-03-06 10:49:49 -08:00
  • 4e4dcd9026 Remove redundant steal loop Eugene Zhulenev 2019-03-06 10:39:07 -08:00
  • 4d808e834a Merged in rmlarsen/eigen_threadpool (pull request PR-606) Rasmus Larsen 2019-03-06 17:59:03 +00:00
  • 2ea18e505f Merged in ezhulenev/eigen-01 (pull request PR-610) Rasmus Larsen 2019-03-06 16:49:38 +00:00
  • 25abaa2e41 Check that inner block dimension is continuous Eugene Zhulenev 2019-03-05 17:34:35 -08:00
  • 5d9a6686ed Block evaluation for TensorGeneratorOp Eugene Zhulenev 2019-03-05 16:35:21 -08:00
  • b4861f4778 Merged in ezhulenev/eigen-01 (pull request PR-609) Rasmus Larsen 2019-03-05 23:54:40 +00:00
  • bfbf7da047 bug #1689 fix used-but-marked-unused warning Gael Guennebaud 2019-03-05 23:46:24 +01:00
  • a407e022e6 Tune tensor contraction threadpool heuristics Eugene Zhulenev 2019-03-05 14:19:59 -08:00
  • 56c6373f82 Add an extra check for the RunQueue size estimate Eugene Zhulenev 2019-03-05 11:51:26 -08:00
  • b1a8627493 Do not create Tensor<const T> in cxx11_tensor_forced_eval test Eugene Zhulenev 2019-03-05 11:19:25 -08:00
  • 0318fc7f44 Remove EIGEN_MPL2_ONLY guards around code re-licensed from LGPL to MPL2 in 2ca1e73239 Rasmus Munk Larsen 2019-03-05 10:24:54 -08:00
  • efb5080d31 Do not initialize invalid fast_strides in TensorGeneratorOp Eugene Zhulenev 2019-03-04 16:58:49 -08:00
  • b95941e5c2 Add tiled evaluation for TensorForcedEvalOp Eugene Zhulenev 2019-03-04 16:02:22 -08:00
  • 694084ecbd Use fast divisors in TensorGeneratorOp Eugene Zhulenev 2019-03-04 11:10:21 -08:00
  • b0d406d91c Enable construction of Ref<VectorType> from a runtime vector. Gael Guennebaud 2019-03-03 15:25:25 +01:00
  • 9ba81cf0ff Fully qualify Eigen::internal::aligned_free Sam Hasinoff 2019-03-02 17:42:16 +00:00
  • 22144e949d bug #1629: fix compilation of PardisoSupport (regression introduced in changeset a7842daef2 ) Gael Guennebaud 2019-03-02 22:44:47 +01:00
  • b071672e78 Do not keep latex logs Bernhard M. Wiedemann 2019-02-27 11:09:00 +01:00
  • cf4a1c81fa Fix specialization for conjugate on non-complex types in TensorBase.h. Rasmus Munk Larsen 2019-03-01 14:21:09 -08:00
  • c181dfb8ab Consistently use EIGEN_BLAS_FUNC in BLAS. Sameer Agarwal 2019-02-27 11:30:58 -08:00
  • 9558f4c25f Merged in rmlarsen/eigen_threadpool (pull request PR-596) Rasmus Larsen 2019-02-26 20:37:26 +00:00
  • 2ca1e73239 Merged in rmlarsen/eigen (pull request PR-597) Rasmus Larsen 2019-02-25 17:02:16 +00:00
  • e409dbba14 Enable SSE vectorization of Quaternion and cross3() with AVX Gael Guennebaud 2019-02-23 10:45:40 +01:00
  • 6560692c67 Improve EventCount used by the non-blocking threadpool. Rasmus Munk Larsen 2019-02-22 13:56:26 -08:00
  • 0b25a5c431 fix alignment in ploadquad Gael Guennebaud 2019-02-22 21:39:36 +01:00
  • 1dc1677d52 Change licensing of OrderingMethods/Amd.h and SparseCholesky/SimplicialCholesky_impl.h from LGPL to MPL2. Google LLC executed a license agreement with the author of the code from which these files are derived to allow the Eigen project to distribute the code and derived works under MPL2. Rasmus Munk Larsen 2019-02-22 12:33:57 -08:00
  • 0cb4ba98e7 update wrt recent changes Gael Guennebaud 2019-02-21 17:19:36 +01:00
  • cca6c207f4 AVX512: implement faster ploadquad<Packet16f> thus speeding up GEMM Gael Guennebaud 2019-02-21 17:18:28 +01:00
  • 1c09ee8541 bug #1674: workaround clang fast-math aggressive optimizations Gael Guennebaud 2019-02-22 15:48:53 +01:00
  • 7e3084bb6f Fix compilation on ARM. Gael Guennebaud 2019-02-22 14:56:12 +01:00
  • 32502f3c45 bug #1684: add simplified regression test for respective clang's bug (this also reveal the same bug in Apples's clang) Gael Guennebaud 2019-02-22 10:29:06 +01:00
  • 42c23f14ac Speed up col/row-wise reverse for fixed size matrices by propagating compile-time sizes. Gael Guennebaud 2019-02-21 22:44:40 +01:00
  • 4d7f317102 Add a few missing packet ops: cmp_eq for NEON. pfloor for GPU. Rasmus Munk Larsen 2019-02-21 13:32:13 -08:00
  • 2a39659d79 Add fully generic Vector<Type,Size> and RowVector<Type,Size> type aliases. Gael Guennebaud 2019-02-20 15:23:23 +01:00