Commit Graph

  • 5f79b7f9a9 Removed several shadowing types and use global Index typedef everywhere Christoph Hertzberg 2018-07-25 21:47:45 +02:00
  • 44ee201337 Rename variable which shadows class name Christoph Hertzberg 2018-07-25 20:26:15 +02:00
  • 705f66a9ca Account for missing change on commit "Remove SimpleThreadPool and..." Gustavo Lima Chaves 2018-07-23 16:29:09 -07:00
  • fd4fe7cbc5 Fixed issue which made documentation not getting built anymore Christoph Hertzberg 2018-07-24 22:56:15 +02:00
  • 636126ef40 Allow to filter out build-error messages Christoph Hertzberg 2018-07-24 20:12:49 +02:00
  • 34a75c3c5c Initial support of TensorBlock Eugene Zhulenev 2018-07-20 17:37:20 -07:00
  • 2c2de9da7d Merged in glchaves/eigen (pull request PR-433) Gael Guennebaud 2018-07-23 19:38:55 +00:00
  • 4ca3e48f42 fix typo Gael Guennebaud 2018-07-23 16:51:57 +02:00
  • c747cde69a Add lastN shorcuts to seq/seqN. Gael Guennebaud 2018-07-23 16:20:25 +02:00
  • 02eaaacbc5 Move cxx11_tensor_uint128 test under an EIGEN_TEST_CXX11 guarded block Gustavo Lima Chaves 2018-07-20 16:08:40 -07:00
  • 2bf864f1eb Disable type traits for stdlibc++ <= 4.9.3 Eugene Zhulenev 2018-07-20 10:11:44 -07:00
  • de70671937 Oopps, EIGEN_COMP_MSVC is not available before including Eigen. Gael Guennebaud 2018-07-20 17:51:17 +02:00
  • 56a750b6cc Disable optimization for sparse_product unit test with MSVC 2013, otherwise it takes several hours to build. Gael Guennebaud 2018-07-20 08:36:38 -07:00
  • c58b874727 PR430: Convert count to the reducer type in MeanReducer Eugene Zhulenev 2018-07-19 17:37:03 -07:00
  • 2424e3b7ac Pass by const ref. Gael Guennebaud 2018-07-19 18:48:19 +02:00
  • 509a5fa77f Fix IsRelocatable without C++11 Gael Guennebaud 2018-07-19 18:47:38 +02:00
  • 2ca2592009 Fix determination of EIGEN_HAS_TYPE_TRAITS Gael Guennebaud 2018-07-19 18:47:18 +02:00
  • 5e5987996f Fix stupid error in Quaternion move ctor Gael Guennebaud 2018-07-19 18:33:53 +02:00
  • d908afe35f bug #1558: fix a corner case in MINRES when both v_new and w_new vanish. David Hyde 2018-07-08 22:06:38 -07:00
  • 6e654f3379 Reduce number of allocations in TensorContractionThreadPool. Eugene Zhulenev 2018-07-16 14:26:39 -07:00
  • 7ccb623746 bug #1569: fix Tensor<half>::mean() on AVX with respective unit test. Gael Guennebaud 2018-07-19 13:15:40 +02:00
  • 1f523e7304 Add MIPS changes missing from previous merge. Alexey Frunze 2018-07-18 12:27:50 -07:00
  • e3c2d61739 Assert that no output kernel is defined for GPU contraction Eugene Zhulenev 2018-07-18 14:34:22 -07:00
  • 086ded5c85 Disable type traits for GCC < 5.1.0 Eugene Zhulenev 2018-07-18 16:32:55 -07:00
  • 79d4129cce Specify default output kernel for TensorContractionOp Eugene Zhulenev 2018-07-18 14:21:01 -07:00
  • 6e5a3b898f Add regression for bugs #1573 and #1575 Gael Guennebaud 2018-07-18 23:34:34 +02:00
  • 863580fe88 bug #1432: fix conservativeResize for non-relocatable scalar types. For those we need to by-pass realloc routines and fall-back to allocate as new - copy - delete. The remaining problem is that we don't have any mechanism to accurately determine whether a type is relocatable or not, so currently let's be super conservative using either RequireInitialization or std::is_trivially_copyable Gael Guennebaud 2018-07-18 23:33:07 +02:00
  • 053ed97c72 Generalize ScalarWithExceptions to a full non-copyable and trowing scalar type to be used in other unit tests. Gael Guennebaud 2018-07-18 23:27:37 +02:00
  • a503fc8725 bug #1575: fix regression introduced in bug #1573 patch. Move ctor/assignment should not be defaulted. Gael Guennebaud 2018-07-18 23:26:13 +02:00
  • 308725c3c9 More clearly disable the inclusion of src/Core/arch/CUDA/Complex.h without CUDA Gael Guennebaud 2018-07-18 13:51:36 +02:00
  • 1eff6cf8a7 Use device's allocate function instead of internal::aligned_malloc. This would make it easier to track memory usage in device instances. Yuefeng Zhou 2018-02-20 16:50:05 -08:00
  • 000840cae0 Added a move constructor and move assignment operator to Tensor and wrote some tests. Viktor Csomor 2018-02-07 19:10:54 +01:00
  • 3a2dc3869e Fix weird issue with MSVC 2013 Gael Guennebaud 2018-07-18 02:26:43 -07:00
  • c95aacab90 Fix TensorContractionOp evaluators for GPU and SYCL Eugene Zhulenev 2018-07-17 14:09:37 -07:00
  • 038b55464b Merged in deven-amd/eigen (pull request PR-425) Gael Guennebaud 2018-07-17 21:14:40 +00:00
  • f124f07965 applying EIGEN_DECLARE_TEST to *gpu* tests Deven Desai 2018-07-17 14:16:48 -04:00
  • dff3a92d52 Remove usage of #if EIGEN_TEST_PART_XX in unit tests that does not require them (splitting can thus be avoided for them) Gael Guennebaud 2018-07-17 15:52:58 +02:00
  • 82f0ce2726 Get rid of EIGEN_TEST_FUNC, unit tests must now be declared with EIGEN_DECLARE_TEST(mytest) { /* code */ }. This provide several advantages: - more flexibility in designing unit tests - unit tests can be glued to speed up compilation - unit tests are compiled with same predefined macros, which is a requirement for zapcc Gael Guennebaud 2018-07-17 14:46:15 +02:00
  • 37f4bdd97d Fix VERIFY_EVALUATION_COUNT(EXPR,N) with a complex expression as N Gael Guennebaud 2018-07-17 13:20:49 +02:00
  • 2b2cd85694 bug #1573: add noexcept move constructor and move assignment operator to Quaternion Gael Guennebaud 2018-07-17 11:11:33 +02:00
  • 43206ac4de Call OutputKernel in evalGemv Eugene Zhulenev 2018-07-12 14:52:23 -07:00
  • e204ecdaaf Remove SimpleThreadPool and always use {NonBlocking}ThreadPool Eugene Zhulenev 2018-07-16 15:06:57 -07:00
  • b324ed55d9 Call OutputKernel in evalGemv Eugene Zhulenev 2018-07-12 14:52:23 -07:00
  • 01fd4096d3 Fuse computations into the Tensor contractions using output kernel Eugene Zhulenev 2018-07-10 13:16:38 -07:00
  • 5539587b1f Some warning fixes Gael Guennebaud 2018-07-17 10:29:12 +02:00
  • 40797dbea3 bug #1572: use c++11 atomic instead of volatile if c++11 is available, and disable multi-threaded GEMM on non-x86 without c++11. Gael Guennebaud 2018-07-17 00:11:20 +02:00
  • add5757488 Simplify handling and non-splitted tests and include split_test_helper.h instead of re-generating it. This also allows us to modify it without breaking existing build folder. Gael Guennebaud 2018-07-16 18:55:40 +02:00
  • 901c7d31f0 Fix usage of EIGEN_SPLIT_LARGE_TESTS=ON: some unit tests, such as indexed_view have to be split unconditionally. Gael Guennebaud 2018-07-16 18:35:05 +02:00
  • f2b52f9946 Add the cmake option "EIGEN_DASHBOARD_BUILD_TARGET" to control the build target in dashboard mode (e.g., ctest -D Experimental) Gael Guennebaud 2018-07-16 17:59:30 +02:00
  • 23d82c1ac5 Merged in rmlarsen/eigen2 (pull request PR-422) Gael Guennebaud 2018-07-14 11:42:58 +00:00
  • a87cff20df Fix GeneralizedEigenSolver when requesting for eigenvalues only. Gael Guennebaud 2018-07-14 09:38:49 +02:00
  • 3a9cf4e290 Get rid of alias for m_broadcast. Rasmus Munk Larsen 2018-07-13 16:24:48 -07:00
  • 4222550e17 Optimize the case where broadcasting is a no-op. Rasmus Munk Larsen 2018-07-13 16:12:38 -07:00
  • 4a3952fd55 Relax the condition to not only work on Android. Rasmus Munk Larsen 2018-07-13 11:24:07 -07:00
  • 02a9443db9 Clang produces incorrect Thumb2 assembler when using alloca. Don't define EIGEN_ALLOCA when generating Thumb with clang. Rasmus Munk Larsen 2018-07-13 11:03:04 -07:00
  • 20991c3203 bug #1571: fix is_convertible<from,to> with "from" a reference. Gael Guennebaud 2018-07-13 17:47:28 +02:00
  • 1920129d71 Remove clang warning Gael Guennebaud 2018-07-13 16:05:35 +02:00
  • 195c9c054b Print more debug info in gpu_basic Gael Guennebaud 2018-07-13 16:05:07 +02:00
  • 06eb24cf4d Introduce gpu_assert for assertion in device-code, and disable them with clang-cuda. Gael Guennebaud 2018-07-13 16:04:27 +02:00
  • 5fd03ddbfb Make EIGEN_TEST_CUDA_CLANG more friendly with OSX Gael Guennebaud 2018-07-13 16:03:14 +02:00
  • 86d9c0255c Forward declaring std::array does not work with all std libs, so let's just include <array> Gael Guennebaud 2018-07-13 13:06:44 +02:00
  • 3875fb05aa Add support for MIPS SIMD (MSA) Alexey Frunze 2018-07-06 16:04:30 -07:00
  • 44ea5f7623 Add unit test for -Tensor<complex> on GPU Gael Guennebaud 2018-07-12 17:19:38 +02:00
  • 12e1ebb68b Remove local Index typedef from unit-tests Gael Guennebaud 2018-07-12 17:16:40 +02:00
  • 63185be8b2 Disable eigenvalues test for clang-cuda Gael Guennebaud 2018-07-12 17:03:14 +02:00
  • bec013b2c9 fix unused warning Gael Guennebaud 2018-07-12 17:02:18 +02:00
  • 5c73c9223a Fix shadowing typedefs Gael Guennebaud 2018-07-12 17:01:07 +02:00
  • 98728312c8 Fix compilation regarding std::array Gael Guennebaud 2018-07-12 17:00:37 +02:00
  • eb3d8f68bb fix unused warning Gael Guennebaud 2018-07-12 16:59:47 +02:00
  • 006e18e52b Cleanup the mess in Eigen/Core by moving CUDA/HIP stuff at more appropriate places (Macros.h), and alignment/vectorization logic is now in util/ConfigureVectorization.h Gael Guennebaud 2018-07-12 16:57:41 +02:00
  • 9a6a43319f Fix cxx11_tensor_fft not building on Windows. Thales Sabino 2018-07-12 11:20:59 +01:00
  • b347eb0b1c Fix doc Gael Guennebaud 2018-07-12 11:56:18 +02:00
  • 6d451cf2b6 Add missing consts for rows and cols functions in SparseLU Julian Kent 2018-02-10 13:44:05 +01:00
  • a12b8a8c75 FindEigen3: Set Eigen3_FOUND variable Daniele E. Domenichelli 2018-07-11 16:31:50 +02:00
  • 8bdb214fd0 remove double ;; Gael Guennebaud 2018-07-12 11:17:53 +02:00
  • a9060378d3 bug #1570: fix warning Gael Guennebaud 2018-07-12 11:07:09 +02:00
  • 6cd6551b26 Add deprecated header files for TensorFlow Gael Guennebaud 2018-07-12 10:50:53 +02:00
  • da0c604078 Merged in deven-amd/eigen (pull request PR-402) Gael Guennebaud 2018-07-12 08:07:16 +00:00
  • a4ea611ca7 Remove useless specialization thanks to is_convertible being more robust. Gael Guennebaud 2018-07-12 09:59:44 +02:00
  • 8a40dda5a6 Add some basic unit-tests Gael Guennebaud 2018-07-12 09:59:00 +02:00
  • 8ef267ccbd spellcheck Gael Guennebaud 2018-07-12 09:58:29 +02:00
  • 21cf4a1a8b Make is_convertible more robust and conformant to std::is_convertible Gael Guennebaud 2018-07-12 09:57:19 +02:00
  • 8a5955a052 Optimize the product of a householder-sequence with the identity, and optimize the evaluation of a HouseholderSequence to a dense matrix using faster blocked product. Gael Guennebaud 2018-07-11 17:16:50 +02:00
  • d193cc87f4 Fix regression in 9357838f94 Gael Guennebaud 2018-07-11 17:09:23 +02:00
  • fb33687736 Fix double ;; Gael Guennebaud 2018-07-11 17:08:30 +02:00
  • 876f392c39 Updates corresponding to the latest round of PR feedback Deven Desai 2018-07-11 10:39:54 -04:00
  • 1fe0b74904 deleting hip specific files that are no longer required Deven Desai 2018-07-11 09:28:44 -04:00
  • dec47a6493 renaming CUDA* to GPU* for some header files Deven Desai 2018-07-11 09:26:54 -04:00
  • 471cfe5ff7 renaming CUDA* to GPU* for some header files Deven Desai 2018-07-11 09:22:04 -04:00
  • 38807a2575 merging updates from upstream Deven Desai 2018-07-11 09:17:33 -04:00
  • f00d08cc0a Optimize extraction of Q in SparseQR by exploiting the structure of the identity matrix. Gael Guennebaud 2018-07-11 14:01:47 +02:00
  • 1625476091 Add internall::is_identity compile-time helper Gael Guennebaud 2018-07-11 14:00:24 +02:00
  • fe723d6129 Fix conversion warning Gael Guennebaud 2018-07-10 09:10:32 +02:00
  • 9357838f94 bug #1543: improve linear indexing for general block expressions Gael Guennebaud 2018-07-10 09:10:15 +02:00
  • de9e31a06d Introduce the macro ei_declare_local_nested_eval to help allocating on the stack local temporaries via alloca, and let outer-products makes a good use of it. If successful, we should use it everywhere nested_eval is used to declare local dense temporaries. Gael Guennebaud 2018-07-09 15:41:14 +02:00
  • 6190aa5632 bug #1567: add optimized path for tensor broadcasting and 'Channel First' shape Gael Guennebaud 2018-07-09 11:23:16 +02:00
  • ec323b7e66 Skip null numerators in triangular-vector-solve (as in BLAS TRSV). Gael Guennebaud 2018-07-09 11:13:19 +02:00
  • 359dd77ec3 Fix legitimate "declaration shadows a typedef" warning Gael Guennebaud 2018-07-09 11:03:39 +02:00
  • e2b2c61533 merging from master Deven Desai 2018-06-20 16:47:45 -04:00
  • 1bb6fa99a3 merging the CUDA and HIP implementation for the Tensor directory and the unit tests Deven Desai 2018-06-20 16:44:58 -04:00