Commit Graph

  • 883168ed94 Make select CUDA compatible (comparison operators aren't yet, so no test case yet) Christoph Hertzberg 2014-10-30 20:16:16 +01:00
  • e5f134006b EIGEN_UNUSED_VARIABLE works better than casting to void. Make this also usable from CUDA code Christoph Hertzberg 2014-10-30 19:59:09 +01:00
  • d2fc597d5b Removed deprecated header (unsupported/Eigen/BDCSVD is included in Eigen/SVD now) Christoph Hertzberg 2014-10-29 17:51:14 +01:00
  • 3d25b1f5b8 Split up some test cases Christoph Hertzberg 2014-10-29 17:46:54 +01:00
  • acecb7b09f Fixed include in bdcsvd.cpp Christoph Hertzberg 2014-10-29 17:46:33 +01:00
  • 21c0a2ce0c Move D&C SVD to official SVD module. Gael Guennebaud 2014-10-29 11:29:33 +01:00
  • debc97821c Added support for tensor references Benoit Steiner 2014-10-28 23:10:13 -07:00
  • e2e7ba9f85 bug #898: add inline hint to const_cast_ptr Christoph Hertzberg 2014-10-28 14:49:44 +01:00
  • bd2d330b25 Temporary workaround for bug #875: Let TriangularView<Sparse>::nonZeros() return nonZeros() of the nested expression Christoph Hertzberg 2014-10-28 13:31:00 +01:00
  • 79225db0b6 Merged in kmargar/eigen (pull request PR-87) Konstantinos Margaritis 2014-10-28 13:08:53 +02:00
  • c426054767 BDCSVD: fix CMake install (missing separator). Benjamin Chrétien 2014-10-24 15:10:56 +02:00
  • 1fa793cb97 Removed weird self assignment. Christoph Hertzberg 2014-10-24 13:19:19 +02:00
  • 04ffb9956e Replace TEST_SET_BUT_UNUSED_VARIABLE by already defined EIGEN_UNUSED_VARIABLE Christoph Hertzberg 2014-10-24 13:18:23 +02:00
  • 94ed7c81e6 Bug #896: Swap order of checking __VSX__/__ALTIVEC__ Konstantinos Margaritis 2014-10-22 06:15:18 -04:00
  • fcb3573d17 Merged eigen/eigen into default Konstantinos Margaritis 2014-10-22 10:42:18 +03:00
  • fae4fd7a26 Added ARMv8 support Konstantinos Margaritis 2014-10-22 07:39:49 +00:00
  • cf09c5f687 Prevent the CUDA error "calling a __host__ function from a __host__ __device__ function is not allowed". Christoph Hertzberg 2014-10-21 20:40:09 +02:00
  • b508619392 working 64-bit support in PacketMath.h, Complex.h needed Konstantinos Margaritis 2014-10-21 18:10:33 +00:00
  • 0f65f2762d add EIGEN_TEST_NEON64, but it's a dummy: AArch64 implies NEON support, so no extra CXXFLAGS are needed Konstantinos Margaritis 2014-10-21 18:10:01 +00:00
  • 87524922dc check for __ARM_NEON instead as it's defined in arm64 as well Konstantinos Margaritis 2014-10-21 18:08:50 +00:00
  • a303b6a733 bug #670: add unit test for mapped input in sparse solver. Gael Guennebaud 2014-10-20 16:46:47 +02:00
  • fe57b2f963 bug #701: workaround (min) and (max) blocking ADL by introducing numext::mini and numext::maxi internal functions and an EIGEN_NOT_A_MACRO macro. Gael Guennebaud 2014-10-20 15:55:32 +02:00
  • c12b7896d0 bug #766: Check minimum CUDA version Christoph Hertzberg 2014-10-20 14:23:11 +02:00
  • 973e6a035f bug #718: Introduce a compilation error when using the wrong InnerIterator type with a SparseVector Gael Guennebaud 2014-10-20 14:07:08 +02:00
  • 84aaa03182 Addendum to bug #859: pexp(NaN) for double did not return NaN, also, plog(NaN) did not return NaN. psqrt(NaN) and psqrt(-1) shall return NaN if EIGEN_FAST_MATH==0 Christoph Hertzberg 2014-10-20 13:13:43 +02:00
  • aa5f79206f Fix bug #859: pexp(NaN) returned Inf instead of NaN Gael Guennebaud 2014-10-20 11:38:51 +02:00
  • b4a9b3f496 Add unit tests for Rotation2D's inverse(), operator*, slerp, and fix regression wrt explicit ctor change Gael Guennebaud 2014-10-20 11:04:32 +02:00
  • d04f23260d Fix bug #894: the sign of LDLT was not re-initialized at each call of compute() Gael Guennebaud 2014-10-20 10:48:40 +02:00
  • 8838b0a1ff Fix SparseQR::rank for a completely empty matrix. Gael Guennebaud 2014-10-19 22:42:20 +02:00
  • f786897e4b Added access to the underlying raw data of a tensor slice/chip whenever possible Benoit Steiner 2014-10-17 15:33:27 -07:00
  • 7acd38d19e Created some benchmarks for the tensor code Benoit Steiner 2014-10-17 09:49:03 -07:00
  • b50e5bc816 merge Gael Guennebaud 2014-10-17 16:53:18 +02:00
  • a370b1f2e2 Fix SparseLU::absDeterminant and add respective unit test Gael Guennebaud 2014-10-17 16:52:56 +02:00
  • a13bc22204 Ignore automatically imported lapack source files Gael Guennebaud 2014-10-17 15:34:39 +02:00
  • 4b7c3abbea Fix D&C SVD wrt zero matrices Gael Guennebaud 2014-10-17 15:32:55 +02:00
  • feacfa5f83 Fix JacobiSVD wrt under/overflow by doing scaling prior to QR preconditioning Gael Guennebaud 2014-10-17 15:32:06 +02:00
  • 8472e697ca Add lapack interface to JacobiSVD and BDCSVD Gael Guennebaud 2014-10-17 15:31:11 +02:00
  • 65af852b54 Silenced one last warning Benoit Steiner 2014-10-16 15:02:30 -07:00
  • ae697b471c Silenced a few compilation warnings. Generalized a TensorMap constructor. Benoit Steiner 2014-10-16 14:52:50 -07:00
  • 94e47798f4 Fixed the return types of unary and binary expressions to properly handle the case where it is different from the input type (e.g. abs(complex<float>)) Benoit Steiner 2014-10-16 10:41:07 -07:00
  • d853adffdb Avoid calling get_future() more than once on a given promise. Benoit Steiner 2014-10-16 10:10:04 -07:00
  • 880e72c130 quieted more g++ warnings of the form: warning: typedef XXX locally defined but not used [-Wunused-local-typedefs] Mark Borgerding 2014-10-16 09:19:32 -04:00
  • bfdd9f3ac9 Made the blocking computation aware of the L3 cache. Also optimized the blocking parameters to take into account the number of threads used for a computation. Benoit Steiner 2014-10-15 15:32:59 -07:00
  • c566cfe2ba Make SVD unit test even tougher Gael Guennebaud 2014-10-15 23:37:47 +02:00
  • dba55041ab Added support for promises. Started to improve multithreaded contractions. Benoit Steiner 2014-10-15 11:20:36 -07:00
  • fd1aaf4772 merge Gael Guennebaud 2014-10-15 16:33:14 +02:00
  • c806009453 Extend svd unit tests to stress problems with duplicated singular values. Gael Guennebaud 2014-10-15 16:32:16 +02:00
  • 2cc41dbe83 D&C SVD: fix some numerical issues by truly skipping deflated singular values when computing them Gael Guennebaud 2014-10-15 15:21:12 +02:00
  • c26e8a1af3 D&C SVD: fix deflation of repeated singular values, fix sorting of singular values, fix case of complete deflation Gael Guennebaud 2014-10-15 11:59:21 +02:00
  • 0ec1fc9e11 bug #891: Determine sizeof(void*) via CMAKE variable instead of test program Christoph Hertzberg 2014-10-14 14:14:25 +02:00
  • 99d75235a9 Misc improvements and cleanups Benoit Steiner 2014-10-13 17:02:09 -07:00
  • 4c70b0a762 Added support for patch extraction Benoit Steiner 2014-10-13 10:04:04 -07:00
  • d3f52debc6 Make cuda_basic test compile again by adding lots of EIGEN_DEVICE_FUNC. Although the test passes now, there might still be some missing. Christoph Hertzberg 2014-10-13 17:18:26 +02:00
  • 0219f8aed4 Added ability to print a tensor using an iostream. Benoit Steiner 2014-10-10 16:17:26 -07:00
  • 2ed1838aeb Added support for tensor chips Benoit Steiner 2014-10-10 16:11:27 -07:00
  • 4b36c3591f Fixed the tensor shuffling test Benoit Steiner 2014-10-10 15:43:21 -07:00
  • a991f94c0e Fixed the thread pool test Benoit Steiner 2014-10-10 15:20:37 -07:00
  • 498b7eed25 Rewrote the TensorBase::random method to support the generation of random numbers on GPU. Benoit Steiner 2014-10-09 15:39:13 -07:00
  • 767424af18 Improved the functors defined for standard reductions. Added a functor to encapsulate the generation of random numbers on CPU and GPU. Benoit Steiner 2014-10-09 15:36:23 -07:00
  • a80e17cfe8 Remove unused and dangerous CompressedStorage::Map function Gael Guennebaud 2014-10-09 23:42:33 +02:00
  • 349c2c9235 bug #367: fix double copies in atWithInsertion, and add respective unit-test Gael Guennebaud 2014-10-09 23:35:49 +02:00
  • 48d537f59f Fix indentation Gael Guennebaud 2014-10-09 23:35:26 +02:00
  • 538c059aa4 bug #887: fix CompressedStorage::reallocate wrt memory leaks Gael Guennebaud 2014-10-09 23:35:05 +02:00
  • a48b82eece Add a scoped_array helper class to handle locally allocated/used arrays Gael Guennebaud 2014-10-09 23:34:05 +02:00
  • ccd70ba123 Various numerical fixes in D&C SVD: I cannot make it fail with double, but still need to tune for single precision, and carefully test with duplicated singular values Gael Guennebaud 2014-10-09 23:29:01 +02:00
  • 44beee9d68 Removed dead code Benoit Steiner 2014-10-08 14:14:20 -07:00
  • 0a07ac574e Added support for the *= and /= operators to TensorBase Benoit Steiner 2014-10-08 13:32:41 -07:00
  • 6c047d398d Fixed a comment Benoit Steiner 2014-10-08 13:29:36 -07:00
  • 4b886e6b39 bug #889: fix protected typedef Gael Guennebaud 2014-10-08 07:48:30 +02:00
  • 5741349294 bug #882: fix various const-correctness issues with *View classes. Gael Guennebaud 2014-10-07 18:29:28 +02:00
  • 118b1113d9 Workaround MSVC issue. Gael Guennebaud 2014-10-07 09:53:39 +02:00
  • 503c176d8e Fix missing outer() member in DynamicSparseMatrix Gael Guennebaud 2014-10-07 09:53:27 +02:00
  • dbdd8b0883 D&C SVD: add scaling to avoid overflow, fix handling of fixed size matrices Gael Guennebaud 2014-10-06 19:35:57 +02:00
  • d44d432baa Re-enable products with triangular views of sparse matrices: we simply have to treat them as a sparse matrix. Gael Guennebaud 2014-10-06 16:11:26 +02:00
  • 893bfcf95f bug #887: use ei_declare_aligned_stack_constructed_variable instead of manual new[]/delete[] pairs in AMD and Parallelizer Gael Guennebaud 2014-10-06 11:54:30 +02:00
  • fb53ff1eda Fix SparseLU regarding uncompressed inputs and avoid manual new/delete calls. Gael Guennebaud 2014-10-06 11:42:31 +02:00
  • 7a17639953 Extend unit tests to check uncompressed sparse inputs in sparse solvers Gael Guennebaud 2014-10-06 11:41:50 +02:00
  • bbce6fa65d define EIGEN_VECTORIZE_CUDA when compiling with nvcc Benoit Steiner 2014-10-03 19:55:35 -07:00
  • 95a430a2ca Vector primitives for CUDA Benoit Steiner 2014-10-03 19:45:19 -07:00
  • 152f3218ac Improved contraction test Benoit Steiner 2014-10-03 19:33:44 -07:00
  • af2e5995e2 Improved support for CUDA devices. Improved contractions on GPU Benoit Steiner 2014-10-03 19:18:07 -07:00
  • 1269392822 Created the IndexPair type to store pair of tensor indices. CUDA doesn't support std::pair so we can't use them when targeting GPUs. Improved the performance on tensor contractions Benoit Steiner 2014-10-03 10:16:59 -07:00
  • b7271dffb5 Generalized the gebp apis Benoit Steiner 2014-10-02 16:51:57 -07:00
  • 8b2afe33a1 Fixes for the forced evaluation of tensor expressions. More tests. Benoit Steiner 2014-10-02 10:39:36 -07:00
  • 5cc23199be More tests to validate the const-correctness of the tensor code. Benoit Steiner 2014-10-02 10:30:44 -07:00
  • 7caaf6453b Added support for tensor reductions and concatenations Benoit Steiner 2014-10-01 20:38:22 -07:00
  • 1c236f4c9a Added tests for tensors of const values and tensors of strings Benoit Steiner 2014-10-01 20:21:42 -07:00
  • 1fa6fe2abd template keyword not allowed before non-template function call Christoph Hertzberg 2014-10-01 14:33:55 +02:00
  • 9d3c69952b fixed to make big-endian VSX work as well Konstantinos Margaritis 2014-10-01 09:43:56 +00:00
  • 5180bb5e47 Add missing default ctor in Rotation2D Gael Guennebaud 2014-09-30 16:59:28 +02:00
  • 0187504912 Avoid `unneeded-internal-declaration' warning Christoph Hertzberg 2014-09-30 16:43:52 +02:00
  • 6d26deb894 Missing outerStride in AlignedVector3 resulted in infinite recursion Christoph Hertzberg 2014-09-30 16:43:19 +02:00
  • 81517eebc1 Missing explicit Christoph Hertzberg 2014-09-30 16:42:04 +02:00
  • 12d59465cb bug #884: Copy constructor of Ref shall never malloc, constructing from other RefBase shall only malloc if the memory layout is incompatible. Christoph Hertzberg 2014-09-30 14:57:54 +02:00
  • e404841235 make sure that regex does not match cmake Christoph Hertzberg 2014-09-29 19:28:10 +00:00
  • 15c946338f Related to bug #880: Accept make as well as gmake when searching the MakeCommand. And don't include \n in match expression Christoph Hertzberg 2014-09-29 19:20:01 +02:00
  • 56a0bbbbee Fix compilation with GCC Gael Guennebaud 2014-09-29 18:28:18 +02:00
  • 842e31cf5c Let KroneckerProduct exploit the recently introduced generic InnerIterator class. Gael Guennebaud 2014-09-29 13:37:49 +02:00
  • abd3502e9e Introduce a generic InnerIterator class compatible with evaluators. Gael Guennebaud 2014-09-29 13:36:57 +02:00
  • 76c3cf6949 Re-enable -Wshorten-64-to-32 compilation flag. Gael Guennebaud 2014-09-29 10:33:16 +02:00