Commit Graph

  • 8649e16c2a Enable EIGEN_HAS_C99_MATH when building with the latest version of Visual Studio Benoit Steiner 2016-11-18 14:18:34 -08:00
  • 110b7f8d9f Deleted unnecessary semicolons Benoit Steiner 2016-11-18 14:06:17 -08:00
  • b5e3285e16 Test broadcasting on OpenCL devices with 64 bit indexing Benoit Steiner 2016-11-18 13:44:20 -08:00
  • 164414c563 Merged in ChunW/eigen (pull request PR-252) Gael Guennebaud 2016-11-18 21:07:29 +00:00
  • 37c2c516a6 Cleaned up the sycl device code Benoit Steiner 2016-11-18 12:38:06 -08:00
  • 7335c49204 Fixed the cxx11_tensor_device_sycl test Benoit Steiner 2016-11-18 12:37:13 -08:00
  • 15e226d7d3 adding Benoit changes on the TensorDeviceSycl.h Mehdi Goli 2016-11-18 16:34:54 +00:00
  • 622805a0c5 Modifying TensorDeviceSycl.h to always create buffer of type uint8_t and convert them to the actual type at the execution on the device; adding the queue interface class to separate the lifespan of sycl queue and buffers, created for that queue, from Eigen::SyclDevice; modifying sycl tests to support the evaluation of the results for both row major and column major data layout on all different devices that are supported by Sycl{CPU; GPU; and Host}. Mehdi Goli 2016-11-18 16:20:42 +00:00
  • 5159675c33 Added isnan, isfinite and isinf for SYCL device. Plus test for that. Luke Iwanski 2016-11-18 16:01:48 +00:00
  • 76b2a3e6e7 Allow to construct EulerAngles from 3D vector directly. Using assignment template struct to distinguish between 3D vector and 3D rotation matrix. Tal Hadad 2016-11-18 15:01:06 +02:00
  • 927bd62d2a Now testing out (+=, =) in.FUNC() and out (+=, =) out.FUNC() Luke Iwanski 2016-11-18 11:16:42 +00:00
  • 8193ffb3d3 bug #1343: fix compilation regression in mat+=selfadjoint_view. Generic EigenBase2EigenBase assignment was incomplete. Gael Guennebaud 2016-11-18 10:17:34 +01:00
  • cebff7e3a2 bug #1343: fix compilation regression in array = matrix_product Gael Guennebaud 2016-11-18 10:09:33 +01:00
  • 7c30078b9f Merged eigen/eigen into default Benoit Steiner 2016-11-17 22:53:37 -08:00
  • 553f50b246 Added a way to detect errors generated by the opencl device from the host Benoit Steiner 2016-11-17 21:51:48 -08:00
  • 72a45d32e9 Cleanup Benoit Steiner 2016-11-17 21:29:15 -08:00
  • 4349fc640e Created a test to check that the sycl runtime can successfully report errors (like division by 0). Small cleanup Benoit Steiner 2016-11-17 20:27:54 -08:00
  • a6a3fd0703 Made TensorDeviceCuda.h compile on windows Benoit Steiner 2016-11-17 16:15:27 -08:00
  • 441760f239 Workaround for error in VS2012 with /clr (grafted from 0d0948c3b9 ) Chun Wang 2016-11-17 17:54:27 -05:00
  • 664162fb8a Fix compilation issue in mat = permutation (regression introduced in 8193ffb3d3 ) (grafted from 465ede0f20 ) Gael Guennebaud 2016-11-20 09:41:37 +01:00
  • aa3c761002 bug #1343: fix compilation regression in mat+=selfadjoint_view. Generic EigenBase2EigenBase assignment was incomplete. (grafted from 8193ffb3d3 ) Gael Guennebaud 2016-11-18 10:17:34 +01:00
  • 94f2cfc9c7 bug #1343: fix compilation regression in array = matrix_product (grafted from cebff7e3a2 ) Gael Guennebaud 2016-11-18 10:09:33 +01:00
  • 0d0948c3b9 Workaround for error in VS2012 with /clr Chun Wang 2016-11-17 17:54:27 -05:00
  • 004344cf54 Avoid calling log(0) or 1/0 Benoit Steiner 2016-11-17 11:56:44 -08:00
  • 4a13d79df6 replace sizeof(Packet) with PacketSize else it breaks for ZVector.Packet4f (grafted from a1d5c503fa ) Konstantinos Margaritis 2016-11-17 13:27:45 -05:00
  • a1d5c503fa replace sizeof(Packet) with PacketSize else it breaks for ZVector.Packet4f Konstantinos Margaritis 2016-11-17 13:27:45 -05:00
  • 463176cc44 implement float/std::complex<float> for ZVector as well, minor fixes to ZVector (grafted from 672aa97d4d ) Konstantinos Margaritis 2016-11-17 13:27:33 -05:00
  • 672aa97d4d implement float/std::complex<float> for ZVector as well, minor fixes to ZVector Konstantinos Margaritis 2016-11-17 13:27:33 -05:00
  • 8290e21fb5 replace sizeof(Packet) with PacketSize else it breaks for ZVector.Packet4f Konstantinos Margaritis 2016-11-17 13:21:15 -05:00
  • 7878756dea Fixed existing test. Luke Iwanski 2016-11-17 17:46:55 +00:00
  • c5130dedbe Specialised basic math functions for SYCL device. Luke Iwanski 2016-11-17 11:47:13 +00:00
  • f2e8b73256 Enable the use of AVX512 instruction by default Benoit Steiner 2016-11-16 21:28:04 -08:00
  • 7b09e4dd8c bump default branch to 3.3.90 Gael Guennebaud 2016-11-16 22:20:58 +01:00
  • dff9a049c4 Optimized the computation of exp, sqrt, ceil and floor for fp16 on Pascal GPUs Benoit Steiner 2016-11-16 09:01:51 -08:00
  • b5c75351e3 Merged eigen/eigen into default Benoit Steiner 2016-11-14 15:54:44 -08:00
  • 32df1b1046 Reduce dispatch overhead in parallelFor by only calling thread_pool.Schedule() for one of the two recursive calls in handleRange. This avoids going through the schedule path to push both recursive calls onto another thread-queue in the binary tree, but instead executes one of them on the main thread. At the leaf level this will still activate a full complement of threads, but will save up to 50% of the overhead in Schedule (random number generation, insertion in queue which includes signaling via atomics). Rasmus Munk Larsen 2016-11-14 14:18:16 -08:00
  • 05e8c2a1d9 Adding extra test for non-fixed size to broadcast; Replacing stcl with sycl. Mehdi Goli 2016-11-14 18:13:53 +00:00
  • f8ca893976 Adding TensorFixsize; adding sycl device memcpy; adding initial stage of slicing. Mehdi Goli 2016-11-14 17:51:57 +00:00
  • 5aab97fba6 Optimize sparse<bool> && sparse<bool> to use the same path as for coeff-wise products. (grafted from 0ee92aa38e ) Gael Guennebaud 2016-11-14 18:47:41 +01:00
  • 0ee92aa38e Optimize sparse<bool> && sparse<bool> to use the same path as for coeff-wise products. Gael Guennebaud 2016-11-14 18:47:41 +01:00
  • 89abc6806d bug #426: move operator && and || to MatrixBase and SparseMatrixBase. (grafted from 2e334f5da0 ) Gael Guennebaud 2016-11-14 18:47:02 +01:00
  • 2e334f5da0 bug #426: move operator && and || to MatrixBase and SparseMatrixBase. Gael Guennebaud 2016-11-14 18:47:02 +01:00
  • a048aba14c Merged in olesalscheider/eigen (pull request PR-248) Gael Guennebaud 2016-11-14 13:25:53 +00:00
  • eedb87f4ba Fix regression in SparseMatrix::ReverseInnerIterator Gael Guennebaud 2016-11-14 14:05:53 +01:00
  • baf793ebaa Make sure not to call numext::maxi on expression templates (grafted from 51fef87408 ) Niels Ole Salscheider 2016-11-12 12:20:57 +01:00
  • b4ddafcfac Fix regression in SparseMatrix::ReverseInnerIterator (grafted from eedb87f4ba ) Gael Guennebaud 2016-11-14 14:05:53 +01:00
  • 51fef87408 Make sure not to call numext::maxi on expression templates Niels Ole Salscheider 2016-11-12 12:20:57 +01:00
  • a5c3f15682 Adding comment to TensorDeviceSycl.h and cleaning the code. Mehdi Goli 2016-11-11 19:06:34 +00:00
  • f4722aa479 Merged in benoitsteiner/opencl (pull request PR-247) Benoit Steiner 2016-11-11 00:01:28 +00:00
  • 3be3963021 Adding EIGEN_STRONG_INLINE back; using size() instead of dimensions.TotalSize() on Tensor. Mehdi Goli 2016-11-10 19:16:31 +00:00
  • 12387abad5 adding the missing in eigen_assert! Mehdi Goli 2016-11-10 18:58:08 +00:00
  • 2e704d4257 Adding Memset; optimising memcpyDeviceToHost by removing double copying; Mehdi Goli 2016-11-10 18:45:12 +00:00
  • 1079967710 Added tag 3.3.0 for changeset eeac81b8c0 Gael Guennebaud 2016-11-10 13:57:29 +01:00
  • eeac81b8c0 bump to 3.3.0 3.3.0 Gael Guennebaud 2016-11-10 13:55:14 +01:00
  • e80bc2ddb0 Fix printing of sparse expressions Gael Guennebaud 2016-11-10 10:35:32 +01:00
  • 75c080b176 Added a test to validate memory transfers between host and sycl device Benoit Steiner 2016-11-09 06:23:42 -08:00
  • db3903498d Merged in benoitsteiner/opencl (pull request PR-246) Benoit Steiner 2016-11-08 22:28:44 +00:00
  • dcc14bee64 Fixed the formatting of the code Benoit Steiner 2016-11-08 14:24:46 -08:00
  • b88c1117d4 Fixed the indentation of the cmake file Benoit Steiner 2016-11-08 14:22:36 -08:00
  • 912cb3d660 #if EIGEN_EXCEPTION -> #ifdef EIGEN_EXCEPTIONS. Luke Iwanski 2016-11-08 22:01:14 +00:00
  • 1b345b0895 Fix for SYCL queue initialisation. Luke Iwanski 2016-11-08 21:56:31 +00:00
  • 1b95717358 Use try/catch only when exceptions are enabled. Luke Iwanski 2016-11-08 21:08:53 +00:00
  • d57430dd73 Converting all sycl buffers to uninitialised device only buffers; adding memcpyHostToDevice and memcpyDeviceToHost on syclDevice; modifying all examples to obey the new rules; moving sycl queue creating to the device based on Benoit suggestion; removing the sycl specific condition for returning m_result in TensorReduction.h according to Benoit suggestion. Mehdi Goli 2016-11-08 17:08:02 +00:00
  • 73985ead27 Extend unit test to check sparse solvers with a SparseVector as the rhs and result. Gael Guennebaud 2016-11-06 20:29:57 +01:00
  • 436a111792 Generalize Cholmod support to handle any sparse type as the rhs and result of the solve method Gael Guennebaud 2016-11-06 20:29:23 +01:00
  • afc55b1885 Generalize IterativeSolverBase::solve to handle any sparse type as the results (instead of SparseMatrix only) Gael Guennebaud 2016-11-06 20:28:18 +01:00
  • a5c2d8a3cc Generalize solve_sparse_through_dense_panels to handle SparseVector. Gael Guennebaud 2016-11-06 15:20:58 +01:00
  • f8bfe10613 Add missing friend declaration Gael Guennebaud 2016-11-06 15:20:30 +01:00
  • fc7180cda8 Add a default ctor to evaluator<SparseVector>. Needed for evaluator<Solve>. Gael Guennebaud 2016-11-06 15:20:00 +01:00
  • 4d226ab5b5 Enable swapping between SparseMatrix and SparseVector Gael Guennebaud 2016-11-06 15:15:03 +01:00
  • ad086b03e4 Removed unnecessary statement Benoit Steiner 2016-11-05 12:43:27 -07:00
  • dad177be01 Added missing includes Benoit Steiner 2016-11-05 10:04:42 -07:00
  • 55b4fd1d40 Extend mpreal unit test to check LLT with complexes. Gael Guennebaud 2016-11-05 11:28:53 +01:00
  • a354c3ca59 Fix compilation of LLT with complex<mpreal>. Gael Guennebaud 2016-11-05 11:28:29 +01:00
  • d46a36cc84 Merged eigen/eigen into default Benoit Steiner 2016-11-04 18:22:55 -07:00
  • 0ebe3808ca Removed the sycl include from Eigen/Core and moved it to Unsupported/Eigen/CXX11/Tensor; added TensorReduction for sycl (full reduction and partial reduction); added TensorReduction test case for sycl (full reduction and partial reduction); fixed the tile size on TensorSyclRun.h based on the device max work group size; Mehdi Goli 2016-11-04 18:18:19 +00:00
  • 47d1b4a609 Added tag 3.3-rc2 for changeset ba05572dcb Gael Guennebaud 2016-11-04 09:09:18 +01:00
  • ba05572dcb bump to 3.3-rc2 3.3-rc2 Gael Guennebaud 2016-11-04 09:09:06 +01:00
  • 5c3995769c Improved AVX512 configuration Benoit Steiner 2016-11-03 04:50:28 -07:00
  • fbe672d599 Reenable the generation of dynamic blas libraries. Benoit Steiner 2016-11-03 04:08:43 -07:00
  • ca0ba0d9a4 Improved AVX512 support Benoit Steiner 2016-11-03 04:00:49 -07:00
  • c80587c92b Merged eigen/eigen into default Benoit Steiner 2016-11-03 03:55:11 -07:00
  • 3f1d0cdc22 bug #1337: improve doc of homogeneous() and hnormalized() Gael Guennebaud 2016-11-03 11:03:08 +01:00
  • 18038fc829 bug #1330: Cholmod supports double precision only, so let's trigger a static assertion if the scalar type does not match this requirement. Gael Guennebaud 2016-11-03 10:21:59 +01:00
  • 78e93ac1ad bug #1330: Cholmod supports double precision only, so let's trigger a static assertion if the scalar type does not match this requirement. Gael Guennebaud 2016-11-03 10:21:59 +01:00
  • 3e37166d0b Merged in benoitsteiner/opencl (pull request PR-244) Benoit Steiner 2016-11-02 22:01:03 +00:00
  • 0585b2965d Disable vectorization on device only when compiling for sycl Benoit Steiner 2016-11-02 11:44:27 -07:00
  • e6e77ed08b Don't call lgamma_r when compiling for an Apple device, since the function isn't available on MacOS Benoit Steiner 2016-11-02 09:55:39 -07:00
  • b238f387b4 Pulled latest updates from trunk Benoit Steiner 2016-11-02 08:53:13 -07:00
  • c8db17301e Special functions require math.h: make sure it is included. Benoit Steiner 2016-11-02 08:51:52 -07:00
  • a07bb428df bug #1004: improve accuracy of LinSpaced for abs(low) >> abs(high). Gael Guennebaud 2016-11-02 11:34:38 +01:00
  • 598de8b193 Add pinsertfirst function and implement pinsertlast for complex on SSE/AVX. Gael Guennebaud 2016-11-02 10:38:13 +01:00
  • e44519744e Merged in benoitsteiner/opencl (pull request PR-243) Benoit Steiner 2016-11-02 02:56:58 +00:00
  • 0a6ae41555 Merged eigen/eigen into default Rasmus Munk Larsen 2016-11-01 15:37:00 -07:00
  • b730952414 Don't attempt to use lgamma_r for CUDA devices. Fix type in lgamma_impl<double>. Rasmus Munk Larsen 2016-11-01 15:34:19 -07:00
  • 7a0e96b80d Gate the code that refers to cuda fp16 primitives more thoroughly Benoit Steiner 2016-11-01 12:08:09 -07:00
  • 51af6ae971 Fixed the ambiguity in calling make_tuple for sycl backend. Mehdi Goli 2016-10-31 16:35:51 +00:00
  • 0a9ad6fc72 Worked around Visual Studio compilation errors Benoit Steiner 2016-10-28 07:54:27 -07:00
  • d5f88e2357 Sharded the tensor_image_patch test to help it run on low power devices Benoit Steiner 2016-10-27 21:48:21 -07:00
  • 0b4b0f11e8 Fixed a few more compilation warnings Benoit Steiner 2016-10-28 04:01:01 +00:00