Antonio Sanchez
1ec1b16d36
Add buildtests_gpu and check_gpu to simplify GPU testing.
This is in preparation for adding GPU tests to the CI, allowing
us to limit building/testing of GPU-specific tests for a given
GPU-capable runner.
GPU tests are tagged with the label "gpu". The new targets
```
make buildtests_gpu
make check_gpu
```
allow building and running only the GPU tests.
(cherry picked from commit 16f9a20a6f )
2023-07-10 15:30:41 -07:00
Antonio Sánchez
0f39c851a5
Fix use of arg function in CUDA.
(cherry picked from commit 63dcb429cd )
2023-07-10 15:30:41 -07:00
Kevin Leonardic
daa0b70a65
Fix argument for _mm256_cvtps_ph imm parameter
(cherry picked from commit d4b05454a7 )
2023-07-10 15:30:41 -07:00
Antonio Sánchez
33ba98b641
Ensure EIGEN_HAS_ARM64_FP16_VECTOR_ARITHMETIC is always defined on arm.
(cherry picked from commit 31cd2ad371 )
2023-07-10 15:30:41 -07:00
Antonio Sánchez
e6e921f0e3
Disable FP16 arithmetic for arm32.
(cherry picked from commit 7465b7651e )
2023-07-10 15:30:41 -07:00
Alexander Shaposhnikov
ebfdd6bdea
Do not set EIGEN_HAS_ARM64_FP16_SCALAR_ARITHMETIC for cuda compilation
(cherry picked from commit 316eab8deb )
2023-07-10 15:30:41 -07:00
Alejandro Acosta
357bb11066
Replace usage of CudaStreamDevice with GpuStreamDevice in tensor benchmarks GPU
(cherry picked from commit 07e4604b19 )
2023-07-10 15:30:40 -07:00
Rasmus Munk Larsen
9b3d104c02
Add missing braces in Umeyama.h
(cherry picked from commit 1321821e86 )
2023-07-10 14:52:08 -07:00
Rasmus Munk Larsen
af3ca50f0b
Work around compiler bug in Umeyama.h.
(cherry picked from commit 524c329ab2 )
2023-07-10 14:52:08 -07:00
Antonio Sánchez
26b8fabd80
Return NaN in ndtri for values outside valid input range.
(cherry picked from commit 1f79a6078f )
2023-07-10 14:52:08 -07:00
Charles Schlosser
385a0b38f8
JacobiSVD: set m_nonzeroSingularValues to zero if not finite
(cherry picked from commit fdc749de2a )
2023-07-10 14:52:08 -07:00
Antonio Sanchez
a4ecfd8ead
Fix boolean bitwise and warning.
(cherry picked from commit 70410310a4 )
2023-07-10 14:52:08 -07:00
Rasmus Munk Larsen
f296720d7d
Make sure we return +/-1 above the clamping point for Erf().
(cherry picked from commit b378014fef )
2023-07-10 14:52:08 -07:00
Rob Conde
f04d02dbf6
exclude Eigen/Core and Eigen/src/Core from being ignored due to core ignore rule
(cherry picked from commit 990a282fc4 )
2023-07-10 14:52:08 -07:00
Rohit Goswami
6f9bffe8dd
DOC: Update documentation for 3.4.x
(cherry picked from commit b0eded878d )
2023-07-10 14:52:08 -07:00
Rasmus Munk Larsen
d4c24eca96
Don't crash on empty tensor contraction.
(cherry picked from commit b0f877f8e0 )
2023-07-10 14:52:08 -07:00
Antonio Sánchez
72b0759451
Fix arm builds.
(cherry picked from commit 2c8011c2dd )
2023-07-10 14:52:08 -07:00
Jonas Schulze
34d0d83278
Fix some typos
(cherry picked from commit 81cb6a51d0 )
2023-07-10 14:52:08 -07:00
Antonio Sánchez
63e8b31c94
Fix parsing of command-line arguments when already specified as a cmake list.
(cherry picked from commit 555cec17ed )
2023-07-10 14:52:08 -07:00
Antonio Sánchez
99473f255b
Fix failing MSVC tests due to compiler bugs.
(cherry picked from commit 394aabb0a3 )
2023-07-10 14:52:08 -07:00
Antonio Sánchez
2ce5dc428f
Guard use of long double on GPU device.
(cherry picked from commit bc5cdc7a67 )
2023-07-10 14:52:08 -07:00
Chip Kerchner
8f1b6198c2
Fix epsilon and dummy_precision values in long double for double doubles. Prevented some algorithms from converging on PPC.
(cherry picked from commit 54459214a1 )
2023-07-10 14:52:08 -07:00
Antonio Sánchez
dae8c6d7ad
Guard complex sqrt on old MSVC compilers.
(cherry picked from commit a16fb889dd )
2023-07-10 14:52:07 -07:00
Antonio Sánchez
2dfdaa2abf
More NEON packetmath fixes.
(cherry picked from commit 384269937f )
2023-07-10 14:52:03 -07:00
Antonio Sánchez
a659b5dbb2
Fix NEON make_packet2f.
(cherry picked from commit 2dfbf1b251 )
2023-07-10 14:34:09 -07:00
Antonio Sánchez
879854382c
Fix MSVC arm build.
(cherry picked from commit 0a5392d606 )
2023-07-10 14:34:09 -07:00
Jeremy Nimmer
90dce8dfa3
Fix undefined behavior in Block access
(cherry picked from commit a1cdcdb038 )
2023-07-10 14:34:09 -07:00
Martin Burchell
b26ada1e03
Fix error: unused parameter 'tmp' [-Werror,-Wunused-parameter] on clang/32-bit arm
(cherry picked from commit c54785b071 )
2023-07-10 14:34:09 -07:00
Antonio Sánchez
f5593b4baa
Fix reshape strides when input has non-zero inner stride.
(cherry picked from commit 2260e11eb0 )
2023-07-10 14:34:09 -07:00
Alexandre Hoffmann
3eb0c8b69e
Changing BiCGSTAB parameters initialization so that it works with custom types
(cherry picked from commit 23524ab6fc )
2023-07-10 14:34:09 -07:00
Antonio Sánchez
26adb0e5af
Fix sparseLU solver when destination has a non-unit stride.
(cherry picked from commit ab2b26fbc2 )
2023-07-10 14:34:09 -07:00
Antonio Sánchez
5547205092
Correct pnegate for floating-point zero.
(cherry picked from commit 8588d8c74b )
2023-07-10 14:34:04 -07:00
Antonio Sánchez
771e91860b
Fix typo in CholmodSupport
(cherry picked from commit 7dc6db75d4 )
2023-07-10 12:26:39 -07:00
Antonio Sánchez
4786edba26
Fix pragma check for disabling fastmath.
(cherry picked from commit c27d1abe46 )
2023-07-10 10:09:09 -07:00
Antonio Sánchez
15e23ab849
Explicitly state that indices must be sorted.
(cherry picked from commit bf48d46338 )
2023-07-10 10:09:09 -07:00
Laurent Rineau
af6e7cc66a
Eigen/Sparse: fix warnings -Wunused-but-set-variable
(cherry picked from commit 7846c7387c )
2023-07-10 10:09:09 -07:00
Rasmus Munk Larsen
3fbb1c1b48
Guard GCC-specific pragmas with "#ifdef EIGEN_COMP_GNUC"
(cherry picked from commit 5ceed0d57f )
2023-07-10 10:09:09 -07:00
Antonio Sánchez
28cd280726
Fix 4x4 inverse when compiling with -Ofast.
(cherry picked from commit 7d6a9925cc )
2023-07-10 10:09:09 -07:00
Antonio Sánchez
8cc3ec8e47
Fix realloc for non-trivial types.
(cherry picked from commit 311ba66f7c )
2023-07-10 10:09:02 -07:00
Gilles Aouizerate
d641062a05
fix typo in doc/TutorialSparse.dox
(cherry picked from commit 6e83e906c2 )
2023-07-07 15:21:18 -07:00
Michael Palomas
1000cf9fbc
fixed msvc compilation error in GeneralizedEigenSolver.h
(cherry picked from commit 525f066671 )
2023-07-07 15:21:18 -07:00
Antonio Sánchez
fd2817e3d6
Add asserts for index-out-of-bounds in IndexedView.
(cherry picked from commit f241a2c18a )
2023-07-07 15:21:18 -07:00
Antonio Sánchez
11dacc4802
Fix some cmake issues.
(cherry picked from commit f5364331eb )
2023-07-07 15:21:18 -07:00
Antonio Sánchez
ab6f39e1e3
Fix mixingtypes tests.
(cherry picked from commit d816044b6e )
2023-07-07 15:21:18 -07:00
Gilles Aouizerate
6576ee4fb1
2 typos fix in the 3rd table.
(cherry picked from commit 94cc83faa1 )
2023-07-07 15:21:18 -07:00
Arthur
68f35d76b8
Fix GeneralizedEigenSolver::info() and Asserts
(cherry picked from commit a7c1cac18b )
2023-07-07 15:21:18 -07:00
Matthew Sterrett
d0e2b3e58d
Removed unnecessary checks for FP16C
(cherry picked from commit 39fcc89798 )
2023-07-07 15:21:17 -07:00
Antonio Sánchez
669dc8fadf
Eliminate bool bitwise warnings.
(cherry picked from commit b8e93bf589 )
2023-07-07 15:21:17 -07:00
Lexi Bromfield
33a602eb37
Don't double-define Half functions on aarch64
(cherry picked from commit 66ea0c09fd )
2023-07-07 15:21:17 -07:00
Rasmus Munk Larsen
a9490cd3c5
Fix code and unit test for a few corner cases in vectorized pow()
(cherry picked from commit 7a87ed1b6a )
2023-07-07 15:21:17 -07:00