Antonio Sanchez
42b04a08c4
Fix preshear transformation.
...
Fixes #2777 . The `preshear` function seems to have always used an invalid constructor
internally, and has been broken for a while. Fixed the implementation and added a test.
(cherry picked from commit 45da84e21570bf70238cf489ad862b2f09242c5f)
2024-01-29 12:30:06 -08:00
Rasmus Munk Larsen
b86ac5f1e7
Use padd instead of +.
...
(cherry picked from commit bbfc4d54cd )
2024-01-29 10:50:55 -08:00
Rasmus Munk Larsen
380a9483e0
Implement a generic vectorized version of Smith's algorithms for complex division.
...
(cherry picked from commit 9312a5bf5c )
2024-01-26 20:42:52 -08:00
Charles Schlosser
25270e35db
Fix compiler warnings in 3.4
2023-12-21 00:57:21 +00:00
Antonio Sanchez
ebf968b272
Remove c++11 from ctz/clz
2023-12-20 14:18:48 -08:00
Charles Schlosser
bd57b99f44
fix msvc clz
...
(cherry picked from commit 2c4541f735 )
2023-12-14 13:38:18 -08:00
Antonio Sánchez
b8f894947a
Add internal ctz/clz implementation.
...
(cherry picked from commit 75e273afcc )
2023-12-14 13:37:03 -08:00
Antonio Sanchez
4be2870267
Only apply ASM work-around for min/max on GNUC strict.
...
Fixes #2742 .
2023-11-27 10:08:18 -08:00
Charles Schlosser
f7085a1096
replace using with typedef
2023-11-24 19:42:54 +00:00
Charles Schlosser
63291e34bf
Update file GeneralMatrixVector.h
...
(cherry picked from commit 283dec7f25 )
2023-11-24 19:07:33 +00:00
Charles Schlosser
23886fd7db
Gemv microoptimization
...
(cherry picked from commit d1b03fb5c9 )
2023-11-24 19:07:17 +00:00
Charles Schlosser
7c6020e424
Fix -Waggressive-loop-optimizations
...
(cherry picked from commit 4e9e493b4a )
2023-11-24 19:06:40 +00:00
arthurfeeney
2e3f1d8044
Fix implicit conversion warning in GEBP kernel's packing
...
(cherry picked from commit 937c3d73cb )
2023-11-18 18:17:21 +00:00
Silvio Traversaro
fc5575264f
Backport "disambiguate overloads for empty index list" to 3.4 branch
2023-11-10 04:03:11 +00:00
Antonio Sanchez
bae907b8f6
Update version to 3.4.1
...
Tests all pass: https://gitlab.com/libeigen/eigen_ci_cross_testing/-/pipelines/1060764169
2023-11-06 13:53:54 -08:00
Charles Schlosser
cf207eacd5
Patch SparseLU
...
(cherry picked from commit a8bab0d8ae )
2023-11-02 21:17:17 -07:00
Chip Kerchner
e734787bb7
Fix pre-POWER8_VECTOR bugs in pcmp_lt and pnegate and reactivate psqrt.
...
(cherry picked from commit 4a58f30aa0 )
2023-10-25 15:19:37 -07:00
Antonio Sanchez
1217390db4
Fix windows+CUDA builds
2023-10-25 20:55:59 +00:00
Antonio Sanchez
7176ae1623
Make 3.4.1 compatible with c++03
2023-10-16 15:38:25 -07:00
Antonio Sánchez
0db5928f00
Eliminate use of _res.
...
(cherry picked from commit 5bdf58b8df )
2023-10-16 13:38:17 -07:00
Erik Schultheis
764b132a79
ensure that eigen::internal::size is not found by ADL, rename to ssize and...
...
(cherry picked from commit 9210e71fb3 )
2023-08-24 12:42:34 -07:00
Fabian Keßler
d0bfdc1658
optimize cmake scripts for subproject use
...
(cherry picked from commit 19cacd3ecb )
2023-07-26 12:01:28 -07:00
Antonio Sánchez
75ebef26b6
Adds new CMake Options for controlling build components.
...
(cherry picked from commit cf82186416 )
2023-07-26 11:52:47 -07:00
Charles Schlosser
208e44c979
fix warnings in tensorreduction and memory
2023-07-19 16:48:07 +00:00
Antonio Sánchez
17d57fb168
Fix up PowerPC MMA flags so it builds by default.
...
(cherry picked from commit 591906477b )
2023-07-11 16:27:32 -07:00
Antonio Sánchez
6973687c70
Fix up PowerPC MMA flags so it builds by default.
...
(cherry picked from commit 65eeedf964 )
2023-07-11 16:20:57 -07:00
Antonio Sanchez
ac561cd038
Reduce tensor_contract_gpu test.
...
The original test times out after 60 minutes on Windows, even when
setting flags to optimize for speed. Reducing the number of
contractions performed from 3600->27 for subtests 8,9 allow the
two to run in just over a minute each.
(cherry picked from commit be9e7d205f )
2023-07-11 11:27:31 -07:00
Antonio Sanchez
554982beef
Disable Tree reduction for GPU.
...
For moderately sized inputs, running the Tree reduction quickly
fills/overflows the GPU thread stack space, leading to memory errors.
This was happening in the `cxx11_tensor_complex_gpu` test, for example.
Disabling tree reduction on GPU fixes this.
(cherry picked from commit 24ebb37f38 )
2023-07-10 16:09:30 -07:00
Antonio Sanchez
89a71f3126
Fix gpu special function tests.
...
Some checks used incorrect values, partly from copy-paste errors,
partly from the change in behaviour introduced in !398 .
Modified results to match scipy, simplified tests by updating
`VERIFY_IS_CWISE_APPROX` to work for scalars.
(cherry picked from commit 701f5d1c91 )
2023-07-10 15:57:08 -07:00
Antonio Sanchez
a605d6b996
Rename EIGEN_CUDA_FLAGS to EIGEN_CUDA_CXX_FLAGS
...
Also add a missing space for clang.
(cherry picked from commit 846d34384a )
2023-07-10 15:30:41 -07:00
Antonio Sanchez
dfcd6de20a
Clean up CUDA CMake files.
...
- Unify test/CMakeLists.txt and unsupported/test/CMakeLists.txt
- Added `EIGEN_CUDA_FLAGS` that are appended to the set of flags passed
to the cuda compiler (nvcc or clang).
The latter is to support passing custom flags (e.g. `-arch=` to nvcc,
or to disable cuda-specific warnings).
(cherry picked from commit 7b00e8b186 )
2023-07-10 15:30:41 -07:00
Antonio Sanchez
1ec1b16d36
Add buildtests_gpu and check_gpu to simplify GPU testing.
...
This is in preparation of adding GPU tests to the CI, allowing
us to limit building/testing of GPU-specific tests for a given
GPU-capable runner.
GPU tests are tagged with the label "gpu". The new targets
```
make buildtests_gpu
make check_gpu
```
allow building and running only the gpu tests.
(cherry picked from commit 16f9a20a6f )
2023-07-10 15:30:41 -07:00
Antonio Sánchez
0f39c851a5
Fix use of arg function in CUDA.
...
(cherry picked from commit 63dcb429cd )
2023-07-10 15:30:41 -07:00
Kevin Leonardic
daa0b70a65
Fix argument for _mm256_cvtps_ph imm parameter
...
(cherry picked from commit d4b05454a7 )
2023-07-10 15:30:41 -07:00
Antonio Sánchez
33ba98b641
Ensure EIGEN_HAS_ARM64_FP16_VECTOR_ARITHMETIC is always defined on arm.
...
(cherry picked from commit 31cd2ad371 )
2023-07-10 15:30:41 -07:00
Antonio Sánchez
e6e921f0e3
Disable FP16 arithmetic for arm32.
...
(cherry picked from commit 7465b7651e )
2023-07-10 15:30:41 -07:00
Alexander Shaposhnikov
ebfdd6bdea
Do not set EIGEN_HAS_ARM64_FP16_SCALAR_ARITHMETIC for cuda compilation
...
(cherry picked from commit 316eab8deb )
2023-07-10 15:30:41 -07:00
Alejandro Acosta
357bb11066
Replace usage of CudaStreamDevice with GpuStreamDevice in tensor benchmarks GPU
...
(cherry picked from commit 07e4604b19 )
2023-07-10 15:30:40 -07:00
Rasmus Munk Larsen
9b3d104c02
Add missing braces in Umeyama.h
...
(cherry picked from commit 1321821e86 )
2023-07-10 14:52:08 -07:00
Rasmus Munk Larsen
af3ca50f0b
Work around compiler bug in Umeyama.h.
...
(cherry picked from commit 524c329ab2 )
2023-07-10 14:52:08 -07:00
Antonio Sánchez
26b8fabd80
Return NaN in ndtri for values outside valid input range.
...
(cherry picked from commit 1f79a6078f )
2023-07-10 14:52:08 -07:00
Charles Schlosser
385a0b38f8
JacobiSVD: set m_nonzeroSingularValues to zero if not finite
...
(cherry picked from commit fdc749de2a )
2023-07-10 14:52:08 -07:00
Antonio Sanchez
a4ecfd8ead
Fix boolean bitwise and warning.
...
(cherry picked from commit 70410310a4 )
2023-07-10 14:52:08 -07:00
Rasmus Munk Larsen
f296720d7d
Make sure we return +/-1 above the clamping point for Erf().
...
(cherry picked from commit b378014fef )
2023-07-10 14:52:08 -07:00
Rob Conde
f04d02dbf6
exclude Eigen/Core and Eigen/src/Core from being ignored due to core ignore rule
...
(cherry picked from commit 990a282fc4 )
2023-07-10 14:52:08 -07:00
Rohit Goswami
6f9bffe8dd
DOC: Update documentation for 3.4.x
...
(cherry picked from commit b0eded878d )
2023-07-10 14:52:08 -07:00
Rasmus Munk Larsen
d4c24eca96
Don't crash on empty tensor contraction.
...
(cherry picked from commit b0f877f8e0 )
2023-07-10 14:52:08 -07:00
Antonio Sánchez
72b0759451
Fix arm builds.
...
(cherry picked from commit 2c8011c2dd )
2023-07-10 14:52:08 -07:00
Jonas Schulze
34d0d83278
Fix some typos
...
(cherry picked from commit 81cb6a51d0 )
2023-07-10 14:52:08 -07:00
Antonio Sánchez
63e8b31c94
Fix parsing of command-line arguments when already specified as a cmake list.
...
(cherry picked from commit 555cec17ed )
2023-07-10 14:52:08 -07:00