Antonio Sánchez
b2f95d3733
Fix more cache size queries.
...
libeigen/eigen!2295
2026-03-14 15:43:24 +00:00
Antonio Sánchez
9ae0e0f195
Remove include from within Eigen namespace.
...
libeigen/eigen!2294
2026-03-13 21:03:24 +00:00
Chip Kerchner
5d9beb81ab
Initial version of reactivating RVV features like GeneralBlockPanelKernel
...
libeigen/eigen!2096
2026-01-07 13:41:02 -05:00
Chip Kerchner
4f14da11d9
Add basic support for packetmath for BF16 RVV
...
libeigen/eigen!2093
2025-12-16 14:25:46 -05:00
sharad bhaskar
8a1083e9bf
Aocl integration updated
...
libeigen/eigen!1952
2025-11-24 17:20:42 +00:00
Chip Kerchner
49623d0c4e
This patch adds support for RISCV's vector extension RVV1.0.
...
libeigen/eigen!2030
2025-11-20 16:28:07 +00:00
Rasmus Munk Larsen
8eb6551a8a
Add support for complex numbers in the generic clang backend
...
libeigen/eigen!2078
Co-authored-by: Rasmus Munk Larsen <rmlarsen@google.com >
2025-11-20 00:26:37 +00:00
Rasmus Munk Larsen
fecfa7f27e
Fixes to make generic backend build with AVX512
...
See merge request libeigen/eigen!2064
Co-authored-by: Rasmus Munk Larsen <rmlarsen@google.com >
2025-11-06 22:50:32 +00:00
Rasmus Munk Larsen
ec93a6d098
Add a generic Eigen backend based on clang vector extensions
...
The goal of this MR is to implement a generic SIMD backend (packet ops) for Eigen that uses clang vector extensions instead of platform-dependent intrinsics. Ideally, this should make it possible to build Eigen and achieve reasonable speed on any platform that has a recent clang compiler, without having to write any inline assembly or intrinsics.
Caveats:
* The current implementation is a proof of concept and supports vectorization for float, double, int32_t, and int64_t using fixed-size 512-bit vectors (a somewhat arbitrary choice). I have not done much to tune this for speed yet.
* For now, there is no way to enable this other than setting -DEIGEN_VECTORIZE_GENERIC on the command line.
* This only compiles with newer versions of clang. I have tested that it compiles and all tests pass with clang 19.1.7.
https://clang.llvm.org/docs/LanguageExtensions.html#vectors-and-extended-vectors
Closes #2998 and #2997
See merge request libeigen/eigen!2051
Co-authored-by: Rasmus Munk Larsen <rmlarsen@google.com >
Co-authored-by: Antonio Sánchez <cantonios@google.com >
2025-11-06 21:52:19 +00:00
Saran Tunyasuvunakool
db02f97850
Add a missing #include <version> to Core.
2025-10-15 15:10:30 +00:00
Antonio Sánchez
a627f72cd6
Add "Version" file and update version.
2025-09-20 02:08:59 +00:00
Tyler Veness
d368998120
Fix MSVC error about missing std::bit_cast
2025-08-23 22:25:52 +00:00
Charles Schlosser
43a65a9cbd
add RealView api
2025-08-12 16:55:05 +00:00
Charles Schlosser
f5ead2d34c
Fix intel packet math header inclusion order
2025-07-29 01:00:37 +00:00
Rasmus Munk Larsen
efe5b6979d
Unconditionally include <memory>. Some c++20 builds are currently broken because it is needed for std::assume_aligned.
2025-07-18 18:06:28 +00:00
Charles Schlosser
d0b490ee09
Optimize maxCoeff and friends
2025-06-06 14:55:49 +00:00
Charles Schlosser
4fdf87bbf5
clean up intel packet reductions
2025-05-30 19:18:07 +00:00
Antonio Sánchez
70f2aead9a
Use native _Float16 for AVX512FP16 and update vectorization.
2025-03-19 19:55:26 +00:00
Antonio Sánchez
b1e74b1ccd
Fix all the doxygen warnings.
2025-02-01 00:00:31 +00:00
Pengzhou0810
e986838464
Add LoongArch64 architecture LSX support.(build/test )
2025-01-20 18:37:44 +00:00
Charles Schlosser
8ad4344ca7
optimize setConstant, setZero
2024-11-22 03:39:19 +00:00
Charles Schlosser
fb477b8be1
Better dot products
2024-09-10 21:02:31 +00:00
Rasmus Munk Larsen
1dbc7581ec
Include <thread> for std::this_thread::yield().
2024-08-14 17:44:14 +00:00
Tobias Wood
5a9f66fb35
Fix Thread tests
2024-05-24 16:50:14 +00:00
Charles Schlosser
99adca8b34
Incorporate Threadpool in Eigen Core
2024-05-20 23:42:51 +00:00
Charles Schlosser
e63d9f6ccb
Fix random again
2024-03-29 21:49:27 +00:00
Cheng Wang
2c6b61c006
Add half and quarter vector support to HVX architecture
2024-01-22 21:23:21 +00:00
Tobias Wood
f38e16c193
Apply clang-format
2023-11-29 11:12:48 +00:00
Rasmus Munk Larsen
76e8c04553
Generalize parallel GEMM implementation in Core to work with ThreadPool in addition to OpenMP.
2023-11-10 17:42:30 +00:00
cheng wang
66e8f38891
Add architecture definition files for Qualcomm Hexagon Vector Extension (HVX)
2023-08-01 17:47:57 +00:00
Charles Schlosser
59b3ef5409
Partially Vectorize Cast
2023-06-09 16:54:31 +00:00
Charles Schlosser
fbf7189bd5
Fix cuda compilation
2023-05-08 16:15:47 +00:00
Mehdi Goli
0623791930
[SYCL-2020] Enabling USM support for SYCL. SYCL-1.2.1 did not have support for USM.
2023-05-05 17:30:36 +00:00
Antonio Sánchez
2d0c6ad873
Revert "Vectorize cast"
...
This reverts commit eb5ff1861a
2023-04-26 18:03:36 +00:00
Charles Schlosser
eb5ff1861a
Vectorize cast
2023-04-26 02:50:13 +00:00
Chip Kerchner
3f3ce214e6
New BF16 pcast functions and move type casting to TypeCasting.h
2023-04-18 02:38:38 +00:00
Charles Schlosser
1ce8b25825
Vectorize any() / all()
2023-03-06 23:54:02 +00:00
Antonio Sánchez
3f7e775715
Add IWYU export pragmas to top-level headers.
2023-02-08 17:40:31 +00:00
Charles Schlosser
2a90653395
fix lapacke config
2023-02-03 16:40:08 +00:00
Antonio Sánchez
08c961e837
Add custom ODR-safe assert.
2023-01-20 17:38:13 +00:00
Sean McBride
d70b4864d9
issue #2581 : review and cleanup of compiler version checks
2023-01-17 18:58:34 +00:00
Mehdi Goli
b523120687
[SYCL-2020 Support] Enabling Intel DPCPP Compiler support to Eigen
2023-01-16 07:04:08 +00:00
Alexander Richardson
37de432907
Avoid using std::raise() for divide by zero
2022-12-14 20:06:16 +00:00
Rasmus Munk Larsen
7b2901e2aa
Add vectorized integer division for int32 with AVX512, AVX or SSE.
2022-09-21 00:27:23 +00:00
Thomas Gloor
ec9c7163a3
Feature/skew symmetric matrix3
2022-09-08 20:44:40 +00:00
Matthew Sterrett
7a3b667c43
Add support for AVX512-FP16 for vectorizing half precision math
2022-08-17 18:15:21 +00:00
Chip Kerchner
9e0afe0f02
Fix non-VSX PowerPC build
2022-08-08 18:18:17 +00:00
Alexander Richardson
b7668c0371
Avoid including <sstream> with EIGEN_NO_IO
2022-07-29 18:02:51 +00:00
aaraujom
d49ede4dc4
Add AVX512 s/dgemm optimizations for compute kernel (2nd try)
2022-05-28 02:00:21 +00:00
Antonio Sánchez
9b9496ad98
Revert "Add AVX512 optimizations for matrix multiply"
...
This reverts commit 25db0b4a82
2022-05-13 18:50:33 +00:00