eigen

mirror of https://gitlab.com/libeigen/eigen.git synced 2026-04-10 11:34:33 +08:00

Author	SHA1	Message	Date
Chip Kerchner	5d9beb81ab	Initial version of reactivating RVV features like GeneralBlockPanelKernel libeigen/eigen!2096	2026-01-07 13:41:02 -05:00
Chip Kerchner	4f14da11d9	Add basic support for packetmath for BF16 RVV libeigen/eigen!2093	2025-12-16 14:25:46 -05:00
sharad bhaskar	8a1083e9bf	Aocl integration updated libeigen/eigen!1952	2025-11-24 17:20:42 +00:00
Chip Kerchner	49623d0c4e	This patch adds support for RISCV's vector extension RVV1.0. libeigen/eigen!2030	2025-11-20 16:28:07 +00:00
Rasmus Munk Larsen	8eb6551a8a	Add support for complex numbers in the generic clang backend libeigen/eigen!2078 Co-authored-by: Rasmus Munk Larsen <rmlarsen@google.com>	2025-11-20 00:26:37 +00:00
Rasmus Munk Larsen	fecfa7f27e	Fixes to make generic backend build with AVX512 See merge request libeigen/eigen!2064 Co-authored-by: Rasmus Munk Larsen <rmlarsen@google.com>	2025-11-06 22:50:32 +00:00
Rasmus Munk Larsen	ec93a6d098	Add a generic Eigen backend based on clang vector extensions The goal of this MR is to implement a generic SIMD backend (packet ops) for Eigen that uses clang vector extensions instead of platform-dependent intrinsics. Ideally, this should make it possible to build Eigen and achieve reasonable speed on any platform that has a recent clang compiler, without having to write any inline assembly or intrinsics. Caveats: * The current implementation is a proof of concept and supports vectorization for float, double, int32_t, and int64_t using fixed-size 512-bit vectors (a somewhat arbitrary choice). I have not done much to tune this for speed yet. * For now, there is no way to enable this other than setting -DEIGEN_VECTORIZE_GENERIC on the command line. * This only compiles with newer versions of clang. I have tested that it compiles and all tests pass with clang 19.1.7. https://clang.llvm.org/docs/LanguageExtensions.html#vectors-and-extended-vectors Closes #2998 and #2997 See merge request libeigen/eigen!2051 Co-authored-by: Rasmus Munk Larsen <rmlarsen@google.com> Co-authored-by: Antonio Sánchez <cantonios@google.com>	2025-11-06 21:52:19 +00:00
Saran Tunyasuvunakool	db02f97850	Add a missing `#include <version>` to Core.	2025-10-15 15:10:30 +00:00
Antonio Sánchez	a627f72cd6	Add "Version" file and update version.	2025-09-20 02:08:59 +00:00
Tyler Veness	d368998120	Fix MSVC error about missing std::bit_cast	2025-08-23 22:25:52 +00:00
Charles Schlosser	43a65a9cbd	add RealView api	2025-08-12 16:55:05 +00:00
Charles Schlosser	f5ead2d34c	Fix intel packet math header inclusion order	2025-07-29 01:00:37 +00:00
Rasmus Munk Larsen	efe5b6979d	Unconditionally include <memory>. Some c++20 builds are currently broken because it is needed for std::assume_aligned.	2025-07-18 18:06:28 +00:00
Charles Schlosser	d0b490ee09	Optimize maxCoeff and friends	2025-06-06 14:55:49 +00:00
Charles Schlosser	4fdf87bbf5	clean up intel packet reductions	2025-05-30 19:18:07 +00:00
Antonio Sánchez	70f2aead9a	Use native _Float16 for AVX512FP16 and update vectorization.	2025-03-19 19:55:26 +00:00
Antonio Sánchez	b1e74b1ccd	Fix all the doxygen warnings.	2025-02-01 00:00:31 +00:00
Pengzhou0810	e986838464	Add LoongArch64 architecture LSX support.(build/test )	2025-01-20 18:37:44 +00:00
Charles Schlosser	8ad4344ca7	optimize setConstant, setZero	2024-11-22 03:39:19 +00:00
Charles Schlosser	fb477b8be1	Better dot products	2024-09-10 21:02:31 +00:00
Rasmus Munk Larsen	1dbc7581ec	Include <thread> for std::this_thread::yield().	2024-08-14 17:44:14 +00:00
Tobias Wood	5a9f66fb35	Fix Thread tests	2024-05-24 16:50:14 +00:00
Charles Schlosser	99adca8b34	Incorporate Threadpool in Eigen Core	2024-05-20 23:42:51 +00:00
Charles Schlosser	e63d9f6ccb	Fix random again	2024-03-29 21:49:27 +00:00
Cheng Wang	2c6b61c006	Add half and quarter vector support to HVX architecture	2024-01-22 21:23:21 +00:00
Tobias Wood	f38e16c193	Apply clang-format	2023-11-29 11:12:48 +00:00
Rasmus Munk Larsen	76e8c04553	Generalize parallel GEMM implementation in Core to work with ThreadPool in addition to OpenMP.	2023-11-10 17:42:30 +00:00
cheng wang	66e8f38891	Add architecture definition files for Qualcomm Hexagon Vector Extension (HVX)	2023-08-01 17:47:57 +00:00
Charles Schlosser	59b3ef5409	Partially Vectorize Cast	2023-06-09 16:54:31 +00:00
Charles Schlosser	fbf7189bd5	Fix cuda compilation	2023-05-08 16:15:47 +00:00
Mehdi Goli	0623791930	[SYCL-2020] Enabling USM support for SYCL. SYCL-1.2.1 did not have support for USM.	2023-05-05 17:30:36 +00:00
Antonio Sánchez	2d0c6ad873	Revert "Vectorize cast" This reverts commit `eb5ff1861a`	2023-04-26 18:03:36 +00:00
Charles Schlosser	eb5ff1861a	Vectorize cast	2023-04-26 02:50:13 +00:00
Chip Kerchner	3f3ce214e6	New BF16 pcast functions and move type casting to TypeCasting.h	2023-04-18 02:38:38 +00:00
Charles Schlosser	1ce8b25825	Vectorize any() / all()	2023-03-06 23:54:02 +00:00
Antonio Sánchez	3f7e775715	Add IWYU export pragmas to top-level headers.	2023-02-08 17:40:31 +00:00
Charles Schlosser	2a90653395	fix lapacke config	2023-02-03 16:40:08 +00:00
Antonio Sánchez	08c961e837	Add custom ODR-safe assert.	2023-01-20 17:38:13 +00:00
Sean McBride	d70b4864d9	issue #2581 : review and cleanup of compiler version checks	2023-01-17 18:58:34 +00:00
Mehdi Goli	b523120687	[SYCL-2020 Support] Enabling Intel DPCPP Compiler support to Eigen	2023-01-16 07:04:08 +00:00
Alexander Richardson	37de432907	Avoid using std::raise() for divide by zero	2022-12-14 20:06:16 +00:00
Rasmus Munk Larsen	7b2901e2aa	Add vectorized integer division for int32 with AVX512, AVX or SSE.	2022-09-21 00:27:23 +00:00
Thomas Gloor	ec9c7163a3	Feature/skew symmetric matrix3	2022-09-08 20:44:40 +00:00
Matthew Sterrett	7a3b667c43	Add support for AVX512-FP16 for vectorizing half precision math	2022-08-17 18:15:21 +00:00
Chip Kerchner	9e0afe0f02	Fix non-VSX PowerPC build	2022-08-08 18:18:17 +00:00
Alexander Richardson	b7668c0371	Avoid including <sstream> with EIGEN_NO_IO	2022-07-29 18:02:51 +00:00
aaraujom	d49ede4dc4	Add AVX512 s/dgemm optimizations for compute kernel (2nd try)	2022-05-28 02:00:21 +00:00
Antonio Sánchez	9b9496ad98	Revert "Add AVX512 optimizations for matrix multiply" This reverts commit `25db0b4a82`	2022-05-13 18:50:33 +00:00
aaraujom	25db0b4a82	Add AVX512 optimizations for matrix multiply	2022-05-12 23:41:19 +00:00
Shi, Brian	fc1d888415	Remove AVX512VL dependency in trsm	2022-04-14 12:44:24 -07:00

1 2 3 4 5 ...

486 Commits