eigen

mirror of https://gitlab.com/libeigen/eigen.git synced 2026-04-10 11:34:33 +08:00

Author	SHA1	Message	Date
Antonio Sanchez	3395f4e604	Fix tridiagonalization_inplace_selector. The `Options` of the new `hCoeffs` vector do not necessarily match those of the `MatrixType`, leading to build errors. Having the `CoeffVectorType` be a template parameter relieves this restriction. (cherry picked from commit `ebd4b17d2f`)	2021-09-08 15:47:39 +00:00
Antonio Sanchez	f03d3e7072	Missing EIGEN_DEVICE_FUNCs to get `gpu_basic` passing with CUDA 9. CUDA 9 seems to require labelling defaulted constructors as `EIGEN_DEVICE_FUNC`, despite giving warnings that such labels are ignored. Without these labels, the `gpu_basic` test fails to compile, with errors about calling `__host__` functions from `__host__ __device__` functions. (cherry picked from commit `998bab4b04`)	2021-09-02 03:21:43 +00:00
Maxiwell S. Garcia	b8cf1ed753	Rename 'vec_all_nan' of cxx11_tensor_expr test because this symbol is used by altivec.h (cherry picked from commit `09fc0f97b5`)	2021-09-01 17:26:59 +00:00
Rasmus Munk Larsen	9263475740	Add missing dependency on LAPACK test suite binaries to target `buildtests`, so `make check` will work correctly when `EIGEN_ENABLE_LAPACK_TESTS` is `ON`. (cherry picked from commit `6f429a202d`)	2021-09-01 16:41:47 +00:00
Rasmus Munk Larsen	0fdc99c65e	Allow old Fortran code for LAPACK tests to compile despite argument mismatch errors (REAL passed to COMPLEX workspace argument) with GNU Fortran 10. (cherry picked from commit `7e096ddcb0`)	2021-09-01 16:41:28 +00:00
Antonio Sanchez	07cc362238	Fix EIGEN_OPTIMIZATION_BARRIER for arm-clang. Clang doesn't like !621, needs the "g" constraint back. The "g" constraint also works for GCC >= 5. This fixes our gitlab CI. (cherry picked from commit `3a6296d4f1`)	2021-09-01 16:40:08 +00:00
Antonio Sanchez	4ef67cbfb2	GCC 4.8 arm EIGEN_OPTIMIZATION_BARRIER fix (#2315). GCC 4.8 doesn't seem to like the `g` register constraint, failing to compile with "error: 'asm' operand requires impossible reload". Tested `r` instead, and that seems to work, even with latest compilers. Also fixed some minor macro issues to eliminate warnings on armv7. Fixes #2315. (cherry picked from commit `ff07a8a639`)	2021-08-31 21:23:28 +00:00
Antonio Sanchez	c2b6df6e60	Disable cuda Eigen::half vectorization on host. All cuda `__half` functions are device-only in CUDA 9, including conversions. Host-side conversions were added in CUDA 10. The existing code doesn't build prior to 10.0. All arithmetic functions are always device-only, so there's therefore no reason to use vectorization on the host at all. Modified the code to disable vectorization for `__half` on host, which required also updating the `TensorReductionGpu` implementation which previously made assumptions about available packets. (cherry picked from commit `cc3573ab44`)	2021-08-31 21:23:11 +00:00
Adam Kallai	277d369060	win: include intrin header in Windows on ARM. The intrin header is needed for _BitScanReverse and _BitScanReverse64. (cherry picked from commit `1415817d8d`)	2021-08-31 21:22:37 +00:00
Antonio Sanchez	7aee90b8d3	Fix fix<N> when variable templates are not supported. There were some typos that checked `EIGEN_HAS_CXX14` that should have checked `EIGEN_HAS_CXX14_VARIABLE_TEMPLATES`, causing a mismatch in some of the `Eigen::fix<N>` assumptions. Also fixed the `symbolic_index` test when `EIGEN_HAS_CXX14_VARIABLE_TEMPLATES` is 0. Fixes #2308 (cherry picked from commit `5db9e5c779`)	2021-08-30 16:23:35 +00:00
Rasmus Munk Larsen	3147391d94	Change version to 3.4.0. 3.4.0	2021-08-18 13:41:58 -07:00
Antonio Sanchez	115591b9e3	Workaround VS 2017 arg bug. In VS 2017, `std::arg` for real inputs always returns 0, even for negative inputs. It should return `PI` for negative real values. This seems to be fixed in VS 2019 (MSVC 1920). (cherry picked from commit `2b410ecbef`)	2021-08-18 19:04:50 +00:00
Antonio Sanchez	fd100138dd	Remove unaligned assert tests. Manually constructing an unaligned object declared as aligned invokes UB, so we cannot technically check for alignment from within the constructor. Newer versions of clang optimize away this check. Removing the affected tests. (cherry picked from commit `0c4ae56e37`)	2021-08-18 18:39:04 +00:00
Jakob Struye	1ec173b54e	Clearer doc for squaredNorm (cherry picked from commit `53a29c7e35`)	2021-08-18 15:12:36 +00:00
Antonio Sanchez	aef926abf6	Renamed shift_left/shift_right to shiftLeft/shiftRight. For naming consistency. Also moved to ArrayCwiseUnaryOps, and added test. (cherry picked from commit `fc9d352432`)	2021-08-18 14:44:31 +00:00
Antonio Sanchez	f1032255d3	Add missing PPC packet comparisons. This is to fix the packetmath tests on the ppc pipeline. (cherry picked from commit `2cc6ee0d2e`)	2021-08-17 15:33:55 +00:00
Chip-Kerchner	f57dec64ef	Fix unaligned loads in ploadLhs & ploadRhs for P8. (cherry picked from commit `8dcf3e38ba`)	2021-08-17 12:48:36 +00:00
Rasmus Munk Larsen	926e1a8226	Update documentation for matrix decompositions and least squares solvers. (cherry picked from commit `7e6f94961c`)	2021-08-16 22:11:38 +00:00
andiwand	cd474d4cd0	minor doc fix in Map.h (cherry picked from commit `5c6b3efead`)	2021-08-16 14:26:39 +00:00
Chip-Kerchner	0b56b62f30	Reverse compare logic in F32ToBf16 since vec_cmpne is not available in Power8 - now compiles for clang10 default (P8). (cherry picked from commit `e07227c411`)	2021-08-13 18:01:15 +00:00
Chip Kerchner	44cc96e1a1	Get rid of used uninitialized warnings for EIGEN_UNUSED_VARIABLE in gcc11+ (cherry picked from commit `66499f0f17`)	2021-08-12 21:39:17 +00:00
Rasmus Munk Larsen	576e451b10	Add CompleteOrthogonalDecomposition to the table of linear algebra decompositions. (cherry picked from commit `96e3b4fc95`)	2021-08-12 16:49:40 +00:00
Antonio Sanchez	0d89012708	Update code snippet for tridiagonalize_inplace. (cherry picked from commit `fb1718ad14`)	2021-08-12 15:37:32 +00:00
Rasmus Munk Larsen	6d2506040c	Revise the meta_least_common_multiple function template: add a bool variable to check whether A is larger than B. This reduces compile time when A is smaller than B and avoids a compilation failure when A is small and B is large. Authored by @awoniu. (cherry picked from commit `8ce341caf2`)	2021-08-11 18:11:26 +00:00
Nikolay Tverdokhleb	cb44a003de	Do not set AnnoyingScalar::dont_throw if not defined EIGEN_TEST_ANNOYING_SCALAR_DONT_THROW. - Because that member is not declared if the macro is defined. (cherry picked from commit `f1b899eef7`)	2021-08-11 16:39:44 +00:00
ChipKerchner	13d7658c5d	Fix errors on older compilers (gcc 7.5 - lack of vec_neg, clang10 - can not use const pointers with vec_xl). (cherry picked from commit `413bc491f1`)	2021-08-10 20:40:54 +00:00
jenswehner	338924602d	added includes for unordered_map (cherry picked from commit `e3e74001f7`)	2021-08-10 16:10:03 +00:00
Gauri Deshpande	93bff85a42	remove denormal flushing in fp32tobf16 for avx & avx512 (cherry picked from commit `e6a5a594a7`)	2021-08-09 22:15:42 +00:00
Rasmus Munk Larsen	4e0357c6dd	Avoid memory allocation in tridiagonalization_inplace_selector::run. (cherry picked from commit `a5a7faeb45`)	2021-08-06 21:48:00 +00:00
Daniel N. Miller (APD)	1e9f623f3e	Do not build shared libs if not supported (cherry picked from commit `09d7122468`)	2021-08-06 21:47:37 +00:00
Jens Wehner	4240b480e0	updated documentation for middleCol and middleRow (cherry picked from commit `4d870c49b7`)	2021-08-05 17:53:36 +00:00
Antonio Sanchez	5b83d3c4bc	Make inverse 3x3 faster and avoid gcc bug. There seems to be a gcc 4.7 bug that incorrectly flags the current 3x3 inverse as using uninitialized memory. I'm pretty sure it's a false positive, but it's hard to trigger. The same warning does not trigger with clang or later compiler versions. In trying to find a work-around, this implementation turns out to be faster anyways for static-sized matrices. ``` name old cpu/op new cpu/op delta BM_Inverse3x3<DynamicMatrix3T<float>> 423ns ± 2% 433ns ± 3% +2.32% (p=0.000 n=98+96) BM_Inverse3x3<DynamicMatrix3T<double>> 425ns ± 2% 427ns ± 3% +0.48% (p=0.003 n=99+96) BM_Inverse3x3<StaticMatrix3T<float>> 7.10ns ± 2% 0.80ns ± 1% -88.67% (p=0.000 n=114+112) BM_Inverse3x3<StaticMatrix3T<double>> 7.45ns ± 2% 1.34ns ± 1% -82.01% (p=0.000 n=105+111) BM_AliasedInverse3x3<DynamicMatrix3T<float>> 409ns ± 3% 419ns ± 3% +2.40% (p=0.000 n=100+98) BM_AliasedInverse3x3<DynamicMatrix3T<double>> 414ns ± 3% 413ns ± 2% ~ (p=0.322 n=98+98) BM_AliasedInverse3x3<StaticMatrix3T<float>> 7.57ns ± 1% 0.80ns ± 1% -89.37% (p=0.000 n=111+114) BM_AliasedInverse3x3<StaticMatrix3T<double>> 9.09ns ± 1% 2.58ns ±41% -71.60% (p=0.000 n=113+116) ``` (cherry picked from commit `5ad8b9bfe2`)	2021-08-04 22:06:52 +00:00
Antonio Sanchez	46ecdcd745	Fix MPReal detection and support. The latest version of `mpreal` has a bug that breaks `min`/`max`. It also breaks with the latest dev version of `mpfr`. Here we add `FindMPREAL.cmake` which searches for the library and tests if compilation works. Removed our internal copy of `mpreal.h` under `unsupported/test`, as it is out-of-sync with the latest, and similarly breaks with the latest `mpfr`. It would be best to use the installed version of `mpreal` anyways, since that's what we actually want to test. Fixes #2282. (cherry picked from commit `31f796ebef`)	2021-08-03 18:13:12 +00:00
Antonio Sanchez	9a1691a14e	Fix cmake warnings, FindPASTIX/FindPTSCOTCH. We were getting a lot of warnings due to nested `find_package` calls within `Find*.cmake` files. The recommended approach is to use [`find_dependency`](https://cmake.org/cmake/help/latest/module/CMakeFindDependencyMacro.html) in package configuration files. I made this change for all instances. Case mismatches between `Find<Package>.cmake` and calling `find_package(<PACKAGE>`) also lead to warnings. Fixed for `FindPASTIX.cmake` and `FindSCOTCH.cmake`. `FindBLASEXT.cmake` was broken due to calling `find_package_handle_standard_args(BLAS ...)`. The package name must match, otherwise the `find_package(BLASEXT)` falsely thinks the package wasn't found. I changed to `BLASEXT`, but then also copied that value to `BLAS_FOUND` for compatibility. `FindPastix.cmake` had a typo that incorrectly added `PTSCOTCH` when looking for the `SCOTCH` component. `FindPTSCOTCH` incorrectly added `*-NOTFOUND` to include/library lists, corrupting them. This led to cmake errors down-the-line. Fixes #2288. (cherry picked from commit `1cdec38653`)	2021-08-03 17:48:20 +00:00
Antonio Sanchez	bb33880e57	Fix TriSycl CMake files. This is to enable compiling with the latest trisycl. `FindTriSYCL.cmake` was broken by commit `00f32752`, which modified `add_sycl_to_target` for ComputeCPP. This makes the corresponding modifications for trisycl to make them consistent. Also, trisycl now requires c++17. (cherry picked from commit `8cf6cb27ba`)	2021-08-03 17:25:17 +00:00
Antonio Sanchez	237c59a2aa	Modify scalar pzero, ptrue, pselect, and p<binary> operations to avoid memset. The `memset` function and bitwise manipulation only apply to POD types that do not require initialization, otherwise resulting in UB. We currently violate this in `ptrue` and `pzero`, we assume bitmasks for `pselect`, and bitwise operations are applied byte-by-byte in the generic implementations. This is causing issues for scalar types that do require initialization or that contain non-POD info such as pointers (#2201). We either break them, or force specializations of these functions for custom scalars, even if they are not vectorized. Here we modify these functions for scalars only - instead using only scalar operations: - `pzero`: `Scalar(0)` for all scalars. - `ptrue`: `Scalar(1)` for non-trivial scalars, bitset to one bits for trivial scalars. - `pselect`: ternary select comparing mask to `Scalar(0)` for all scalars - `pand`, `por`, `pxor`, `pnot`: use operators `&`, `\|`, `^`, `~` for all integer or non-trivial scalars, otherwise apply bytewise. For non-scalar types, the original implementations are used to maintain compatibility and minimize the number of changes. Fixes #2201. (cherry picked from commit `3d98a6ef5c`)	2021-08-03 16:32:59 +00:00
Antonio Sanchez	3dc42eeaec	Enable equality comparisons on GPU. Since `std::equal_to::operator()` is not a device function, it fails on GPU. On my device, I seem to get a silent crash in the kernel (no reported error, but the kernel does not complete). Replacing this with a portable version enables comparisons on device. Addresses #2292 - would need to be cherry-picked. The 3.3 branch also requires adding `EIGEN_DEVICE_FUNC` in `BooleanRedux.h` to get fully working. (cherry picked from commit `7880f10526`)	2021-08-03 16:15:44 +00:00
hyunggi-sv	7adc1545b4	fix:typo in dox (has->have) (cherry picked from commit `02a0e79c70`)	2021-08-03 00:54:41 +00:00
Antonio Sanchez	c0c7b695cd	Fix assignment operator issue for latest MSVC+NVCC. Details are scattered across #920, #1000, #1324, #2291. Summary: some MSVC versions have a bug that requires omitting explicit `operator=` definitions (leads to duplicate definition errors), and some MSVC versions require adding explicit `operator=` definitions (otherwise implicitly deleted errors). This mess tries to cover all the cases encountered. Fixes #2291. (cherry picked from commit `9816fe59b4`)	2021-08-03 00:52:21 +00:00
Alexander Karatarakis	c334eece44	_DerType -> DerivativeType as underscore-followed-by-caps is a reserved identifier (cherry picked from commit `f357283d31`)	2021-07-29 18:18:47 +00:00
Jonas Harsch	5ccb72b2e4	Fixed typo in TutorialSparse.dox (cherry picked from commit `5b81764c0f`)	2021-07-26 14:33:10 +00:00
arthurfeeney	9c90d5d832	Fixes #1387 for compilation error in JacobiSVD with HouseholderQRPreconditioner that occurs when input is a compile-time row vector. (cherry picked from commit `a77638387d`)	2021-07-22 18:01:55 +00:00
Antonio Sanchez	5d37114fc0	Fix explicit default cache size typo. (cherry picked from commit `297f0f563d`)	2021-07-20 18:42:25 +00:00
Rohit Santhanam	930696fc53	Enable extract et. al. for HIP GPU. (cherry picked from commit `beea14a18f`)	2021-07-09 16:14:19 +00:00
Rasmus Munk Larsen	56966fd2e6	Defer to std::fill_n when filling a dense object with a constant value. (cherry picked from commit `0c361c4899`)	2021-07-09 03:59:56 +00:00
Jonas Harsch	5a3c9eddb4	Removed superfluous boolean `degenerate` in TensorMorphing.h. (cherry picked from commit `e9c9a3130b`)	2021-07-08 18:34:10 +00:00
Guoqiang QI	69ec4907da	Make a copy of the input matrix when trying to compute the inverse in place; this fixes #2285. (cherry picked from commit `4bcd42c271`)	2021-07-08 17:07:54 +00:00
Antonio Sanchez	7571704a43	Fix CMake directory issues. Allows absolute and relative paths for - `INCLUDE_INSTALL_DIR` - `CMAKEPACKAGE_INSTALL_DIR` - `PKGCONFIG_INSTALL_DIR` Type should be `PATH` not `STRING`. Contrary to !211, these don't seem to be made absolute if user-defined - according to the doc any directories should use `PATH` type, which allows a file dialog to be used via the GUI. It also better handles file separators. If user provides an absolute path, it will be made relative to `CMAKE_INSTALL_PREFIX` so that the `configure_packet_config_file` will work. Fixes #2155 and #2269. (cherry picked from commit `f44f05532d`)	2021-07-07 17:44:00 +00:00
Antonio Sanchez	84955d109f	Fix Tensor documentation page. The extra [TOC] tag is generating a huge floating duplicated table-of-contents, which obscures the majority of the page (see bottom of https://eigen.tuxfamily.org/dox/unsupported/eigen_tensors.html). Remove it. Also, headers do not support markup (see [doxygen bug](https://github.com/doxygen/doxygen/issues/7467)), so backticks like ``` ``` end up generating titles that look like ``` Constructor <tt>Tensor<double,2></tt> ``` Removing backticks for now. To generate properly formatted headers, we must directly use html instead of markdown, i.e. ``` <h2>Constructor <code>Tensor<double,2></code></h2> ``` which is ugly. Fixes #2254. (cherry picked from commit `f5a9873bbb`)	2021-07-07 17:18:20 +00:00
Jonas Harsch	601814b575	Don't crash when attempting to shuffle an empty tensor. (cherry picked from commit `aab747021b`)	2021-07-02 21:08:38 +00:00


11497 Commits