eigen

mirror of https://gitlab.com/libeigen/eigen.git synced 2026-04-10 11:34:33 +08:00

Author	SHA1	Message	Date
Rasmus Munk Larsen	444ae9761d	Clamp igamma/igammac output to [0,1] for numerical stability libeigen/eigen!2229 Co-authored-by: Rasmus Munk Larsen <rmlarsen@gmail.com>	2026-02-28 08:52:44 -08:00
Rasmus Munk Larsen	667cabe3aa	Clean up comments in unsupported module libeigen/eigen!2198 Co-authored-by: Rasmus Munk Larsen <rmlarsen@gmail.com>	2026-02-22 22:04:23 -08:00
Rasmus Munk Larsen	112c2324bd	Consolidate BF16/F16 wrapper macros and simplify arch math functions libeigen/eigen!2195 Co-authored-by: Rasmus Munk Larsen <rmlarsen@gmail.com>	2026-02-22 20:17:43 -08:00
Blake	752911927f	betainc edge case checks at start of calculation libeigen/eigen!2123 Closes #2359	2026-02-08 10:05:06 -08:00
Rasmus Munk Larsen	abeba85356	Use proper float literals in SpecialFunctionsImpl.h.	2025-07-19 01:17:12 +00:00
Rasmus Munk Larsen	b5bef9dcb0	Fix bug in Erfc introduced in !1862 .	2025-07-18 17:58:48 -07:00
Adam Cogdell	3f00059beb	Fix fuzzer range error for scalar parity check.	2025-06-05 22:27:35 +00:00
Rasmus Munk Larsen	d5eec781b7	Get rid of redundant computation for large arguments to erf(x).	2024-11-18 10:51:58 -08:00
Rasmus Munk Larsen	5133c836c0	Vectorize erf(x) for double.	2024-11-16 19:05:16 +00:00
Rasmus Munk Larsen	0d366f6532	Vectorize erfc(x) for double and improve erfc(x) for float.	2024-11-08 17:21:11 +00:00
Rasmus Munk Larsen	7eea0a9213	Vectorize erfc() for float	2024-10-09 18:38:05 +00:00
Rasmus Munk Larsen	78f3c654ee	Don't use constexpr with half.	2024-10-08 16:44:40 +00:00
Rasmus Munk Larsen	74dcfbbd0f	Use ppolevl for polynomial evaluation in more places.	2024-10-07 13:27:28 -07:00
Rasmus Munk Larsen	a097f728fe	Avoid producing erf(x) = NaN for large \|x\|.	2024-10-04 12:15:23 -07:00
Rasmus Munk Larsen	44b16f48cb	Improve speed and accuracy or erf()	2024-10-03 01:52:16 +00:00
Tobias Wood	f38e16c193	Apply clang-format	2023-11-29 11:12:48 +00:00
Antonio Sánchez	6e4d5d4832	Add IWYU private pragmas to internal headers.	2023-08-21 16:25:22 +00:00
Antonio Sánchez	1f79a6078f	Return NaN in ndtri for values outside valid input range.	2023-05-05 16:27:26 +00:00
Rasmus Munk Larsen	b378014fef	Make sure we return +/-1 above the clamping point for Erf().	2023-04-18 20:53:01 +00:00
Rasmus Munk Larsen	1e223a956c	Add missing 'f' in float literal in SpecialFunctionsImpl.h that triggers implicit conversion warning.	2023-04-18 17:33:29 +00:00
Rasmus Munk Larsen	3026fc0d3c	Improve accuracy of erf().	2023-04-14 16:57:56 +00:00
Rasmus Munk Larsen	1c0a6cf228	Get rid of EIGEN_HAS_AVX512_MATH workaround.	2023-02-23 23:16:41 +00:00
Antonio Sánchez	e5794873cb	Replace assert with eigen_assert.	2022-10-04 17:11:23 +00:00
Antonio Sánchez	8ed3b9dcd6	Skip f16/bf16 bessel specializations on AVX512 if unavailable.	2022-06-24 15:10:36 +00:00
Tobias Schlüter	f3ba220c5d	Remove EIGEN_EMPTY_STRUCT_CTOR	2022-04-08 18:27:26 +00:00
Antonio Sánchez	9296bb4b93	Fix edge-case in zeta for large inputs.	2022-03-08 21:21:20 +00:00
Rasmus Munk Larsen	ea2c02060c	Add reciprocal packet op and fast specializations for float with SSE, AVX, and AVX512.	2022-01-21 23:49:18 +00:00
Kolja Brix	afa616bc9e	Fix some typos found	2021-09-23 15:22:00 +00:00
Rasmus Munk Larsen	6cadab6896	Clean up EIGEN_STATIC_ASSERT to only use standard c++11 static_assert.	2021-09-16 20:43:54 +00:00
Rasmus Munk Larsen	d7d0bf832d	Issue an error in case of direct inclusion of internal headers.	2021-09-10 19:12:26 +00:00
frgossen	33e0af0130	Return nan at poles of polygamma, digamma, and zeta if limit is not defined	2021-02-19 16:35:11 +00:00
Antonio Sanchez	2dbac2f99f	Fix bad NEON fp16 check	2020-12-04 13:42:18 -08:00
Antonio Sanchez	e2f21465fe	Special function implementations for half/bfloat16 packets. Current implementations fail to consider half-float packets, only half-float scalars. Added specializations for packets on AVX, AVX512 and NEON. Added tests to `special_packetmath`. The current `special_functions` tests would fail for half and bfloat16 due to lack of precision. The NEON tests also fail with precision issues and due to different handling of `sqrt(inf)`, so special functions bessel, ndtri have been disabled. Tested with AVX, AVX512.	2020-12-04 10:16:29 -08:00
David Tellenbach	8f8d77b516	Add EIGEN prefix for HAS_LGAMMA_R	2020-10-08 18:32:19 +02:00
Eugene Zhulenev	2279f2c62f	Use lgamma_r if it is available (update check for glibc 2.19+)	2020-10-08 00:26:45 +00:00
Teng Lu	386d809bde	Support BFloat16 in Eigen	2020-06-20 19:16:24 +00:00
Jeff Daily	b5df8cabd7	fix hip-clang compilation due to new HIP scalar accessor	2020-01-20 21:08:52 +00:00
Deven Desai	6d284bb1b7	Fix for HIP breakage - 200115. Adding a missing EIGEN_DEVICE_FUNC attr	2020-01-16 00:51:43 +00:00
Srinivas Vasudevan	f6c6de5d63	Ensure Igamma does not NaN or Inf for large values.	2020-01-14 21:32:48 +00:00
Jeff Daily	de07c4d1c2	fix compilation due to new HIP scalar accessor	2019-12-17 20:27:30 +00:00
Gael Guennebaud	c3f6fcf2c0	bug #1747 : one more fix for MSVC regarding the Bessel implementation.	2019-11-15 11:12:35 +01:00
Gael Guennebaud	39fb9eeccf	bug #1747 : fix compilation with MSVC	2019-10-14 22:50:23 +02:00
Rasmus Munk Larsen	20c4a9118f	Use "pdiv" rather than operator/ to support packet types.	2019-10-04 16:54:03 -07:00
Rasmus Munk Larsen	13ef08e5ac	Move implementation of vectorized error function erf() to SpecialFunctionsImpl.h.	2019-09-27 13:56:04 -07:00
Deven Desai	5e186b1987	Fix for the HIP build+test errors. The errors were introduced by this commit : `d38e6fbc27` After the above mentioned commit, some of the tests started failing with the following error ``` Building HIPCC object unsupported/test/CMakeFiles/cxx11_tensor_reduction_gpu_5.dir/cxx11_tensor_reduction_gpu_5_generated_cxx11_tensor_reduction_gpu.cu.o In file included from /home/rocm-user/eigen/unsupported/test/cxx11_tensor_reduction_gpu.cu:16: In file included from /home/rocm-user/eigen/unsupported/Eigen/CXX11/Tensor:29: In file included from /home/rocm-user/eigen/unsupported/Eigen/CXX11/../SpecialFunctions:70: /home/rocm-user/eigen/unsupported/Eigen/CXX11/../src/SpecialFunctions/SpecialFunctionsHalf.h:28:22: error: call to 'erf' is ambiguous return Eigen::half(Eigen::numext::erf(static_cast<float>(a))); ^~~~~~~~~~~~~~~~~~ /home/rocm-user/eigen/unsupported/test/../../Eigen/src/Core/MathFunctions.h:1600:7: note: candidate function [with T = float] float erf(const float &x) { return ::erff(x); } ^ /home/rocm-user/eigen/unsupported/Eigen/CXX11/../src/SpecialFunctions/SpecialFunctionsImpl.h:1897:5: note: candidate function [with Scalar = float] erf(const Scalar& x) { ^ In file included from /home/rocm-user/eigen/unsupported/test/cxx11_tensor_reduction_gpu.cu:16: In file included from /home/rocm-user/eigen/unsupported/Eigen/CXX11/Tensor:29: In file included from /home/rocm-user/eigen/unsupported/Eigen/CXX11/../SpecialFunctions:75: /home/rocm-user/eigen/unsupported/Eigen/CXX11/../src/SpecialFunctions/arch/GPU/GpuSpecialFunctions.h:87:23: error: call to 'erf' is ambiguous return make_double2(erf(a.x), erf(a.y)); ^~~ /home/rocm-user/eigen/unsupported/test/../../Eigen/src/Core/MathFunctions.h:1603:8: note: candidate function [with T = double] double erf(const double &x) { return ::erf(x); } ^ /home/rocm-user/eigen/unsupported/Eigen/CXX11/../src/SpecialFunctions/SpecialFunctionsImpl.h:1897:5: note: candidate function [with Scalar = double] erf(const Scalar& x) { ^ In file included from /home/rocm-user/eigen/unsupported/test/cxx11_tensor_reduction_gpu.cu:16: In file included from /home/rocm-user/eigen/unsupported/Eigen/CXX11/Tensor:29: In file included from /home/rocm-user/eigen/unsupported/Eigen/CXX11/../SpecialFunctions:75: /home/rocm-user/eigen/unsupported/Eigen/CXX11/../src/SpecialFunctions/arch/GPU/GpuSpecialFunctions.h:87:33: error: call to 'erf' is ambiguous return make_double2(erf(a.x), erf(a.y)); ^~~ /home/rocm-user/eigen/unsupported/test/../../Eigen/src/Core/MathFunctions.h:1603:8: note: candidate function [with T = double] double erf(const double &x) { return ::erf(x); } ^ /home/rocm-user/eigen/unsupported/Eigen/CXX11/../src/SpecialFunctions/SpecialFunctionsImpl.h:1897:5: note: candidate function [with Scalar = double] erf(const Scalar& x) { ^ 3 errors generated. ``` This PR fixes the compile error by removing the "old" implementation for "erf" (assuming that the "new" implementation is what we want going forward. from a GPU point-of-view both implementations are the same). This PR also fixes what seems like a cut-n-paste error in the aforementioned commit	2019-09-25 15:39:13 +00:00
Rasmus Larsen	d38e6fbc27	Merged in rmlarsen/eigen (pull request PR-704) Add generic PacketMath implementation of the Error Function (erf).	2019-09-24 23:40:29 +00:00
Rasmus Munk Larsen	591a554c68	Add TODO to cleanup FMA cost modelling.	2019-09-24 16:39:25 -07:00
Christoph Hertzberg	e4c1b3c1d2	Fix implicit conversion warnings and use pnegate to negate packets	2019-09-23 16:07:43 +02:00
Rasmus Munk Larsen	6de5ed08d8	Add generic PacketMath implementation of the Error Function (erf).	2019-09-19 12:48:30 -07:00
Srinivas Vasudevan	df0816b71f	Merging eigen/eigen.	2019-09-16 19:33:29 -04:00

1 2

80 Commits