eigen

mirror of https://gitlab.com/libeigen/eigen.git synced 2026-04-10 11:34:33 +08:00

Author	SHA1	Message	Date
Gael Guennebaud	4dd767f455	add some internal checks	2018-05-18 13:59:55 +02:00
Gael Guennebaud	e9da464e20	Add specializations of is_arithmetic for long long in c++11	2018-04-23 16:26:29 +02:00
Christoph Hertzberg	34e499ad36	Disable -Wshadow when compiling with g++	2018-04-21 22:08:26 +02:00
Gael Guennebaud	7a9089c33c	fix linking issue	2018-04-13 08:51:47 +02:00
Gael Guennebaud	e43ca0320d	bug #1520 : workaround some -Wfloat-equal warnings by calling std::equal_to	2018-04-11 15:24:13 +02:00
Gael Guennebaud	b903fa74fd	Extend list of MSVC versions	2018-04-04 15:14:09 +02:00
Gael Guennebaud	8c7b5158a1	commit 45e9c9996da790b55ed9c4b0dfeae49492ac5c46 (HEAD -> memory_fix) Author: George Burgess IV <gbiv@google.com> Date: Thu Mar 1 11:20:24 2018 -0800 Prefer `::operator new` to `new` The C++ standard allows compilers much flexibility with `new` expressions, including eliding them entirely (https://godbolt.org/g/yS6i91). However, calls to `operator new` are required to be treated like opaque function calls. Since we're calling `new` for side-effects other than allocating heap memory, we should prefer the less flexible version. Signed-off-by: George Burgess IV <gbiv@google.com>	2018-04-03 17:15:38 +02:00
luz.paz	e3912f5e63	Misc. source and comment typos Found using `codespell` and `grep` from downstream FreeCAD	2018-03-11 10:01:44 -04:00
Gael Guennebaud	f6be7289d7	Implement better static assertion checking to make sure that the first assertion is a static one and not a runtime one.	2018-03-09 10:00:51 +01:00
Gael Guennebaud	d820ab9edc	Add static assertion on selfadjoint-view's UpLo parameter.	2018-03-09 09:33:43 +01:00
Gael Guennebaud	546ab97d76	Add possibility to overwrite EIGEN_STRONG_INLINE.	2017-12-14 14:47:38 +01:00
Gael Guennebaud	f86bb89d39	Add EIGEN_MKL_NO_DIRECT_CALL option	2017-11-09 11:07:45 +01:00
Gael Guennebaud	5fa79f96b8	Patch from Konstantin Arturov to enable MKL's direct call by default	2017-11-09 10:58:38 +01:00
Benoit Steiner	f16ba2a630	Merged in LaFeuille/eigen-1/LaFeuille/typo-fix-alignmeent-alignment-1505889397887 (pull request PR-335) Typo fix alignmeent ->alignment	2017-10-21 01:59:55 +00:00
Gael Guennebaud	8579195169	bug #1468 (1/2) : add missing std:: to memcpy	2017-09-22 09:23:24 +02:00
Gael Guennebaud	7ad07fc6f2	Update documentation for aligned_allocator	2017-09-20 10:22:00 +02:00
LaFeuille	7c9b07dc5c	Typo fix alignmeent ->alignment	2017-09-20 06:38:39 +00:00
Christoph Hertzberg	23f8b00bc8	clang provides __has_feature(is_enum) (but not <type_traits>) in C++03 mode	2017-09-14 19:26:03 +02:00
Christoph Hertzberg	0c9ad2f525	std::integral_constant is not C++03 compatible	2017-09-14 19:23:38 +02:00
Gael Guennebaud	6d42309f13	Fix compilation of Vector::operator()(enum) by treating enums as Index	2017-09-07 14:34:30 +02:00
Gael Guennebaud	21633e585b	bug #1462 : remove all occurrences of the deprecated __CUDACC_VER__ macro by introducing EIGEN_CUDACC_VER	2017-08-24 11:06:47 +02:00
Gael Guennebaud	8c858bd891	Clarify doc regarding the usage of MKL_DIRECT_CALL	2017-08-17 12:17:45 +02:00
Gael Guennebaud	b95f92843c	Fix support for MKL's BLAS when using MKL_DIRECT_CALL.	2017-08-17 12:07:10 +02:00
Gael Guennebaud	9f8136ff74	disable nvcc boolean-expr-is-constant warning	2017-07-17 10:43:18 +02:00
Gael Guennebaud	bbd97b4095	Add a EIGEN_NO_CUDA option, and introduce EIGEN_CUDACC and EIGEN_CUDA_ARCH aliases	2017-07-17 01:02:51 +02:00
Gael Guennebaud	b651ce0ffa	Fix a gcc7 warning: Wint-in-bool-context	2017-06-26 09:58:28 +02:00
Gael Guennebaud	c3e2afce0d	Enable MSVC 2010 workaround from MSVC only	2017-06-09 16:25:18 +02:00
Abhijit Kundu	4343db84d8	updated warning number for nvcc release 8 (V8.0.61) for the stupid warning message 'calling a __host__ function from a __host__ __device__ function is not allowed'.	2017-05-01 10:36:27 -04:00
Gael Guennebaud	aae19c70ac	update has_ReturnType to be more consistent with other has_ helpers	2017-03-17 17:33:15 +01:00
Benoit Steiner	09ae0e6586	Adjusted the EIGEN_DEVICE_FUNC qualifiers to make sure that: * they're used consistently between the declaration and the definition of a function * we avoid calling host only methods from host device methods.	2017-03-01 11:47:47 -08:00
Benoit Steiner	7b61944669	Made most of the packet math primitives usable within CUDA kernel when compiling with clang	2017-02-28 17:05:28 -08:00
Gael Guennebaud	63798df038	Fix usage of CUDACC_VER	2017-02-20 08:16:36 +01:00
Gael Guennebaud	5937c4ae32	Fall back is_integral to std::is_integral in c++11	2017-02-13 17:14:26 +01:00
Jonathan Hseu	3453b00a1e	Fix vector indexing with uint64_t	2017-02-11 21:45:32 -08:00
Gael Guennebaud	e43016367a	Forgot to include a file in previous commit	2017-02-11 10:34:18 +01:00
Benoit Steiner	8b3cc54c42	Added a new EIGEN_HAS_INDEXED_VIEW define that is set to 0 for older compilers that are known to fail to compile the indexed views (I used the define from the indexed_views.cpp test). Only include the indexed view methods when the compiler supports the code. This makes it possible to use Eigen again in complex code bases such as TensorFlow and with older compilers such as gcc 4.8	2017-02-10 13:08:49 -08:00
Gael Guennebaud	0256c52359	Include clang in the list of non strict MSVC (just to be sure)	2017-02-10 13:41:52 +01:00
Gael Guennebaud	0eceea4efd	Define EIGEN_COMP_GNUC to reflect version number: 47, 48, 49, 50, 60, ...	2017-02-01 23:36:40 +01:00
Gael Guennebaud	d024e9942d	MSVC 1900 release is not c++14 compatible enough for us. The 1910 update seems to be fine though.	2017-01-27 22:17:59 +01:00
Rasmus Munk Larsen	edaa0fc5d1	Revert PR-292. After further investigation, the memcpy->memmove change was only good for Haswell on older versions of glibc. Adding a switch for small sizes is perhaps useful for string copies, but also has an overhead for larger sizes, making it a poor trade-off for general memcpy. This PR also removes a couple of unnecessary semi-colons in Eigen/src/Core/AssignEvaluator.h that caused compiler warning everywhere.	2017-01-26 12:46:06 -08:00
Gael Guennebaud	25a1703579	Merged in ggael/eigen-flexidexing (pull request PR-294) generalized operator() for indexed access and slicing	2017-01-26 08:04:23 +00:00
Gael Guennebaud	28351073d8	Fix unnamed type as template argument (ok in c++11 only)	2017-01-25 22:54:51 +01:00
Gael Guennebaud	607be65a03	Fix duplicates of array_size between unsupported and Core	2017-01-25 22:53:58 +01:00
Gael Guennebaud	296d24be4d	bug #1381 : fix sparse.diagonal() used as a rvalue. The problem was that if "sparse" is not const, then sparse.diagonal() must have the LValueBit flag, meaning that sparse.diagonal().coeff(i) must return a const reference, const Scalar&. However, sparse::coeff() cannot return a reference for a non-existing zero coefficient. The trick is to return a reference to a local member of evaluator<SparseMatrix>.	2017-01-25 17:39:01 +01:00
Rasmus Munk Larsen	3be5ee2352	Update copy helper to use fast_memcpy.	2017-01-24 14:22:49 -08:00
Rasmus Munk Larsen	e6b1020221	Adds a fast memcpy function to Eigen. This takes advantage of the following:
1. For small fixed sizes, the compiler generates inline code for memcpy, which is much faster.
2. My colleague eriche at googl dot com discovered that for large sizes, memmove is significantly faster than memcpy (at least on Linux with GCC or Clang). See benchmark numbers measured on a Haswell (HP Z440) workstation here: https://docs.google.com/a/google.com/spreadsheets/d/1jLs5bKzXwhpTySw65MhG1pZpsIwkszZqQTjwrd_n0ic/pubhtml This is of course surprising since memcpy is a less constrained version of memmove. This stackoverflow thread contains some speculation as to the causes: http://stackoverflow.com/questions/22793669/poor-memcpy-performance-on-linux

Below are numbers for copying and slicing tensors using the multithreaded TensorDevice. The numbers show significant improvements for memcpy of very small blocks and for memcpy of large blocks single threaded (we were already able to saturate memory bandwidth for >1 threads before on large blocks). The "slicingSmallPieces" benchmark also shows small consistent improvements, since memcpy cost is a fair portion of that particular computation. The benchmarks operate on NxN matrices, and the names are of the form BM_$OP_${NUMTHREADS}T/${N}.

Measured improvements in wall clock time:
Run on rmlarsen3.mtv (12 X 3501 MHz CPUs); 2017-01-20T11:26:31.493023454-08:00
CPU: Intel Haswell with HyperThreading (6 cores) dL1:32KB dL2:256KB dL3:15MB

Benchmark                       Base (ns)   New (ns)    Improvement
------------------------------------------------------------------
BM_memcpy_1T/2                  3.48        2.39        +31.3%
BM_memcpy_1T/8                  12.3        6.51        +47.0%
BM_memcpy_1T/64                 371         383         -3.2%
BM_memcpy_1T/512                66922       66720       +0.3%
BM_memcpy_1T/4k                 9892867     6849682     +30.8%
BM_memcpy_1T/5k                 14951099    10332856    +30.9%
BM_memcpy_2T/2                  3.50        2.46        +29.7%
BM_memcpy_2T/8                  12.3        7.66        +37.7%
BM_memcpy_2T/64                 371         376         -1.3%
BM_memcpy_2T/512                66652       66788       -0.2%
BM_memcpy_2T/4k                 6145012     6117776     +0.4%
BM_memcpy_2T/5k                 9181478     9010942     +1.9%
BM_memcpy_4T/2                  3.47        2.47        +31.0%
BM_memcpy_4T/8                  12.3        6.67        +45.8%
BM_memcpy_4T/64                 374         376         -0.5%
BM_memcpy_4T/512                67833       68019       -0.3%
BM_memcpy_4T/4k                 5057425     5188253     -2.6%
BM_memcpy_4T/5k                 7555638     7779468     -3.0%
BM_memcpy_6T/2                  3.51        2.50        +28.8%
BM_memcpy_6T/8                  12.3        7.61        +38.1%
BM_memcpy_6T/64                 373         378         -1.3%
BM_memcpy_6T/512                66871       66774       +0.1%
BM_memcpy_6T/4k                 5112975     5233502     -2.4%
BM_memcpy_6T/5k                 7614180     7772246     -2.1%
BM_memcpy_8T/2                  3.47        2.41        +30.5%
BM_memcpy_8T/8                  12.4        10.5        +15.3%
BM_memcpy_8T/64                 372         388         -4.3%
BM_memcpy_8T/512                67373       66588       +1.2%
BM_memcpy_8T/4k                 5148462     5254897     -2.1%
BM_memcpy_8T/5k                 7660989     7799058     -1.8%
BM_memcpy_12T/2                 3.50        2.40        +31.4%
BM_memcpy_12T/8                 12.4        7.55        +39.1%
BM_memcpy_12T/64                374         378         -1.1%
BM_memcpy_12T/512               67132       66683       +0.7%
BM_memcpy_12T/4k                5185125     5292920     -2.1%
BM_memcpy_12T/5k                7717284     7942684     -2.9%
BM_slicingSmallPieces_1T/2      47.3        47.5        +0.4%
BM_slicingSmallPieces_1T/8      53.6        52.3        +2.4%
BM_slicingSmallPieces_1T/64     491         476         +3.1%
BM_slicingSmallPieces_1T/512    21734       18814       +13.4%
BM_slicingSmallPieces_1T/4k     394660      396760      -0.5%
BM_slicingSmallPieces_1T/5k     218722      209244      +4.3%
BM_slicingSmallPieces_2T/2      80.7        79.9        +1.0%
BM_slicingSmallPieces_2T/8      54.2        53.1        +2.0%
BM_slicingSmallPieces_2T/64     497         477         +4.0%
BM_slicingSmallPieces_2T/512    21732       18822       +13.4%
BM_slicingSmallPieces_2T/4k     392885      390490      +0.6%
BM_slicingSmallPieces_2T/5k     221988      208678      +6.0%
BM_slicingSmallPieces_4T/2      80.8        80.1        +0.9%
BM_slicingSmallPieces_4T/8      54.1        53.2        +1.7%
BM_slicingSmallPieces_4T/64     493         476         +3.4%
BM_slicingSmallPieces_4T/512    21702       18758       +13.6%
BM_slicingSmallPieces_4T/4k     393962      404023      -2.6%
BM_slicingSmallPieces_4T/5k     249667      211732      +15.2%
BM_slicingSmallPieces_6T/2      80.5        80.1        +0.5%
BM_slicingSmallPieces_6T/8      54.4        53.4        +1.8%
BM_slicingSmallPieces_6T/64     488         478         +2.0%
BM_slicingSmallPieces_6T/512    21719       18841       +13.3%
BM_slicingSmallPieces_6T/4k     394950      397583      -0.7%
BM_slicingSmallPieces_6T/5k     223080      210148      +5.8%
BM_slicingSmallPieces_8T/2      81.2        80.4        +1.0%
BM_slicingSmallPieces_8T/8      58.1        53.5        +7.9%
BM_slicingSmallPieces_8T/64     489         480         +1.8%
BM_slicingSmallPieces_8T/512    21586       18798       +12.9%
BM_slicingSmallPieces_8T/4k     394592      400165      -1.4%
BM_slicingSmallPieces_8T/5k     219688      208301      +5.2%
BM_slicingSmallPieces_12T/2     80.2        79.8        +0.7%
BM_slicingSmallPieces_12T/8     54.4        53.4        +1.8%
BM_slicingSmallPieces_12T/64    488         476         +2.5%
BM_slicingSmallPieces_12T/512   21931       18831       +14.1%
BM_slicingSmallPieces_12T/4k    393962      396541      -0.7%
BM_slicingSmallPieces_12T/5k    218803      207965      +5.0%	2017-01-24 13:55:18 -08:00
Gael Guennebaud	d83db761a2	Add support for std::integral_constant	2017-01-24 16:28:12 +01:00
Gael Guennebaud	bc10201854	Add test for multiple symbols	2017-01-24 16:27:51 +01:00
Gael Guennebaud	ddd83f82d8	Add support for "SymbolicExpr op fix<N>" in C++98/11 mode.	2017-01-24 10:54:42 +01:00
Gael Guennebaud	228fef1b3a	Extended the set of arithmetic operators supported by FixedInt (-,+,*,/,%,&,\|)	2017-01-24 10:53:51 +01:00


1130 Commits
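
Commit e43ca0320d in the log above (bug #1520) says it works around some -Wfloat-equal warnings by calling std::equal_to. The diff itself is not shown on this page, so the following is only a minimal sketch of that general idea. The helper name equal_strict is hypothetical and is not an Eigen function; whether a given compiler actually stays quiet for this spelling is an assumption that depends on how it treats code from standard-library headers.

```cpp
#include <functional>

// Hypothetical helper (not part of Eigen): performs an exact equality
// comparison through std::equal_to instead of spelling `a == b` at the
// call site. The result is identical to `a == b`; only the place where
// the `==` expression appears changes (inside the standard library).
template <typename T>
bool equal_strict(const T& a, const T& b) {
  return std::equal_to<T>()(a, b);
}

// Example use: an exact-zero test on a floating-point value, written
// without a direct `x == 0` expression in user code.
template <typename T>
bool is_exactly_zero(const T& x) {
  return equal_strict(x, T(0));
}
```

A call such as is_exactly_zero(0.0) gives the same answer as 0.0 == 0.0 would. The comparison is still exact, so this does not replace a tolerance-based check where one is needed.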