Commit Graph

  • 1108b4f218 Fixed the signature of numext::abs to make it compatible with complex numbers Benoit Steiner 2016-04-04 11:09:25 -07:00
  • b8245cc325 Merged eigen/eigen into default tillahoffmann 2016-04-04 12:28:11 +01:00
  • 2b457f8e5e Fix cross-compiling windows version detection Gael Guennebaud 2016-04-04 11:47:46 +02:00
  • 30242b7565 Merged eigen/eigen into default Rasmus Larsen 2016-04-01 17:19:36 -07:00
  • 9d51f7c457 Add rcond method to LDLT. Rasmus Munk Larsen 2016-04-01 16:48:38 -07:00
  • f54137606e Add condition estimation to Cholesky (LLT) factorization. Rasmus Munk Larsen 2016-04-01 16:19:45 -07:00
  • fb8dccc23e Replace "inline static" with "static inline" for consistency. Rasmus Munk Larsen 2016-04-01 12:48:18 -07:00
  • 91414e0042 Fix comments in ConditionEstimator and minor cleanup. Rasmus Munk Larsen 2016-04-01 11:58:17 -07:00
  • 1aa89fb855 Add matrix condition estimator module that implements the Higham/Hager algorithm from http://www.maths.manchester.ac.uk/~higham/narep/narep135.pdf used in LPACK. Add rcond() methods to FullPivLU and PartialPivLU. Rasmus Munk Larsen 2016-04-01 10:27:59 -07:00
  • 80eba21ad0 Merge upstream. Till Hoffmann 2016-04-01 18:18:49 +01:00
  • eb0ae602bd Added CUDA tests. Till Hoffmann 2016-04-01 18:17:45 +01:00
  • ffd770ce94 Fixed CUDA signature. Till Hoffmann 2016-04-01 17:58:24 +01:00
  • 3cb0a237c1 Fixed suggestions by Eugene Brevdo. Till Hoffmann 2016-04-01 17:51:39 +01:00
  • 49960adbdd Merged eigen/eigen into default tillahoffmann 2016-04-01 14:36:15 +01:00
  • 57239f4a81 Added polygamma function. Till Hoffmann 2016-04-01 14:35:21 +01:00
  • dd5d390daf Added zeta function. Till Hoffmann 2016-04-01 13:32:29 +01:00
  • 3da495e6b9 Relaxed the condition used to gate the fft code. Benoit Steiner 2016-03-31 18:11:51 -07:00
  • 0ea7ab4f62 Hashing was only officially introduced in c++11. Therefore only define an implementation of the hash function for float16 if c++11 is enabled. Benoit Steiner 2016-03-31 14:44:55 -07:00
  • 92b7f7b650 Improved code formating Benoit Steiner 2016-03-31 13:09:58 -07:00
  • f197813f37 Added the ability to hash a fp16 Benoit Steiner 2016-03-31 13:09:23 -07:00
  • 0f5cc504fe Properly gate the fft code Benoit Steiner 2016-03-31 12:59:39 -07:00
  • 4c859181da Made it possible to use the NumTraits for complex and Array in a cuda kernel. Benoit Steiner 2016-03-31 12:48:38 -07:00
  • c36ab19902 Added __ldg primitive for fp16. Benoit Steiner 2016-03-31 10:55:03 -07:00
  • b575fb1d02 Added NumTraits for half floats Benoit Steiner 2016-03-31 10:43:59 -07:00
  • 8c8a79cec1 Fixed a typo Benoit Steiner 2016-03-31 10:33:32 -07:00
  • af4ef540bf Fixed a off-by-one bug in a debug assertion Benoit Steiner 2016-03-30 18:37:19 -07:00
  • 791e5cfb69 Added NumTraits for type2index. Benoit Steiner 2016-03-30 18:36:36 -07:00
  • 4f1a7e51c1 Pull math functions from the global namespace only when compiling cuda code with nvcc. When compiling with clang, we want to use the std namespace. Benoit Steiner 2016-03-30 17:59:49 -07:00
  • bc68fc2fe7 Enable constant expressions when compiling cuda code with clang. Benoit Steiner 2016-03-30 17:58:32 -07:00
  • 483aaad10a Fixed compilation warning Benoit Steiner 2016-03-30 17:08:13 -07:00
  • 1b40abbf99 Added missing assignment operator to the TensorUInt128 class, and made misc small improvements Benoit Steiner 2016-03-30 13:17:03 -07:00
  • 01b5333e44 bug #1186 - vreinterpretq_u64_f64 fails to build on Android/Aarch64/Clang toolchain Benoit Jacob 2016-03-30 11:02:33 -04:00
  • aa45ad2aac Fixed the formatting of the README. Benoit Steiner 2016-03-29 15:06:13 -07:00
  • 56df5ef1d7 Attempt to fix the formatting of the README Benoit Steiner 2016-03-29 15:03:38 -07:00
  • 1bcd82e31b Pulled latest updates from trunk Benoit Steiner 2016-03-29 13:36:18 -07:00
  • 09ad31aa85 Add regression test for nesting type handling in blas_traits Gael Guennebaud 2016-03-29 22:33:57 +02:00
  • 1841d6d4c3 Added missing cuda template specializations for numext::ceil Benoit Steiner 2016-03-29 13:29:34 -07:00
  • 7b7d2a9fa5 Use false instead of 0 as the expected value of a boolean Benoit Steiner 2016-03-29 11:50:17 -07:00
  • e02b784ec3 Added support for standard mathematical functions and trancendentals(such as exp, log, abs, ...) on fp16 Benoit Steiner 2016-03-29 09:20:36 -07:00
  • c38295f0a0 Added support for fmod Benoit Steiner 2016-03-28 15:53:02 -07:00
  • 6772f653c3 Made it possible to customize the threadpool Benoit Steiner 2016-03-28 10:01:04 -07:00
  • 1bc81f7889 Fixed compilation warnings on arm Benoit Steiner 2016-03-28 09:21:04 -07:00
  • 78f83d6f6a Prevent potential overflow. Benoit Steiner 2016-03-28 09:18:04 -07:00
  • 01e7298fe6 actually include ZVector files, passes most basic tests (float still fails) Konstantinos Margaritis 2016-03-28 10:58:02 -04:00
  • f48011119e Merged eigen/eigen into default Konstantinos Margaritis 2016-03-28 01:48:45 +03:00
  • ed6b9d08f1 some primitives ported, but missing intrinsics and crash with asm() are a problem Konstantinos Margaritis 2016-03-27 18:47:49 -04:00
  • 74f91ed06c Improved support for integer modulo Benoit Steiner 2016-03-25 17:21:56 -07:00
  • 65716e99a5 Improved the cost estimate of the quotient op Benoit Steiner 2016-03-25 11:13:53 -07:00
  • d94f6ba965 Started to model the cost of divisions more accurately. Benoit Steiner 2016-03-25 11:02:56 -07:00
  • a86c9f037b Fixed compilation error on windows Benoit Steiner 2016-03-24 18:54:31 -07:00
  • 0968e925a0 Updated the benchmarking code to use Eigen::half instead of half Benoit Steiner 2016-03-24 18:00:33 -07:00
  • 044efea965 Made sure that the cxx11_tensor_cuda test can be compiled even without support for cxx11. Benoit Steiner 2016-03-23 20:02:11 -07:00
  • 2e4e4cb74d Use numext::abs instead of abs to avoid incorrect conversion to integer of the argument Benoit Steiner 2016-03-23 16:57:12 -07:00
  • 41434a8a85 Avoid unnecessary conversions Benoit Steiner 2016-03-23 16:52:38 -07:00
  • 92693b50eb Fixed compilation warning Benoit Steiner 2016-03-23 16:40:36 -07:00
  • 9bc9396e88 Use portable includes Benoit Steiner 2016-03-23 16:30:06 -07:00
  • 393bc3b16b Added comment Benoit Steiner 2016-03-23 16:22:15 -07:00
  • 81d340984a Removed executable bit from header files Benoit Steiner 2016-03-23 16:15:02 -07:00
  • bff8cbad06 Removed executable bit from header files Benoit Steiner 2016-03-23 16:14:23 -07:00
  • 7a570e50ef Fixed contractions of fp16 Benoit Steiner 2016-03-23 16:00:06 -07:00
  • 7168afde5e Made the tensor benchmarks compile on MacOS Benoit Steiner 2016-03-23 14:21:04 -07:00
  • 2062ee2d26 Added a test to verify that notifications are working properly Benoit Steiner 2016-03-23 13:39:00 -07:00
  • fc3660285f Made type conversion explicit Benoit Steiner 2016-03-23 09:56:50 -07:00
  • 0e68882604 Added the ability to divide a half float by an index Benoit Steiner 2016-03-23 09:46:42 -07:00
  • 6971146ca9 Added more conversion operators for half floats Benoit Steiner 2016-03-23 09:44:52 -07:00
  • 9642fd7a93 Replace all M_PI by EIGEN_PI and add a check to the testsuite. Christoph Hertzberg 2016-03-23 15:37:45 +01:00
  • 28e02996df Merged patch 672 from Justin Lebar: Don't use long doubles with cuda Benoit Steiner 2016-03-22 16:53:57 -07:00
  • 3d1e857327 Fixed compilation error Benoit Steiner 2016-03-22 15:48:28 -07:00
  • de7d92c259 Pulled latest updates from trunk Benoit Steiner 2016-03-22 15:24:49 -07:00
  • 002cf0d1c9 Use a single Barrier instead of a collection of Notifications to reduce the thread synchronization overhead Benoit Steiner 2016-03-22 15:24:23 -07:00
  • bc2b802751 Fixed a couple of typos Benoit Steiner 2016-03-22 14:27:34 -07:00
  • e7a468c5b7 Filter some compilation flags that nvcc warns about. Benoit Steiner 2016-03-22 14:26:50 -07:00
  • 6a31b7be3e Avoid using std::vector whenever possible Benoit Steiner 2016-03-22 14:02:50 -07:00
  • 65a7113a36 Use an enum instead of a static const int to prevent possible link error Benoit Steiner 2016-03-22 09:33:54 -07:00
  • f9ad25e4d8 Fixed contractions of 16 bit floats Benoit Steiner 2016-03-22 09:30:23 -07:00
  • 8ef3181f15 Worked around a constness related issue Benoit Steiner 2016-03-21 11:24:05 -07:00
  • 7a07d6aa2b Small cleanup Benoit Steiner 2016-03-21 11:12:17 -07:00
  • a9a6710e15 add initial s390x(zEC13) ZVECTOR support Konstantinos Margaritis 2016-03-21 13:46:47 -04:00
  • e91f255301 Marked variables that's only used in debug mode as such Benoit Steiner 2016-03-21 10:02:00 -07:00
  • db5c14de42 Explicitly cast the default value into the proper scalar type. Benoit Steiner 2016-03-21 09:52:58 -07:00
  • b224771f40 bug #1178: Simplified modification of the SSE control register for better portability Christoph Hertzberg 2016-03-20 10:57:08 +01:00
  • 8e03333f06 Renamed some class members to make the code more readable. Benoit Steiner 2016-03-18 15:21:04 -07:00
  • 6c08943d9f Fixed a bug in the padding of extracted image patches. Benoit Steiner 2016-03-18 15:19:10 -07:00
  • 134d750eab Completed the implementation of vectorized type casting of half floats. Benoit Steiner 2016-03-18 13:36:28 -07:00
  • 7bd551b3a9 Make all the conversions explicit Benoit Steiner 2016-03-18 12:20:08 -07:00
  • bb0e73c191 Gate all the CUDA tests under the EIGEN_TEST_NVCC option Benoit Steiner 2016-03-18 12:17:37 -07:00
  • 2db4a04827 Fixed a typo Benoit Steiner 2016-03-18 12:08:01 -07:00
  • dd514de8a9 Added a test to validate the fallback path for half floats Benoit Steiner 2016-03-18 12:02:39 -07:00
  • 9a7ece9caf Worked around constness issue Benoit Steiner 2016-03-18 10:38:29 -07:00
  • edc679f6c6 Fixed compilation warning Benoit Steiner 2016-03-18 07:12:34 -07:00
  • 53d498ef06 Fixed compilation warnings in the cuda tests Benoit Steiner 2016-03-18 07:04:54 -07:00
  • e10e126cd0 pulled latest updates from trunk Benoit Steiner 2016-03-17 21:48:38 -07:00
  • 70eb70f5f8 Avoid mutable class members when possible Benoit Steiner 2016-03-17 21:47:18 -07:00
  • 7b98de1f15 Implemented some of the missing type casting for half floats Benoit Steiner 2016-03-17 21:45:45 -07:00
  • afb81b7ded Made sure to use the hard abi when compiling with NEON instructions to avoid the "gnu/stubs-soft.h: No such file or directory" error Benoit Steiner 2016-03-17 21:24:24 -07:00
  • 95b8961a9b Allocate the mersenne twister used by the random number generators on the heap instead of on the stack since they tend to keep a lot of state (i.e. about 5k) around. Benoit Steiner 2016-03-17 15:23:51 -07:00
  • f7329619da Fix bug in tensor contraction. The code assumes that contraction axis indices for the LHS (after possibly swapping to ColMajor!) is increasing. Explicitly sort the contraction axis pairs to make it so. Benoit Steiner 2016-03-17 15:08:02 -07:00
  • 46aa9772fc Merged in ebrevdo/eigen (pull request PR-169) Christoph Hertzberg 2016-03-16 21:59:08 +01:00
  • f1f7181f53 Merge default branch. Eugene Brevdo 2016-03-16 12:46:19 -07:00
  • 1f69a1b65f Change the header guard around certain numext functions to be CUDA specific. Eugene Brevdo 2016-03-16 12:44:35 -07:00