Commit Graph

  • b1a17dbfe4 fix a few weird issues with gcc 4.3 32bits and complex<float> Gael Guennebaud 2010-07-09 08:27:58 +02:00
  • 504d3a3586 fix SliceVectorizedTraversal for packetsize==1 Gael Guennebaud 2010-07-08 23:31:14 +02:00
  • 51ec188da0 extend vectorization_logic Gael Guennebaud 2010-07-08 23:30:16 +02:00
  • 300a226ffa scalars fitting in a single packet requires more work, step 1 * add a, Alignable trait * update LinearVectorization assignment Gael Guennebaud 2010-07-08 14:27:47 +02:00
  • 2a1500915a compilation fix Gael Guennebaud 2010-07-08 14:26:00 +02:00
  • d89925e6de an attempt to fix wrong unaligned store Gael Guennebaud 2010-07-07 22:35:06 +02:00
  • 02fd3acd81 update to support mixin types Gael Guennebaud 2010-07-07 19:49:48 +02:00
  • 31a36aa9c4 support for real * complex matrix product - step 1 (works for some special cases) Gael Guennebaud 2010-07-07 19:49:09 +02:00
  • c64c0f382f Examples for DenseBase::middle{Rows,Cols}() Manoj Rajagopalan 2010-06-30 11:26:31 -04:00
  • 5c58582a08 Renamed DenseBase::{row,col}Range() to DenseBase::middle{Rows,Cols}() Manoj Rajagopalan 2010-06-29 14:31:39 -04:00
  • 6e5bed69dc Included tests for middleRows() and middleCols() Manoj Rajagopalan 2010-06-29 12:39:58 -04:00
  • 464fc297cf Included definitions for rowRange() and colRange() member functions of DenseBase Manoj Rajagopalan 2010-06-26 17:37:17 -04:00
  • 4b474fdb34 Relax assertion to allow for matrices with cols() == 0 and/or rows() == 0. Martin Senst 2010-07-20 21:25:43 +02:00
  • 95f2e7f3f5 introduce a new LvalueBit flag and split DenseCoeffBase into three level of accessors Gael Guennebaud 2010-07-21 10:57:01 +02:00
  • 3abbdfd621 Add (set)LinSpaced to quick reference guide. Jitse Niesen 2010-07-20 21:55:22 +01:00
  • abd5faf784 Require at least MPFR version 2.3.0, because we use mpfr_signbit. Code in FindMPFR.cmake is taken from FindEigen2.cmake . Jitse Niesen 2010-07-19 12:26:52 +01:00
  • cac147ba10 add support for determinant on empty matrix Gael Guennebaud 2010-07-19 10:45:06 +02:00
  • 78d3c54631 add a small bench demoing the possibilities of a direct 3x3 eigen decomposition Gael Guennebaud 2010-07-18 17:26:06 +02:00
  • ea27678153 fix compilation of ei_tridiagonalization_inplace_selector for 1x1 matrix Gael Guennebaud 2010-07-18 17:10:11 +02:00
  • 2a820d41df finish/fix level1 blas, all test pass Gael Guennebaud 2010-07-17 13:49:43 +02:00
  • dd27e10360 fix level3 blas: it now passes all computational tests Gael Guennebaud 2010-07-17 11:59:09 +02:00
  • 2d78023815 fix hemm to not use the imaginary part of the diagonal entries Gael Guennebaud 2010-07-17 11:57:54 +02:00
  • cbd6fe323c fix a couple a issue with blas (new TRMM api, and enforece column major) Gael Guennebaud 2010-07-16 23:30:06 +02:00
  • f59226e901 fix compilation of blas lib Gael Guennebaud 2010-07-16 22:27:24 +02:00
  • 4c19024fbf re-enable writing to reversed objects Gael Guennebaud 2010-07-16 22:26:07 +02:00
  • fb041c260c fix for empty matrices Gael Guennebaud 2010-07-16 22:25:35 +02:00
  • 883a8cbb2c disable the optimized 3x3 path for complexes which was not working at all Gael Guennebaud 2010-07-16 18:22:00 +02:00
  • 6ab9e8632f fix bad fuzzy comparison in 3x3 tridiagonalization Gael Guennebaud 2010-07-16 16:38:58 +02:00
  • 044424b0e2 fix sum()/prod() on empty matrix making sure this does not affect fixed sized object, extend related unit tests including partial reduction Gael Guennebaud 2010-07-16 14:02:20 +02:00
  • 6a370f50c7 MPRealSupport was missing Gael Guennebaud 2010-07-15 20:45:45 +02:00
  • b08c26aefa merge Gael Guennebaud 2010-07-15 20:41:33 +02:00
  • 84fdbded4d add support for strictly triangular matrix in trmm though it is not really useful Gael Guennebaud 2010-07-15 20:39:20 +02:00
  • 87e89fea4e add a support module for MPFR C++ with basic unit testing Gael Guennebaud 2010-07-15 16:29:04 +02:00
  • bfbe61454e merge Gael Guennebaud 2010-07-15 09:54:31 +02:00
  • cf9edd9958 fix compilation for non trivial types Gael Guennebaud 2010-07-14 23:31:38 +02:00
  • b6fac91998 merge Gael Guennebaud 2010-07-14 22:51:53 +02:00
  • d4d4382b18 use dummy_precision by default instead of 0 Gael Guennebaud 2010-07-14 22:50:03 +02:00
  • 90d6fc0e28 fix ei_aligned_delete for null pointers and non trivial dtors Gael Guennebaud 2010-07-14 22:49:34 +02:00
  • b0bd1cfa05 Tutorial page 4: add some text, diversify examples. Use \verbinclude for output text to disable syntax highlighting. Give tables consistent look. Jitse Niesen 2010-07-14 10:16:12 +01:00
  • e4f3759c4d add a bench for quaternion multiplication Gael Guennebaud 2010-07-13 13:29:35 +02:00
  • 6ba5d2c90c Implemented SSE optimized double-precision Quaternion multiplication Christoph Hertzberg 2010-07-12 23:30:47 +02:00
  • c36316f284 Change EXPAND_AS_DEFINED doxygen configuration option. Add macros so that MatrixBase::cwiseProduct() and ArrayBase::min() are documented, and remove one macro which is no longer used. Jitse Niesen 2010-07-13 10:14:58 +01:00
  • 140ad0908d Tutorial page 3: add more cwise operations, condense rest. Jitse Niesen 2010-07-12 22:45:57 +01:00
  • 8e776c94c1 Tutorial page 1: Put code and output side-by-side. Jitse Niesen 2010-07-12 12:02:31 +01:00
  • 19a70ae939 fix doc compilation on non 32bits systems Gael Guennebaud 2010-07-11 11:01:17 +02:00
  • 850c6d8a2b fix unused warning Gael Guennebaud 2010-07-11 10:58:58 +02:00
  • e5bc9526f1 * generalize rowmajor by vector * fix weird compilation error when constructing a matrix with a row by matrix product Gael Guennebaud 2010-07-10 22:53:27 +02:00
  • c4ef69b5bd fix compilation: make the check_coordinates* functions const Gael Guennebaud 2010-07-10 22:37:16 +02:00
  • 6dcd373b9d let ei_pset1 use _mm_loaddup_pd. Not a significant speed improvement, but also not a speed regression, and replaces 3 instructions by 1 single instruction. Benoit Jacob 2010-07-09 18:51:17 -04:00
  • 6ad3f1ab1f Added NEON/Complex.h, ~3.5x faster than scalar std::complex<float> minor fix in AltiVec Complex.h Konstantinos Margaritis 2010-07-10 00:09:29 +03:00
  • 96f9015807 disable MSVC optimization when the underlying compiler is ICC Gael Guennebaud 2010-07-09 19:33:43 +02:00
  • b2effa2b2c move ei_conj_if to a more appropriate file Gael Guennebaud 2010-07-09 18:05:57 +02:00
  • 642cc27eb1 forgot to commit ei_p4f_FORWARD; Konstantinos Margaritis 2010-07-09 18:08:18 +03:00
  • f6bd508351 forgot to add the Complex.h include for AltiVec. Konstantinos Margaritis 2010-07-09 17:56:53 +03:00
  • d9e134c73c Altivec port of Complex.h. Note: For some reason g++ 4.4 is >200% slower than g++ 4.3 on altivec code. The same benchmark (bench_gemm) was tested, on the same hardware/OS (G4/Debian testing), with same CFLAGS. With some code reorganizing I managed to get some minor gain on 4.4, but I just could not reach 4.3 speed. This is most likely a bug, but I'm waiting to see if it's fixed on 4.5. I'll look into this a bit more. Konstantinos Margaritis 2010-07-09 17:54:41 +03:00
  • 26cfe5a958 Be consistent in how the tutorial pages link together. Jitse Niesen 2010-07-09 11:59:29 +01:00
  • 2c03ca3325 Small changes to tutorial page 2 (matrix arithmetic): * slightly more extensive discussion of aliasing * layout: put example code and output side-by-side * add some links, etc Jitse Niesen 2010-07-09 11:46:07 +01:00
  • 551cb9b7b4 bench: use of Eigen/Array is deprecated + fix includes for iostream Thomas Capricelli 2010-07-09 03:59:36 +02:00
  • 951da96f14 Added more redux types/examples in tutorial and fixed some display issues Carlos Becker 2010-07-08 18:16:39 +01:00
  • cb3aad1d91 Reductions/Broadcasting/Visitor Tutorial added to index Carlos Becker 2010-07-08 17:45:25 +01:00
  • 9852e7b9cb Reductions/Broadcasting/Visitor Tutorial added Carlos Becker 2010-07-08 17:42:23 +01:00
  • 2066ed91de enabling aligned loads/store for complex<double> is much more tricky, so the temporary fix is to always perform unaligned load/store Gael Guennebaud 2010-07-07 22:50:19 +02:00
  • fc3fd8ab57 mention that array = matrix is fine too Gael Guennebaud 2010-07-07 18:10:11 +02:00
  • 861962c55f sync Gael Guennebaud 2010-07-07 16:44:05 +02:00
  • 0f2d480af0 add support for complex Gael Guennebaud 2010-07-07 16:41:29 +02:00
  • a2415388ef optimized conjugate products for SSE3 Gael Guennebaud 2010-07-07 16:37:20 +02:00
  • 65257f6b29 optimize for SSE3 => significant speed up !! Gael Guennebaud 2010-07-07 15:34:46 +02:00
  • dd18b22f0b optimize pmul for complex<double> Gael Guennebaud 2010-07-07 15:29:04 +02:00
  • 845994f18f optimize gemv for complex<double> and fix gcc alignment issue in 32bits Gael Guennebaud 2010-07-07 15:28:41 +02:00
  • e07c0f6bb5 cleanning Gael Guennebaud 2010-07-07 11:41:29 +02:00
  • 3a7f16a655 typo Gael Guennebaud 2010-07-07 11:13:30 +02:00
  • b0896382a3 s/IsVectorized/Vectorizable Gael Guennebaud 2010-07-07 11:10:46 +02:00
  • 74cf12cbe0 add a compile time error if someone call packet on Diagonal (instead of infinite runtime loop) Gael Guennebaud 2010-07-07 11:07:12 +02:00
  • d5e0efaf69 fix vectorization rule of diagonal-product Gael Guennebaud 2010-07-07 11:06:31 +02:00
  • c851044eae fix row cwise-prod column in coeff based products... I really don't know why this worked so far... Gael Guennebaud 2010-07-07 10:52:59 +02:00
  • 55495dcbae extend product unit tests Gael Guennebaud 2010-07-07 10:50:40 +02:00
  • e38fc9692d add a conj_product functor and optimize dot products Gael Guennebaud 2010-07-07 10:00:08 +02:00
  • f8d3b4c060 fix mixing types in DiagonalProduct Gael Guennebaud 2010-07-07 09:43:29 +02:00
  • bfa606d16f * add a IsVectorized mechanism (instead of packet-size>1...) * vectorize complex<double> Gael Guennebaud 2010-07-06 23:36:00 +02:00
  • 38d0a0d5d6 add a unit test for previous bug Gael Guennebaud 2010-07-06 20:54:35 +02:00
  • 2dba4b7ce7 add a unit test for conj_helper and ei_pconj Gael Guennebaud 2010-07-06 20:54:14 +02:00
  • bc57c68cf5 bug fix forgot to conjugate the scalar factor when needed Gael Guennebaud 2010-07-06 20:53:48 +02:00
  • e04c3f2cc0 reduce code generation and minor speed up Gael Guennebaud 2010-07-06 19:15:02 +02:00
  • d6454788d9 add support for vectorized conjugated products Gael Guennebaud 2010-07-06 19:10:24 +02:00
  • 291fef5760 fix range Gael Guennebaud 2010-07-06 19:09:31 +02:00
  • 49747fa4a9 Various documentation improvements. * Add short documentation for Array class * Put all classes explicitly in Core module (where applicable) * Section on Modules in Quick Reference Guide * Put Page 7 after Page 6 in Contents :) Jitse Niesen 2010-07-06 13:10:08 +01:00
  • 3428d80d20 Small changes to tutorial page 1. Jitse Niesen 2010-07-06 10:48:25 +01:00
  • d849bc4401 Avoid calling resizeLike, if EIGEN_NO_AUTOMATIC_RESIZING is defined Jens Mueller 2010-07-06 10:11:18 +02:00
  • 5322b670c8 Add all unsupported modules and fix header file paths Jens Mueller 2010-07-06 10:25:52 +02:00
  • 7d23e7f9f1 indentation Gael Guennebaud 2010-07-06 11:02:01 +02:00
  • d1243b393e Added tag 3.0-beta1 for changeset 8cfbf33f60 Benoit Jacob 2010-07-06 00:50:30 -04:00
  • 8cfbf33f60 fix the overview page and add mention that the wrong stack alignment issue may have been solved by gcc 4.5 3.0-beta1 Benoit Jacob 2010-07-06 00:50:16 -04:00
  • c69a226192 * extend the Has* packet traits and makes all functor use it * extend the packing routines to support conjugation Gael Guennebaud 2010-07-05 23:27:54 +02:00
  • 8db60afb47 oops I did not see that Gael Guennebaud 2010-07-05 21:27:15 +02:00
  • e1eccfad3f add intitial support for the vectorization of complex<float> Gael Guennebaud 2010-07-05 16:18:09 +02:00
  • 1505221263 add check for non x86 platforms, we get a compile error on arm/powerpc without the check (there is no known -yet- method to get cpuid, without resolving to kernel /sys interface) Konstantinos Margaritis 2010-07-05 16:44:41 +03:00
  • 1daf9b11ba check for !x86 platforms, otherwise the BTL benchmark doesn't compile on arm/powerpc Konstantinos Margaritis 2010-07-05 16:42:11 +03:00
  • 9fa4e9a098 Improve documentation, mostly by adding links to Quick Start Guide. Jitse Niesen 2010-07-05 10:59:29 +01:00
  • efb79600b9 fix warning "type qualifiers ignored on function return type" for long long scalar types Gael Guennebaud 2010-07-05 11:23:05 +02:00
  • 15a421ef63 char is not necessarily signed.... Gael Guennebaud 2010-07-05 11:15:08 +02:00