Rasmus Munk Larsen 014f12f11a GPU: Add BLAS-1 ops, DeviceScalar, device-resident SpMV, and CG interop (5/5)
Add the operator interface needed for GPU iterative solvers:

- BLAS Level-1 on DeviceMatrix: dot(), norm(), squaredNorm(), setZero(),
  noalias(), operator+=/-=/*= dispatching to cuBLAS axpy/scal/dot/nrm2.
- DeviceScalar<Scalar>: device-resident scalar returned by reductions.
  Defers host sync until value is read (implicit conversion). Device-side
  division via NPP for real types.
- GpuContext: stream-borrowing constructor, setThreadLocal(), cublasLtHandle(),
  cusparseHandle().
- GEMM upgraded from cublasGemmEx to cublasLtMatmul with heuristic algorithm
  selection and plan caching.
- GpuSparseContext: GpuContext& constructor for same-stream execution,
  deviceView() returning DeviceSparseView with operator* for device-resident
  SpMV (d_y = d_A * d_x).
- geam expressions: d_C = d_A + alpha * d_B via cublasXgeam.
- GpuSVD::matrixV() convenience wrapper.

These additions make DeviceMatrix usable as a VectorType in Eigen algorithm
templates. Conjugate gradient is the motivating example and is tested against
CPU ConjugateGradient for correctness.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-04-09 20:19:59 -07:00