eigen

mirror of https://gitlab.com/libeigen/eigen.git synced 2026-04-10 11:34:33 +08:00

Files

Rasmus Munk Larsen 58c44ef36d GPU: Add library dispatch module (DeviceMatrix, cuBLAS, cuSOLVER)

Add Eigen/GPU module: A standalone GPU library dispatch layer where
DeviceMatrix<Scalar> operations map 1:1 to cuBLAS/cuSOLVER calls.
CPU and GPU solvers coexist in the same binary with compatible syntax.

Core infrastructure:
- DeviceMatrix<Scalar>: RAII dense column-major GPU memory wrapper with
  async host transfer (fromHost/toHost) and CUDA event-based cross-stream
  synchronization.
- GpuContext: Unified execution context owning a CUDA stream + cuBLAS
  handle + cuSOLVER handle. Thread-local default with explicit override
  via setThreadLocal(). Stream-borrowing constructor for integration.
- DeviceBuffer: Typed RAII device allocation with move semantics.

cuBLAS dispatch (expression syntax):
- GEMM: d_C = d_A.adjoint() * d_B (cublasXgemm)
- TRSM: d_X = d_A.triangularView<Lower>().solve(d_B) (cublasXtrsm)
- SYMM/HEMM: d_C = d_A.selfadjointView<Lower>() * d_B (cublasXsymm)
- SYRK/HERK: d_C = d_A * d_A.adjoint() (cublasXsyrk)

cuSOLVER dispatch:
- GpuLLT: Cached Cholesky factorization (cusolverDnXpotrf + Xpotrs)
- GpuLU: Cached LU factorization (cusolverDnXgetrf + Xgetrs)
- Solver chaining: auto x = d_A.llt().solve(d_B)
- Solver expressions with .device(ctx) for explicit stream control.

CI: Bump CUDA container to Ubuntu 22.04 (CMake 3.22), GCC 10->11,
Clang 12->14. Bump cmake_minimum_required to 3.17 for FindCUDAToolkit.

Tests: gpu_cublas.cpp, gpu_cusolver_llt.cpp, gpu_cusolver_lu.cpp,
gpu_device_matrix.cpp, gpu_library_example.cu
Benchmarks: bench_gpu_solvers.cpp, bench_gpu_chaining.cpp,
bench_gpu_batching.cpp

2026-04-09 19:05:25 -07:00

scripts

CI: split NVHPC build and make fallback parallelism configurable

2026-04-01 16:43:33 -07:00

benchmark.gitlab-ci.yml

Add nightly benchmark regression detection pipeline

2026-03-29 16:03:56 -07:00

build.linux.gitlab-ci.yml

GPU: Add library dispatch module (DeviceMatrix, cuBLAS, cuSOLVER)

2026-04-09 19:05:25 -07:00

build.windows.gitlab-ci.yml

GPU: Raise CUDA/HIP minimum and remove legacy guards

2026-04-09 15:21:39 -07:00

checkformat.gitlab-ci.yml

CI: Reduce artifact size, cache clang-tidy, fix test retry, throttle QEMU

2026-03-17 21:41:29 -07:00

common.gitlab-ci.yml

CI: drop Clang-6, bump base image to Ubuntu 24.04 and Clang 12 to 14

2026-03-30 22:00:17 -07:00

CTest2JUnit.xsl

Add possibility to split test suit build targets and improved CI configuration

2020-08-19 18:27:45 +00:00

deploy.gitlab-ci.yml

Only build docs on push to master branch, not MRs.

2025-08-29 18:33:09 +00:00

README.md

Fix formatting in README.md

2024-07-03 19:16:56 +00:00

test.linux.gitlab-ci.yml

GPU: Add library dispatch module (DeviceMatrix, cuBLAS, cuSOLVER)

2026-04-09 19:05:25 -07:00

test.windows.gitlab-ci.yml

GPU: Raise CUDA/HIP minimum and remove legacy guards

2026-04-09 15:21:39 -07:00

README.md

Eigen CI infrastructure

Eigen's CI infrastructure uses three stages:

A checkformat stage to verify MRs satisfy proper formatting style, as defined by clang-format.
A build stage to build the unit-tests.
A test stage to run the unit-tests.

For merge requests, only a small subset of tests are built/run, and only on a small subset of platforms. This is to reduce our overall testing infrastructure resource usage. In addition, we have nightly jobs that build and run the full suite of tests on most officially supported platforms.