Add sparse Lanczos SVD solver by Intron7 · Pull Request #3034 · NVIDIA/raft

Intron7 · 2026-05-21T16:23:47Z

As discussed previously with @cjnolet I'm also adding my Lanczos SVD solver for sparse CSR matrices.

This is the more precise sparse SVD path next to the existing randomized solver. The solver repeatedly applies A @ v and A.T @ u to build Krylov bases, computes the SVD of the small bidiagonal problem, uses the resulting Ritz vectors to identify converged singular triplets, locks those vectors, and restarts on the remaining unconverged part. It also uses full reorthogonalization and a final A @ V refinement step to improve singular values and left singular vectors.

Compared with randomized SVD, this is aimed at quality: clustered spectra, slow singular-value decay, near-rank-deficient inputs, and PCA workloads where ARPACK-like accuracy matters.

Ran the #2999 -style row sweep on the same singlecell dataset, k=50, n_oversamples=10, n_power_iters=2, best-of-3 GPU timings. GPU: RTX PRO 6000 Blackwell.



  ┌──────┬───────┬─────────────────┬──────────────┬─────────────────────────┬─────────────────────┬──────────────────┐
  │ rows │   nnz │ raft randomized │ raft Lanczos │          Lanczos / rand │ randomized residual │ Lanczos residual │
  ├──────┼───────┼─────────────────┼──────────────┼─────────────────────────┼─────────────────────┼──────────────────┤
  │  50k │  101M │          0.180s │       0.252s │            1.40x slower │            3.11e-02 │         1.55e-07 │
  │ 200k │  400M │          0.698s │       1.328s │            1.90x slower │            3.12e-02 │         1.57e-07 │
  │ 500k │ 1.02B │          1.776s │       1.763s │          basically tied │            3.06e-02 │         1.57e-07 │
  │ 982k │ 2.01B │          3.536s │       3.423s │ Lanczos slightly faster │            3.09e-02 │         1.56e-07 │
  └──────┴───────┴─────────────────┴──────────────┴─────────────────────────┴─────────────────────┴──────────────────┘

Using the 2999 CPU baselines for context:

  ┌──────┬─────────────┬──────────────┬─────────────────────────────┬──────────────────────────┐
  │ rows │ sklearn CPU │ scipy ARPACK │ randomized speedup vs scipy │ Lanczos speedup vs scipy │
  ├──────┼─────────────┼──────────────┼─────────────────────────────┼──────────────────────────┤
  │  50k │       7.38s │       18.69s │                        104x │                      74x │
  │ 200k │      25.61s │       62.49s │                         90x │                      47x │
  │ 500k │      64.97s │      153.40s │                         86x │                      87x │
  │ 982k │     126.16s │      307.04s │                         87x │                      90x │
  └──────┴─────────────┴──────────────┴─────────────────────────────┴──────────────────────────┘

copy-pr-bot · 2026-05-21T16:23:50Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

aamijar · 2026-05-26T22:40:40Z

Hi @Intron7, I haven't looked at this PR closely yet, but one thing to think about is that we should try and reuse parts of the existing lanczos eigensolver as much as possible. They should have some things in common right?

aamijar · 2026-05-27T03:20:22Z

/ok to test eabeaa4

Intron7 · 2026-06-03T13:58:07Z

@aamijar the algorithm is different even is the name is similar. The two paths share the Lanczos name but the kernels are different algorithms: the eigensolver builds a symmetric tridiagonal via a one-vector recurrence and ritz-solves with syevd; the SVD builds a bidiagonal via Golub-Kahan with two coupled bases (A @ v and Aᵀ @ u) and ritz-solves with gesvdj. Restart is also different, the SVD path locks converged singular triplets and restarts on the unconverged subspace.
The realistic shared surface is the reorthogonalization helpers (CGS2/MGS2) and the cublas wrapper calls. The existing lanczos.cuh does its reorthogonalization inline against its own single-vector layout, so factoring CGS2/MGS2 into a shared utility would require touching the existing eigensolver too. I'd rather land this PR as-is and do a separate refactor PR to extract a shared bidiag_reorth /lanczos_reorth utility if you want.

cjnolet · 2026-06-04T00:05:22Z

/ok to test b80ca30

cjnolet · 2026-06-23T16:24:30Z

/ok to test 93e9a19

Add sparse Lanczos SVD solver

eabeaa4

Intron7 requested review from a team as code owners May 21, 2026 16:23

github-project-automation Bot added this to Unstructured Data Processing May 21, 2026

aamijar assigned Intron7 May 26, 2026

aamijar added non-breaking Non-breaking change feature request New feature or request labels May 26, 2026

aamijar moved this to In Progress in Unstructured Data Processing May 26, 2026

Intron7 and others added 2 commits June 3, 2026 15:54

Merge branch 'main' into feat/add-lanczos-svds

c06c2a8

fix func decl

b80ca30

cjnolet reviewed Jun 4, 2026

View reviewed changes

Comment thread cpp/include/raft/sparse/solver/solver_types.hpp

cjnolet reviewed Jun 4, 2026

View reviewed changes

Comment thread cpp/include/raft/sparse/solver/lanczos_svds.cuh

adress comments

93e9a19

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add sparse Lanczos SVD solver#3034

Add sparse Lanczos SVD solver#3034
Intron7 wants to merge 4 commits into
NVIDIA:mainfrom
Intron7:feat/add-lanczos-svds

Intron7 commented May 21, 2026

Uh oh!

copy-pr-bot Bot commented May 21, 2026

Uh oh!

aamijar commented May 26, 2026

Uh oh!

aamijar commented May 27, 2026

Uh oh!

Intron7 commented Jun 3, 2026

Uh oh!

cjnolet commented Jun 4, 2026

Uh oh!

Uh oh!

Uh oh!

cjnolet commented Jun 23, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Conversation

Intron7 commented May 21, 2026

Uh oh!

copy-pr-bot Bot commented May 21, 2026

Uh oh!

aamijar commented May 26, 2026

Uh oh!

aamijar commented May 27, 2026

Uh oh!

Intron7 commented Jun 3, 2026

Uh oh!

cjnolet commented Jun 4, 2026

Uh oh!

Uh oh!

Uh oh!

cjnolet commented Jun 23, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants