Fastest way to compute J' * J, where J is sparse

1 view (last 30 days)
I have a sparse rectangular matrix, J, for which I want to compute:
>> H = J' * J;
It's a bit slow (transpose is taking 5s and the matrix multiplication 9s), and given this is a special and very common case of a transpose and multiply, I was wondering if MATLAB had a faster way, e.g. one which avoids an explicit transpose.
  15 Comments
Matt J
Matt J on 20 Nov 2015
I don't know the complexity. It will have to be tested. I imagine this could be advantageous only for certain kinds of sparsity structure.
Oliver Woodford
Oliver Woodford on 20 Nov 2015
For my size of matrix, qr(J,0) is vastly slower (i.e. several orders of magnitude; actually it crashed MATLAB after a minute or so) than a mex file that computes J'*J (a second or so). The mex file is a bit slower than standard MATLAB for smaller sparse matrices, but at least works for very tall matrices (which MATLAB produces an "Matrix too large" error on when transposing).

Sign in to comment.

Answers (2)

Azzi Abdelmalek
Azzi Abdelmalek on 3 Nov 2014
Edited: Azzi Abdelmalek on 3 Nov 2014
c=sparse(J);
H=full(c*c');
  2 Comments
Oliver Woodford
Oliver Woodford on 3 Nov 2014
The matrix is already stored as a sparse matrix.
John D'Errico
John D'Errico on 3 Nov 2014
Edited: John D'Errico on 3 Nov 2014
Um, J is already assumed to be in sparse form, and one would definitely not want to compute a full result when working with sparse matrices. Finally, you put the transpose on the wrong term, computing J*J', not J'*J.

Sign in to comment.


Matt J
Matt J on 4 Nov 2014
Edited: Matt J on 5 Nov 2014
I don't think there's anything available to accelerate an exact calculation of J'*J for general J. However, if you know in advance that J'*J happens to be banded to diagonals -k:k for small k (or if it can be approximated as such), then it might help to compute the 2*k+1 non-trivial diagonals individually. You can do so without transposition as below.
[m,n]=size(J);
k=2;
kc=k+1;
tic;
B=zeros(n);
B(:,kc)=sum(J.^2);
for i=1:k
tmp=sum(J(:,1:end-i).*J(:,i+1:end));
B(1:end-i,kc-i)=tmp;
B(i+1:end,kc+i)=tmp;
end
result=spdiags(B,-k:k,n,n);
toc;
Whether this is actually faster will probably depend on the specifics of J. If nothing else, it spares you the large memory consumption of holding wide sparse matrices such as J' in RAM
>> J=sparse(m,n); Jt=J'; whos J Jt
Name Size Bytes Class Attributes
J 3192027x3225 25824 double sparse
Jt 3225x3192027 25536240 double sparse
Replacing J'*J by a banded approximation is something I haven't tried myself with Gauss-Newton specifically, but the role of J'*J is already as an approximation there, so I think it could work. Other minimization algorithms tend to be robust to small errors in the derivatives.

Categories

Find more on Sparse Matrices in Help Center and File Exchange

Products

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!