Re: [math] Q-R -decomposition

Luc Maisonobe Sun, 21 May 2006 16:06:46 -0700

> Beyond what is available in the API (Q and R), what exactly does the
> QR Decomp "know" that makes solving easier?


foreword :

what is stated below is oriented only for least squares problems, it isnot a general discussion about decomposition algorithms.

When using QR decomposition in least squares problems, you NEVER reallycompute Q explicitely, and if you don't retrieve Q, you don't retrieve Reither. The decomposition algorithm keeps some information about Q (theHouseholder vectors, but also some coefficients and permutation indicesif you want to support rank-deficient problems) and you use thisinformation to compute transpose(Q).Q.V for some vector V withoutcomputing Q itself, and it uses R internally also to provide some higherlevel result, not Q and R to let you compute something with them. Q is ahuge matrix, much larger than the Householder vectors it can be deducedfrom. This is especially true when the problem as a few parameters but alot of measurements (in orbit determination problems, for example, youoften have less than 10 parameters in the model and several tens ofthousands of measurements).

What makes least squares solving easier is not the QR decompositionitself, but the way it is used in the surrounding algorithm(Levenberg-Marquardt for example). In this case, you do NOT compute thenormal equations (i.e. transpose(J).J where J is the jacobian matrix)and decompose the resulting square matrix like you would do in aGauss-Newton solver. You decompose the jacobian matrix itself (this isthe reason for the transpose(Q).Q.V part of the solver). Bothdecomposition are therefore not used the same way.

The QR way is more accurate because there are situations where thesquaring of the jacobian matrix involved in the normal equationscomputation leads to cancellation of some tiny values (epsilon ->epsilon^2). For difficult problems, this is really important.

On the other hand, using LU has other benefits. First, you may build thenormal equations iteratively (i.e. build Jt.J by updating a matrix asmeasurements are considered one after the other) and second, the matrixsize is small (mXm where m is the number of parameters of the model),which is smaller than the nXm matrix appearing in the QR decomposition(but beware, nobody really computes the nXn Q matrix). QR decompositioninvolves about twice the number of operations the LU decomposition.

So the choice is size versus accuracy for simple problems, and you mayonly choose accuracy for difficult problems, as the other wayalternative simply fails. For optimization problems, theLevenberg-Marquardt algorithm (which uses a QR decomposition as one ofthe many parts of the algorithm) is the method of choice you will findin many applications. It works really well and few people botherstudying really what is the better alternative.

In any case, for least squares problems, the decomposition used is animplementation choice and the user doesn't really need to see theintermediate steps (building J or not, decomposing Jt.J using LU or Jusing QR, applying residuals, updating the parameters). He chooses onemethod or the other and get the final result.


Luc


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Re: [math] Q-R -decomposition

Reply via email to