Misplaced Pages

Orthogonal Procrustes problem

Article snapshot taken from Wikipedia with creative commons attribution-sharealike license. Give it a read and then ask your questions in the chat. We can research this topic together.
Matrix approximation problem in linear algebra

The orthogonal Procrustes problem is a matrix approximation problem in linear algebra. In its classical form, one is given two matrices A {\displaystyle A} and B {\displaystyle B} and asked to find an orthogonal matrix Ω {\displaystyle \Omega } which most closely maps A {\displaystyle A} to B {\displaystyle B} . Specifically, the orthogonal Procrustes problem is an optimization problem given by

minimize Ω Ω A B F subject to Ω T Ω = I , {\displaystyle {\begin{aligned}{\underset {\Omega }{\text{minimize}}}\quad &\|\Omega A-B\|_{F}\\{\text{subject to}}\quad &\Omega ^{T}\Omega =I,\end{aligned}}}

where F {\displaystyle \|\cdot \|_{F}} denotes the Frobenius norm. This is a special case of Wahba's problem (with identical weights; instead of considering two matrices, in Wahba's problem the columns of the matrices are considered as individual vectors). Another difference is that Wahba's problem tries to find a proper rotation matrix instead of just an orthogonal one.

The name Procrustes refers to a bandit from Greek mythology who made his victims fit his bed by either stretching their limbs or cutting them off.

Solution

This problem was originally solved by Peter Schönemann in a 1964 thesis, and shortly after appeared in the journal Psychometrika.

This problem is equivalent to finding the nearest orthogonal matrix to a given matrix M = B A T {\displaystyle M=BA^{T}} , i.e. solving the closest orthogonal approximation problem

min R R M F s u b j e c t   t o R T R = I {\displaystyle \min _{R}\|R-M\|_{F}\quad \mathrm {subject\ to} \quad R^{T}R=I} .

To find matrix R {\displaystyle R} , one uses the singular value decomposition (for which the entries of Σ {\displaystyle \Sigma } are non-negative)

M = U Σ V T {\displaystyle M=U\Sigma V^{T}\,\!}

to write

R = U V T . {\displaystyle R=UV^{T}.\,\!}

Proof of Solution

One proof depends on the basic properties of the Frobenius inner product that induces the Frobenius norm:

R = arg min Ω | | Ω A B F 2 = arg min Ω Ω A B , Ω A B F = arg min Ω Ω A F 2 + B F 2 2 Ω A , B F = arg min Ω A F 2 + B F 2 2 Ω A , B F = arg max Ω Ω A , B F = arg max Ω Ω , B A T F = arg max Ω Ω , U Σ V T F = arg max Ω U T Ω V , Σ F = arg max Ω S , Σ F where  S = U T Ω V {\displaystyle {\begin{aligned}R&=\arg \min _{\Omega }||\Omega A-B\|_{F}^{2}\\&=\arg \min _{\Omega }\langle \Omega A-B,\Omega A-B\rangle _{F}\\&=\arg \min _{\Omega }\|\Omega A\|_{F}^{2}+\|B\|_{F}^{2}-2\langle \Omega A,B\rangle _{F}\\&=\arg \min _{\Omega }\|A\|_{F}^{2}+\|B\|_{F}^{2}-2\langle \Omega A,B\rangle _{F}\\&=\arg \max _{\Omega }\langle \Omega A,B\rangle _{F}\\&=\arg \max _{\Omega }\langle \Omega ,BA^{T}\rangle _{F}\\&=\arg \max _{\Omega }\langle \Omega ,U\Sigma V^{T}\rangle _{F}\\&=\arg \max _{\Omega }\langle U^{T}\Omega V,\Sigma \rangle _{F}\\&=\arg \max _{\Omega }\langle S,\Sigma \rangle _{F}\quad {\text{where }}S=U^{T}\Omega V\\\end{aligned}}}
This quantity S {\displaystyle S} is an orthogonal matrix (as it is a product of orthogonal matrices) and thus the expression is maximised when S {\displaystyle S} equals the identity matrix I {\displaystyle I} . Thus
I = U T R V R = U V T {\displaystyle {\begin{aligned}I&=U^{T}RV\\R&=UV^{T}\\\end{aligned}}}

where R {\displaystyle R} is the solution for the optimal value of Ω {\displaystyle \Omega } that minimizes the norm squared | | Ω A B F 2 {\displaystyle ||\Omega A-B\|_{F}^{2}} .

Generalized/constrained Procrustes problems

There are a number of related problems to the classical orthogonal Procrustes problem. One might generalize it by seeking the closest matrix in which the columns are orthogonal, but not necessarily orthonormal.

Alternately, one might constrain it by only allowing rotation matrices (i.e. orthogonal matrices with determinant 1, also known as special orthogonal matrices). In this case, one can write (using the above decomposition M = U Σ V T {\displaystyle M=U\Sigma V^{T}} )

R = U Σ V T , {\displaystyle R=U\Sigma 'V^{T},\,\!}

where Σ {\displaystyle \Sigma '\,\!} is a modified Σ {\displaystyle \Sigma \,\!} , with the smallest singular value replaced by det ( U V T ) {\displaystyle \det(UV^{T})} (+1 or -1), and the other singular values replaced by 1, so that the determinant of R is guaranteed to be positive. For more information, see the Kabsch algorithm.

The unbalanced Procrustes problem concerns minimizing the norm of A U B {\displaystyle AU-B} , where A R m × , U R × n {\displaystyle A\in \mathbb {R} ^{m\times \ell },U\in \mathbb {R} ^{\ell \times n}} , and B R m × n {\displaystyle B\in \mathbb {R} ^{m\times n}} , with m > n {\displaystyle m>\ell \geq n} , or alternately with complex valued matrices. This is a problem over the Stiefel manifold U U ( m , ) {\displaystyle U\in U(m,\ell )} , and has no currently known closed form. To distinguish, the standard Procrustes problem ( A R m × m {\displaystyle A\in \mathbb {R} ^{m\times m}} ) is referred to as the balanced problem in these contexts.

See also

References

  1. Gower, J.C; Dijksterhuis, G.B. (2004), Procrustes Problems, Oxford University Press
  2. Hurley, J.R.; Cattell, R.B. (1962), "Producing direct rotation to test a hypothesized factor structure", Behavioral Science, 7 (2): 258–262, doi:10.1002/bs.3830070216
  3. Golub, G.H.; Van Loan, C. (2013). Matrix Computations (4 ed.). JHU Press. p. 327. ISBN 978-1421407944.
  4. Schönemann, P.H. (1966), "A generalized solution of the orthogonal Procrustes problem" (PDF), Psychometrika, 31: 1–10, doi:10.1007/BF02289451, S2CID 121676935.
  5. Everson, R (1997), Orthogonal, but not Orthonormal, Procrustes Problems (PDF)
  6. Eggert, DW; Lorusso, A; Fisher, RB (1997), "Estimating 3-D rigid body transformations: a comparison of four major algorithms", Machine Vision and Applications, 9 (5): 272–290, doi:10.1007/s001380050048, S2CID 1611749
Categories: