An earlier post claimed that there are practical advantages to partitioning a matrix, that is, thinking of the matrix as a matrix of matrices. This post will give an example.
Let M be a square matrix and suppose we need to multiply M by its transpose Mᵀ. We can compute this product faster than multiplying two arbitrary matrices of the same size by exploiting the fact that MMᵀ will be a symmetric matrix.
We start by partitioning M into four blocks:

    M = [ A  B ]
        [ C  D ]

Then

    Mᵀ = [ Aᵀ  Cᵀ ]
         [ Bᵀ  Dᵀ ]

and

    MMᵀ = [ AAᵀ + BBᵀ   ACᵀ + BDᵀ ]
          [ CAᵀ + DBᵀ   CCᵀ + DDᵀ ]
Now for the first clever part: we don’t have to compute both of the off-diagonal blocks because each is the transpose of the other. Each off-diagonal block accounts for two of the eight half-size block products, so by not calculating one of the blocks, say the lower-left block, we reduce our calculation by 25%.
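The block identities above are easy to sanity-check numerically. Here is a minimal NumPy sketch; the 6×6 size and the even split are arbitrary choices for illustration:

```python
import numpy as np

rng = np.random.default_rng(0)
M = rng.standard_normal((6, 6))

# Partition M into four 3x3 blocks.
A, B = M[:3, :3], M[:3, 3:]
C, D = M[3:, :3], M[3:, 3:]

upper_right = A @ C.T + B @ D.T  # off-diagonal block of M Mᵀ
lower_left  = C @ A.T + D @ B.T  # the other off-diagonal block

# Each off-diagonal block is the transpose of the other,
# so only one of them needs to be computed.
assert np.allclose(lower_left, upper_right.T)

# The assembled block product matches M Mᵀ.
P = np.block([[A @ A.T + B @ B.T, upper_right],
              [upper_right.T,     C @ C.T + D @ D.T]])
assert np.allclose(P, M @ M.T)
```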
And now for the second clever part: apply the same procedure recursively. The diagonal blocks of MMᵀ are sums of a matrix times its own transpose: AAᵀ + BBᵀ and CCᵀ + DDᵀ. That is, we can partition A and use the same idea to compute AAᵀ, and do the same for BBᵀ, CCᵀ, and DDᵀ. The off-diagonal block requires general matrix multiplications.
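The recursion can be sketched in a few lines. This is an illustrative implementation rather than a tuned one, and it assumes (for simplicity) that the matrix size is a power of 2 so the blocks always split evenly:

```python
import numpy as np

def sym_product(M):
    """Compute M @ M.T recursively, exploiting symmetry.

    Assumes M is square with size a power of 2 (a simplifying
    assumption for this sketch).
    """
    n = M.shape[0]
    if n == 1:
        return M * M
    h = n // 2
    A, B = M[:h, :h], M[:h, h:]
    C, D = M[h:, :h], M[h:, h:]
    # Diagonal blocks: sums of symmetric products, computed recursively.
    upper_left  = sym_product(A) + sym_product(B)   # A Aᵀ + B Bᵀ
    lower_right = sym_product(C) + sym_product(D)   # C Cᵀ + D Dᵀ
    # Off-diagonal block: two general products. The lower-left block
    # is its transpose and is not computed separately.
    upper_right = A @ C.T + B @ D.T
    return np.block([[upper_left,    upper_right],
                     [upper_right.T, lower_right]])

rng = np.random.default_rng(1)
M = rng.standard_normal((8, 8))
assert np.allclose(sym_product(M), M @ M.T)
```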
The net result is that we can compute MMᵀ in about 2/3 the time it would take to multiply two arbitrary matrices of the same size, counting multiplications and using a fast algorithm such as Strassen’s for the general block products. If s is the cost of the symmetric product relative to a general product, the recursion gives s = (4s + 2)/7, whose fixed point is s = 2/3.
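To see where the ratio comes from, count scalar multiplications: Strassen’s algorithm turns one product of size n into 7 products of size n/2, while the symmetric recursion uses 4 symmetric products and 2 general products of size n/2. A short script (the function names are mine) confirms that the ratio tends to 2/3:

```python
def strassen_mults(k):
    """Scalar multiplications for Strassen on a 2^k x 2^k matrix."""
    return 7 ** k

def sym_mults(k):
    """Multiplications for the recursive symmetric product,
    with the off-diagonal block products done by Strassen."""
    if k == 0:
        return 1
    return 4 * sym_mults(k - 1) + 2 * strassen_mults(k - 1)

for k in (1, 5, 10, 15):
    print(k, sym_mults(k) / strassen_mults(k))  # ratio approaches 2/3
```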
Recently a group of researchers found a way to take this idea even further, partitioning a matrix into a 4 by 4 matrix of 16 blocks and doing some clever tricks. Their RXTX algorithm can compute MMᵀ in about 26/41 the time required to multiply arbitrary matrices, about 5% less than the 2/3 ratio above. A 5% improvement may be significant if it appears in the inner loop of a heavy computation. According to the authors, “The algorithm was discovered by combining Machine Learning-based search methods with Combinatorial Optimization.”
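As I understand the paper, RXTX makes 8 recursive calls plus 26 general quarter-size multiplications, the latter each costing 1/49 of a full Strassen multiply, so the cost ratio r relative to Strassen satisfies r = (8r + 26)/49, whose fixed point is 26/41 ≈ 0.634. A quick check with exact rational arithmetic:

```python
from fractions import Fraction

# Fixed point of r = (8 r + 26) / 49, the asymptotic cost of RXTX
# relative to Strassen under the recurrence R(n) = 8 R(n/4) + 26 M(n/4),
# with M(n/4) = M(n)/49 for Strassen. (My reading of the construction.)
r = Fraction(26, 41)
assert r == (8 * r + 26) / 49

print(float(r))                    # about 0.634
print(float(r / Fraction(2, 3)))   # about 0.951, i.e. roughly a 5% savings
```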
Related posts
- Embeddings, Projections, and Inverses
- The million dollar matrix multiply
- Vector spaces over finite fields
The post Multiplying a matrix by its transpose first appeared on John D. Cook.