As part of a complex task, I need to compute matrix cofactors. I did this in a straightforward way using this nice code for computing matrix minors. Here is my code:
def matrix_cofactor(matrix):
C = np.zeros(matrix.shape)
nrows, ncols = C.shape
for row in xrange(nrows):
for col in xrange(ncols):
minor = matrix[np.array(range(row)+range(row+1,nrows))[:,np.newaxis],
np.array(range(col)+range(col+1,ncols))]
C[row, col] = (-1)**(row+col) * np.linalg.det(minor)
return C
It turns out that this matrix cofactor code is the bottleneck, and I would like to optimize the code snippet above. Any ideas as to how to do this?
And the output (which is cofactor matrix) is:
If your matrix is invertible, the cofactor is related to the inverse:
This gives large speedups (~ 1000x for 50x50 matrices). The main reason is fundamental: this is an
O(n^3)
algorithm, whereas the minor-det-based one isO(n^5)
.This probably means that also for non-invertible matrixes, there is some clever way to calculate the cofactor (i.e., not use the mathematical formula that you use above, but some other equivalent definition).
If you stick with the det-based approach, what you can do is the following:
The majority of the time seems to be spent inside
det
. (Check out line_profiler to find this out yourself.) You can try to speed that part up by linking Numpy with the Intel MKL, but other than that, there is not much that can be done.You can speed up the other part of the code like this:
This gains some 10-50% total runtime depending on the size of your matrices. The original code has Python
range
and list manipulations, which are slower than direct slice indexing. You could try also to be more clever and copy only parts of the minor that actually change --- however, already after the above change, close to 100% of the time is spent insidenumpy.linalg.det
so that furher optimization of the othe parts does not make so much sense.The calculation of
np.array(range(row)+range(row+1,nrows))[:,np.newaxis]
does not depended oncol
so you could could move that outside the inner loop and cache the value. Depending on the number of columns you have this might give a small optimization.