sqrtm is not type-stable #4006

johnmyleswhite · 2013-08-10T02:55:42Z

Maybe we should change sqrtm so that it always returns a complex array?

julia> X = sqrtm([1.0 -0.9; -0.9 1.0])
2x2 Float64 Array:
  0.847316  -0.531089
 -0.531089   0.847316

julia> X * X
2x2 Float64 Array:
  1.0  -0.9
 -0.9   1.0

julia> X = sqrtm([1.0 -0.9; 0.9 1.0])
2x2 Complex{Float64} Array:
   1.0829+0.0im  -0.415549+1.11022e-16im
 0.415549+0.0im     1.0829+1.38778e-16im

julia> X * X
2x2 Complex{Float64} Array:
 1.0+4.61352e-17im  -0.9+1.82784e-16im
  0.9+5.7669e-17im   1.0+3.46701e-16im

The text was updated successfully, but these errors were encountered:

ViralBShah · 2013-08-10T03:47:10Z

This is also the case with sqrt. This is what dynamic languages are meant for. The types also look fine. We can keep it the way it is IMO.

johnmyleswhite · 2013-08-10T03:48:58Z

That's not how sqrt behaves:

julia> sqrt(1.0)
1.0

julia> sqrt(-1.0)
ERROR: DomainError()
 in sqrt at math.jl:118

julia> sqrt(1.0 + 0.0 * im)
1.0 + 0.0im

julia> sqrt(-1.0 + 0.0 * im)
0.0 + 1.0im

JeffBezanson · 2013-08-10T04:37:31Z

eig behaves the same as sqrtm.

ViralBShah · 2013-08-10T08:29:24Z

Oops, sqrt was a bad example. But yes, eig does this.

johnmyleswhite · 2013-08-10T13:02:56Z

Ok. This behavior seems undesirable to me, but I suppose this is traditional enough that I should just learn to accept it. I would have thought we'd want behavior like fft has.

StefanKarpinski · 2013-08-10T14:36:34Z

I agree that this does not seem great from the type-stability perspective.

StefanKarpinski · 2014-01-24T23:20:53Z

One option would be to return a real matrix when the input is a matrix of Symmetric type and a complex matrix otherwise. That also has the advantage of avoiding the check when you've constructed the matrix to be symmetric. A big disadvantage of the current arrangement is that there isn't any way to call sqrtm type stably. We could do a similar scheme for eig too. The nice: the returned array is always complex and it's always correct but it might happen to be a purely real-valued complex array. If you know the matrix is symmetric and you want a real matrix back, you can wrap it in Symmetric first and get exactly what you want.

jiahao · 2014-01-24T23:41:20Z

The underlying issue is that special symmetries of the matrix (Hermitian, real symmetric etc) allow for special-cased algorithms that are significantly faster than the generic algorithms. Functions like sqrtm can be computed more stably and quickly for complex Hermitian than for general unsymmetric complex, and so the current methods actually check for special symmetries and explicitly dispatch appropriate specialized methods. If we turn off these symmetry detection checks, we compromise performance. If we explicitly cast the return into a Complex matrix type, we also potentially compromise performance if we have to force memory copies to create the desired type.

If a user wants to explicitly cast the return into a Complex matrix type, they should feel free to do so. But there seems no way to force the issue without compromising performance.

ivarne · 2014-01-25T06:52:03Z

It might be nice to mention the type instability issue in the documentation for sqrtm?

jiahao · 2014-01-25T08:34:27Z

Yeah, we should document this. This behavior isn't specific to sqrtm however; rather it is also a feature of other functions, most notably linear solve and eig.

Reopening as documentation issue.

johnmyleswhite · 2014-01-25T15:50:01Z

We should really be documenting the return type of every function.

jiahao · 2014-01-26T02:40:51Z

My earlier analysis of the sqrtm polyalgorithm turns out to be not quite right. This is what actually happens when sqrtm(A) is called:

check if A is Hermitian or symmetric.
If A is neither Hermitian nor symmetric, triangularize A by computing its Schur factorization. Compute the matrix square root of the triangular factor, then rotate the new matrix back into the original basis.
If A is either Hermitian or symmetric, diagonalize A by computing its eigendecomposition. Take the square roots of the eigenvalues, converting them as necessary to handle the possibility of complex values being produced, then rotate the new matrix back into the original basis.

A matrix that is neither Hermitian nor symmetric will in general have a square root with complex elements. A matrix that is Hermitian or symmetric will also have a square root with complex elements unless it is positive definite, in which case we can specialize to the case of square root with real elements. Since you can determine whether a Hermitian/symmetric matrix is positive definite essentially for free after computing its eigenvalues, and recomposing a real matrix as a product of real factors is cheaper than in the complex case, I still think that the current polyalgorithm is optimally performant.

The current implementation of the individual components, however, suggests some beneficial refactoring. For example:

The computation of the square root of a triangular matrix is buried in the general method and could be usefully exposed to compute square roots of Triangulars quickly.
The current check for positive definiteness converts all the eigenvalues to complex, then takes their square root, then checks if the result has zero imaginary component. This could be done more simply.

- Reimplement expm and sqrtm (ref #4006)

GunnarFarneback · 2014-01-27T12:41:47Z

A matrix that is neither Hermitian nor symmetric will in general have a square root with complex elements.

In general yes, but e.g. the special case of real matrices with no eigenvalues on the non-positive real axis (positive and conjugate pairs are fine) have real square roots. I wrote about some of the peculiarities of the matrix square root in a mailing list thread about sqrtm when it was implemented:

Like the scalar square root, the matrix square root often has multiple solutions, and even more so than in the scalar case. E.g. a positive definite nxn Hermitian matrix with distinct eigenvalues has 2^n different square roots, whereas repeated eigenvalues can allow infinitely many square roots. E.g. [-1 0;0 -1] has the obvious square roots [i 0;0 i], [i 0;0 -i], [-i 0;0 i], and [-i 0;0 i] but also non-diagonal solutions like [1 2;-1 -1], which has the property of being real. In fact real matrices often have real square roots, and it could be interesting to have a real variation of sqrtm. This is thoroughly investigated, at least in the non-singular case, in

N. J. Higham, Computing real square roots of a real matrix, Linear Algebra Appl. 88/89 (1987) 405–430.

As a special case, I believe that the implemented algorithm does compute a real solution, up to numerical noise, for real matrices with no eigenvalues on the non-positive real axis. This is easily checked in the Schur decomposition and the imaginary part could be dropped in that case. It would be good if someone could verify that this is correct.

The singular non-diagonalizable case can be quite tricky. E.g. [0 1;0 0] has no square root at all, neither real nor complex, but if we add another dimension we have [0 0 1;0 0 0;0 1 0]^2 = [0 1 0;0 0 0;0 0 0]. The implemented method not-a-numbers both of these.

I did write a Julia implementation of Higham's real square root method (starting from the real Schur decomposition instead of the complex Schur decomposition) but it's really not worth the code complexity.

jiahao · 2014-01-27T15:59:26Z

@GunnarFarneback thanks for the back history. I'll have to think for a bit about how easy it would be to check for the special case you mentioned in the triangular sqrtm routine.

The mailing list discussion belies a much more complicated issue, namely that of defining a principal square root. It bears repeating that matrices don't have unique square roots. I think most people would expect matrix-valued functions to produce the "canonical" version that is implemented, which is to factorize, square root, and rotate back to the original basis. But perhaps this is worth mentioning in the docs.

…y reflectors. Add Float16 and BigFloat to tests and test promotion. Cleaup promotion in LinAlg. Avoid promotion in mutating ! functions. Make Symmetric, Hermitian and QRs immutable. Make thin a keyword argument in svdfact. Remove cond argument in sqrtm.

jiahao closed this as completed Jan 24, 2014

jiahao reopened this Jan 25, 2014

jiahao closed this as completed in e87a929 Jan 26, 2014

jiahao added a commit that referenced this issue Jan 26, 2014

Consolidate Hermitian and Symmetric routines

de6c1f6

- Reimplement expm and sqrtm (ref #4006)

jiahao added a commit that referenced this issue Jan 26, 2014

Provide sqrtm of Triangular matrices (ref #4006)

9bd4618

JeffBezanson mentioned this issue Mar 1, 2014

Bad type inference for eig #5992

Closed

timholy mentioned this issue Jun 10, 2015

Real-valued results for sqrtm #11655

Closed

andreasnoack mentioned this issue Jun 4, 2016

zillions of allocations in eig? #16751

Closed

afniedermayer mentioned this issue Aug 21, 2017

^(::Matrix{Float64},::Float64) not type stable #23369

Open

fredrikekre mentioned this issue Sep 1, 2017

deprecate sqrtm in favor of sqrt #23504

Merged

fredrikekre mentioned this issue Oct 23, 2017

type instability in log, sqrt, acos, asin, and acosh for symmetric/hermitian arguments #24296

Open

mateuszbaran mentioned this issue Feb 8, 2019

Possible faster matrix sqrt for upper triangular matrices #31007

Closed

sethaxen mentioned this issue Mar 10, 2021

Compute real matrix logarithm and matrix square root using real arithmetic #39973

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

sqrtm is not type-stable #4006

sqrtm is not type-stable #4006

johnmyleswhite commented Aug 10, 2013

ViralBShah commented Aug 10, 2013

johnmyleswhite commented Aug 10, 2013

JeffBezanson commented Aug 10, 2013

ViralBShah commented Aug 10, 2013

johnmyleswhite commented Aug 10, 2013

StefanKarpinski commented Aug 10, 2013

StefanKarpinski commented Jan 24, 2014

jiahao commented Jan 24, 2014

ivarne commented Jan 25, 2014

jiahao commented Jan 25, 2014

johnmyleswhite commented Jan 25, 2014

jiahao commented Jan 26, 2014

GunnarFarneback commented Jan 27, 2014

jiahao commented Jan 27, 2014

sqrtm is not type-stable #4006

sqrtm is not type-stable #4006

Comments

johnmyleswhite commented Aug 10, 2013

ViralBShah commented Aug 10, 2013

johnmyleswhite commented Aug 10, 2013

JeffBezanson commented Aug 10, 2013

ViralBShah commented Aug 10, 2013

johnmyleswhite commented Aug 10, 2013

StefanKarpinski commented Aug 10, 2013

StefanKarpinski commented Jan 24, 2014

jiahao commented Jan 24, 2014

ivarne commented Jan 25, 2014

jiahao commented Jan 25, 2014

johnmyleswhite commented Jan 25, 2014

jiahao commented Jan 26, 2014

GunnarFarneback commented Jan 27, 2014

jiahao commented Jan 27, 2014