15
$\begingroup$

It's easy for scalars, $(\exp(a(x)))' = a' e^a$. But can anything be said about matrices? Do $A(x)$ and $A'(x)$ commute such that $(\exp(A(x)))' = A' e^A = e^A A'$ or is this only a special case?

  • 4
    Have a look at this wikipedia article: [the exponential map](http://en.wikipedia.org/wiki/Matrix_exponential#The_exponential_map), although I don't know if this article directly addresses the commutative part of your question.2010-08-10
  • 0
    @Tom Stephens: That is the answer already, thanks. Please feel free to add it as such (maybe quoting $\frac{d}{dt}e^{X(t)} = \int_0^1 e^{\alpha X(t)} \frac{dX(t)}{dt} e^{(1-\alpha) X(t)}\,d\alpha$)2010-08-10
  • 0
    I don't have anything intelligent to say about this. I would like to see someone else post an answer who is more familiar with these operations. In particular, I would like to see the issue of commutativity addressed since it is not at all obvious to me.2010-08-10
  • 1
    I've no access to http://dx.doi.org/10.1137/S0895479895283409 , but it might have what you need I suppose.2010-08-10
  • 0
    @JMangaldan: Via [google scholar](http://scholar.google.de/scholar?q=author%3AMathias+%22A+Chain+Rule+for+Matrix+Functions+and+Applications%22&hl=de&btnG=Suche&lr=) I found it [here](http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.33.3890&rep=rep1&type=pdfhttp://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.33.3890&rep=rep1&type=pdf). I only skimmed it, but it seems to be specialised on triangular matrices. But thanks for the link2010-08-11
  • 1
    The reason for specializing to (block) triangular IIRC is that one does a preliminary Schur decomposition A=QTQ* of the input matrix first, evaluates the function at the resulting (block) triangular matrix T, and then multiplies back the unitary matrices Q. Now that I think about it, maybe you should take a look at http://www.maths.manchester.ac.uk/~higham/fm/index.php also.2010-08-11

3 Answers 3

6

Tom Stephens linked to the exponential map, which states that

$ \frac{d}{dt}e^{X(t)} = \int\limits_0^1 e^{\alpha X(t)} \frac{dX(t)}{dt} e^{(1-\alpha) X(t)} d\alpha $

If $X(t)$ and $\frac{d}{dt}X(t)$ commute, the latter also commutes with $\exp(X(t))$ and then it simplifies into $ \frac{d}{dt}e^{X(t)} = \frac{d X(t)}{dt} e^{X(t)}$.

A counter-example is $$X(t) = \begin{pmatrix} \cos(t) & \sin(t) \\\ \sin(t) & -\cos(t) \end{pmatrix}$$ at $t=0$, i.e. $X(0) = \sigma_3, \dot X(0) = \sigma_1$ ($\sigma_i$ are the non-commuting Pauli Matrices)

4

The question comes down to computing what's called the "derivative of", the "differential of", or the "tangent map to", the exponential map from $M_n(\mathbb R)$ into itself at a given matrix $A$ (not necessarily the zero matrix). There is a classical formula for this. Here is the first reference I found: pages 1 and 2 of

http://www.math.columbia.edu/~woit/notes4.pdf

by Peter Woit. Here it is (with Peter Woit's notation) $$\exp_*(X)\ Y=\exp(X)\ \frac{1-e^{-ad(X)}}{ad(X)}\ Y.$$ Here is a reference for the Chain Rule:

http://en.wikipedia.org/wiki/Chain_rule#The_fundamental_chain_rule

It reads, in Peter Woit's notation and under appropriate assumptions,

$$(f\circ g)\_*(x)=f_*(g(x))\circ g_*(x).$$

[Thank you to KennyTM for having edited this formula.]

EDIT 1. Here are the two formulas written in another notation:

$$\exp'(X)=\exp(X)\ \frac{1-e^{-ad(X)}}{ad(X)}\quad,$$

$$(f\circ g)'(x)=f'(g(x))\circ g'(x).$$

EDIT 2. Here is another reference. This is a post by Akhil Mathew:

http://deltaepsilons.wordpress.com/2009/11/07/helgasons-formula-ii/

  • 0
    Does $ad(X)$ mean $ad(X) Y = [X,Y], ad(X)^2 Y = [X, [X,Y]]$ etc.? In that case I'm missing a Matrix it's acting on, and also what $ad(X)^{-1}$ would mean.2010-08-11
  • 0
    Yes, ad(X) means what you said. If T is an indeterminate, then the expression (1-exp(-T))/T defines a formal power series with infinite radius of convergence. Hence, it can be evaluated on any endomorphism of any finite dimensional complex vector space.2010-08-11
  • 0
    Are the $a_*$ and $a'$ equivalent in this answer?2010-08-11
  • 0
    Dear KennyTM: Yes, you're perfectly right! Thanks and congratulation for having solved my LaTeX problem. You're a wizard!2010-08-11
0

Only I khnow is that if your matrix $A$ and $A\in GL(n,\mathbb{R})$ is invertible we can say that $A'(X)=-AXA^{-1}$. So $(e^{A(x)})' =A(x)'e^ {A(x)} =-AXA^{-1}e^{A(x)}$ , since $A$ is not commutative you can't say $A' e^{A(x)} =e^{A(x)}A'$