Linear System of ODEs Theory

8 minute read ·

Published:

An important kind of equation to learn is those linear equations, which are those can be written in the form of

Ly=bLy=b

for some linear differential operator LL and unknown yy. We begin by the example of second order homogeneous linear equations.

Second Order Linear Equations

Given a linear, second order, homogeneous ODE

y+p(t)y+q(t)y=0\begin{equation}y''+p\left(t\right)y'+q\left(t\right)y=0\end{equation}

Now, we are going to show the following two things:

  1. If y1,y2y_1,y_2 are solution to (1), then y=c1y1+c2y2y=c_1y_1+c_2y_2 is also a solution to (1) for some constants c1,c2Rc_1,c_2\in \mathbb{R}.
  1. If y1y_1 and y2y_2 are two linearly independent solutions to (1), then all solution to (1) have the form y=c1y1+c2y2y=c_1y_1+c_2y_2 for some constants c1,c2Rc_1,c_2\in\mathbb{R}.

The first one relies on the linearity of differential operator

L=d2dt+p(t)ddt+q(t)L=\frac{\mathrm{d}^2}{\mathrm{d}t}+p\left(t\right)\frac{\mathrm{d}}{\mathrm{d}t}+q\left(t\right)

We can show that this operator is linear by observing that

  • L(cy)=cL(y)L\left(cy\right)=cL\left(y\right) for any constant cc.
  • L(y1+y2)=L(y1)+L(y2)L\left(y_1+y_2\right)=L\left(y_1\right)+L\left(y_2\right).

Then, for any constant c1,c2c_1,c_2, we have

L(c1y1+c2y2)=c1L(y1)+c2L(y2)=0L\left(c_1y_1+c_2y_2\right)=c_1L\left(y_1\right)+c_2L\left(y_2\right)=0

and the first one is proved.

The second one relies the Picard-Lindelof theorem also known as existence and uniqueness theorem:

Let x0Ωx_0\in\Omega, where ΩRn\Omega\subset\mathbb{R}^n is open and let t0It_0\in I, where IRI\subset\mathbb{R} be an interval.

Suppose F:Ω×IRnF:\Omega\times I\to\mathbb{R}^n is continuous in tt and Lipschitz continuous in xx, i.e. there exist a constant L>0L>0, such that for all x,yΩx,y\in\Omega and all tIt\in I,

F(x,t)F(y,t)Lxy\left\|F\left(x,t\right)-F\left(y,t\right)\right\|\le L\left\|x-y\right\|

Then, the initial value problem

x=F(x,t)x(t0)=x0\begin{align}x'=F\left(x,t\right) &&x\left(t_0\right)=x_0\end{align}

has one and only one solution in some tt interval containing t0t_0.

An outline of the proof is as follows:

We can always construct a solution by using an iterative approximation method known as Picard iteration, which works as follows:

First, a solution to (2) will have to satisfy the integral equation

x(t)=x0+t0tF(x(s),s)dsx\left(t\right)=x_0+\int_{t_0}^tF\left(x\left(s\right),s\right)\mathrm{d}s

Then, by setting x(0)(t)=x0x^{(0)}\left(t\right)=x_0, and

x(k+1)(t)=x0+t0tF(x(k)(s),s)dsx^{\left(k+1\right)}\left(t\right)=x_0+\int_{t_0}^tF\left(x^{\left(k\right)}\left(s\right),s\right)\mathrm{d}s

yields a sequence of function x(k)(t)x^{(k)}\left(t\right). Then, under some condition on function FF, we can use Contraction Mapping Principle to show that the sequence converges to a unique function x(t)x\left(t\right).

Algebraic Structure of Linear Equation

An interesting question to consider in the proof is that why is there exactly two independent solutions to a second linear order equation? A simple answer to this might be that since it contain second order derivative, when we solve it we shall integrate twice and each gives an integrating constant cc which makes there are two constants c1,c2c_1,c_2 to determine.

In fact, if we apply some knowledge from linear algebra, we may easy to think of the solutions as a vector space, known as solution space which happens to be 2 dimensional. And the general case would be, for a linear equation of order nn, the dimension of its solution space is exactly nn. Now, we want to prove this.

First notice that given an explicit ODE of order nn,

x(n)(t)=f(x,x,x,,x(n1),t)\begin{equation}x^{\left(n\right)}\left(t\right)=f\left(x,x',x'',\dots,x^{\left(n-1\right)},t\right)\end{equation}

and set new variables

x1=x,x2=x,x3=x,,xn=x(n1)\begin{align*} x_1=x,&& x_2=x',&& x_3=x'',&& \cdots,&& x_n=x^{\left(n-1\right)} \end{align*}

Equation (3) can be written as a linear system of equations of order 1,

(x1x2xn)=(x2x3f(x1,x2,,xn,t))\left( \begin{array}{c} x_1'\\ x_2'\\ \vdots\\ x_n'\\\end{array} \right) =\left( \begin{array}{c} x_2\\ x_3\\ \vdots\\ f\left( x_1,x_2,\dots ,x_n,t \right)\\\end{array} \right)

or, simply denoted as

x=F(x,t)\boldsymbol{x}'=\boldsymbol{F}\left(\boldsymbol{x},t\right)

Therefore, for any linear equation of order nn, it is always equivalent to a system of nn first order linear equation.

Thence, we only need to consider the first order linear system of equations,

x=A(t)x+b(t)\boldsymbol{x}'=\boldsymbol{A}\left(t\right)\boldsymbol{x}+\boldsymbol{b}\left(t\right)

where b:RRn\boldsymbol{b}:\mathbb{R}\to\mathbb{R}^n, and A:RRn×Rn\boldsymbol{A}:\mathbb{R}\to\mathbb{R}^n\times\mathbb{R}^n .

Consider the associated homogeneous equation

x=A(t)x\begin{equation}\boldsymbol{x}'=\boldsymbol{A}\left(t\right)\boldsymbol{x}\end{equation}

Notice that given any two solution x1,x2\boldsymbol{x}_1,\boldsymbol{x}_2, and any number λ,μR\lambda,\mu\in\mathbb{R}, their linear combination λx1+μx2\lambda\boldsymbol{x}_1+\mu\boldsymbol{x}_2 is also a solution.

Suppose that a basis of vectors B={b1,,bn}\mathcal{B}=\left\{\boldsymbol{b}_1,\cdots,\boldsymbol{b}_n\right\}, of Rn\mathbb{R}^n is given and fix t0Rt_0\in \mathbb{R}. Denote by xk(t)\boldsymbol{x}_k\left(t\right) the unique solution (theorem of Picard-Lindelof) to the initial value problem

xk=A(t)xk,xk(t0)=bk\begin{align} \boldsymbol{x}_k'=\boldsymbol{A}\left(t\right)\boldsymbol{x}_k, && \boldsymbol{x}_k\left(t_0\right)=\boldsymbol{b}_k \end{align}

and denote

F={x1,,xn}\mathcal{F}=\left\{\boldsymbol{x}_1,\cdots,\boldsymbol{x}_n\right\}

Then, by superposition principle, any element of spanF\mathrm{span}\mathcal{F} is a solution to (4)

Conversely, let x\boldsymbol{x} be any solution to (4), and suppose that x(t0)=x0Rn\boldsymbol{x}\left(t_0\right)=\boldsymbol{x}_0\in\mathbb{R}^n. We have

x(t0)=x0=k=1nλkbk=k=1nλkxk(t0)\boldsymbol{x}\left(t_0\right)=\boldsymbol{x}_0=\sum_{k=1}^{n}\lambda_k\boldsymbol{b}_k=\sum_{k=1}^{n}\lambda_k\boldsymbol{x}_k\left(t_0\right)

for some numbers λ1,,λnR\lambda_1,\dots,\lambda_n\in\mathbb{R}. Therefore, by theorem of Picard-Lindelof,

x=k=1nλkxk\boldsymbol{x}=\sum_{k=1}^n\lambda_k\boldsymbol{x}_k

This shows that the solution space is given by spanF\mathrm{span}\mathcal{F}.

In addition, we have,

t,k=1nλkxk(t)=0k=1nλkxk(t0)=k=1nλkbk=0λk=0\forall t, \sum_{k=1}^n{\lambda _k}\boldsymbol{x}_k\left( t \right) =0\Rightarrow \sum_{k=1}^n{\lambda _k}\boldsymbol{x}_k\left( t_0 \right) =\sum_{k=1}^n{\lambda _k}\boldsymbol{b}_k=0\Rightarrow \lambda _k=0

which means F\mathcal{F} is a basis.

Now, let’s explain why a second order linear equation will have exactly two independent solutions. For equation (1), let x1=y,x2=yx_1=y,x_2=y', and denote x=(x1,x2)\boldsymbol{x}=\left(x_1,x_2\right), it is equivalent to

x=(01p(t)q(t))x+(0g(t))\boldsymbol{x}'=\left( \begin{matrix} 0& 1\\ -p\left( t \right)& -q\left( t \right)\\\end{matrix} \right) \boldsymbol{x}+\left( \begin{array}{c} 0\\ g\left( t \right)\\\end{array} \right)

which is a first order linear system of equation in two dimension.

Solution to Linear System of Equations

Homogeneous, Constant Coefficients

First, consider the most basic case of linear system of equations, that is a homogeneous one with constant coefficients:

x=Ax,x(0)=x0\begin{align}\boldsymbol{x}'=\boldsymbol{A}\boldsymbol{x},&&\boldsymbol{x}\left(0\right)=\boldsymbol{x}_0 \end{align}

where x:RRn\boldsymbol{x}:\mathbb{R}\to\mathbb{R}^n and AM(n×n;R)\boldsymbol{A}\in \mathcal{M}\left(n\times n;\mathbb{R}\right).

With proper definition, it can be shown that the solution to the initial value problem (6) is

x(t)=eAtx0\boldsymbol{x}\left(t\right)=e^{\boldsymbol{A}t}\boldsymbol{x}_0

where

eAt=I+k=1Aktkk!e^{\boldsymbol{A}t}=\boldsymbol{I}+\sum_{k=1}^{\infty}{\frac{\boldsymbol{A}^kt^k}{k!}}

Now, the problem is how to calculate this matrix exponential. Recall from linear algebra that diagonal matrix, or Jordan matrix, are relative easy to calculate the matrix exponential. Fortunately, we have the Jordan Normal Form theorem which states that every complex matrix is similar to a Jordan matrix, i.e., for any matrix A\boldsymbol{A}, there exists a matrix U\boldsymbol{U} and a Jordan matrix J\boldsymbol{J} such that

A=UJU1\boldsymbol{A}=\boldsymbol{U}\boldsymbol{J}\boldsymbol{U}^{-1}

Then, we can easily see that

Ak=UJkU1\boldsymbol{A}^{k}=\boldsymbol{U}\boldsymbol{J}^k\boldsymbol{U}^{-1}

and then

eAt=I+k=1Aktkk!=UIU1+Uk1Jktkk!U1=UeJtU1e^{\boldsymbol{A}t}=\boldsymbol{I}+\sum_{k=1}^{\infty}{\frac{\boldsymbol{A}^kt^k}{k!}}=\boldsymbol{UIU}^{-1}+\boldsymbol{U}\sum_{k1}^{\infty}{\frac{\boldsymbol{J}^kt^k}{k!}}\boldsymbol{U}^{-1}=\boldsymbol{U}e^{\boldsymbol{J}t}\boldsymbol{U}^{-1}

The problem now becomes calculating eJt\displaystyle e^{\boldsymbol{J}t}.

Let

J=(Jk1(λ1)00Jkm(λm))\boldsymbol{J}=\left( \begin{matrix} \boldsymbol{J}_{k_1}\left( \lambda _1 \right)& & 0\\ & \ddots& \\ 0& & \boldsymbol{J}_{k_m}\left( \lambda _m \right)\\\end{matrix} \right)

be a Jordan matrix, where Jki(λi)\boldsymbol{J}_{k_i}\left(\lambda_i\right) is a Jordan block of size kik_i and diagonal element λi\lambda_i, i.e.,

Jki(λi)=(λi1010λi) \boldsymbol{J}_{k_i}\left( \lambda _i \right) =\left( \begin{matrix} \lambda _i& 1& 0\\ & \ddots& 1\\ 0& & \lambda _i\\ \end{matrix} \right)

It’s easy to show that

eJt=(eJk1(λ1)00eJkm(λm)) e^{\boldsymbol{J}t}=\left( \begin{matrix} e^{\boldsymbol{J}_{k_1}\left( \lambda _1 \right)}& & 0\\ & \ddots& \\ 0& & e^{\boldsymbol{J}_{k_m}\left( \lambda _m \right)}\\\end{matrix} \right) 

Hence, it’s sufficient to calculate the exponential of Jordan blocks. Notice that

Jk(λ)=λI+N\boldsymbol{J}_k\left(\lambda\right)=\lambda\boldsymbol{I}+\boldsymbol{N}

where N\boldsymbol{N} is some nilpotent matrix whose power will vanish after finite many times, and that unit matrix commutes with any other matrix in terms of multiplication, i.e., IN=NI\boldsymbol{I}\boldsymbol{N}=\boldsymbol{N}\boldsymbol{I}. Therefore,

eJk(λ)t=ediag(λ)teNt=(eλt00eλt)(I+k=1pNktkk!)e^{\boldsymbol{J}_k\left( \lambda \right) t}=e^{\mathrm{diag}\left( \lambda \right) t}e^{Nt}=\left( \begin{matrix} e^{\lambda t}& & 0\\ & \ddots& \\ 0& & e^{\lambda t}\\ \end{matrix} \right) \left( \boldsymbol{I}+\sum_{k=1}^p{\frac{\boldsymbol{N}^kt^k}{k!}} \right)

However, the above calculation seems too complicated that it would be impractical. If we trace back what we are actually doing, we know that we are actually finding a basis of the solution space.

From previous discussion of the structure of the solution, we know immediately that for any basis B={b1,,bn}\mathcal{B}=\left\{\boldsymbol{b}_1,\cdots,\boldsymbol{b}_n\right\} of Rn\mathbb{R}^n, the system of functions F={eAtb1,,eAtbn}\mathcal{F}=\left\{e^{\boldsymbol{A}t}\boldsymbol{b}_1,\cdots,e^{\boldsymbol{A}t}\boldsymbol{b}_n\right\} is a fundamental system of solution. We collect them into one matrix and denoted as X\boldsymbol{X}, and is also called the fundamental matrix.

It turns out that if we choose the basis wisely, we don’t even need to calculate the whole eAte^{\boldsymbol{A}t} to get a fundamental matrix. From theory of Jordan normal form, we know that there exists a basis consists of (generalized) eigenvectors U={u1,,un}\boldsymbol{U}=\left\{\boldsymbol{u}_1,\cdots, \boldsymbol{u}_n\right\}, such that the Jordan matrix of A\boldsymbol{A} is given by J=U1AU\boldsymbol{J}=\boldsymbol{U}^{-1}\boldsymbol{AU}. Using this basis, we have the fundamental matrix simply given by

X(t)=UeJt\boldsymbol{X}\left(t\right)=\boldsymbol{U}e^{\boldsymbol{J}t}

The columns of X\boldsymbol{X} will therefore be the fundamental system of solutions.

Inhomogeneous, Constant Coefficients

Next, we solve an inhomogeneous equation with constant coefficients. The general solution is the superposition of a particular solution and the solution to the associated homogeneous equation.

Consider the following linear system of differential equation

x=Ax+b(t)\boldsymbol{x}'=\boldsymbol{Ax}+\boldsymbol{b}\left(t\right)

suppose F={x1,,xn}\mathcal{F}=\left\{\boldsymbol{x}_1,\cdots,\boldsymbol{x}_n\right\} is a fundamental system for the associated homogeneous equation, then the general solution is given by

x(t;c1,,cn)=k=1nckxk(t)+eAteAtb(t)dt\boldsymbol{x}\left(t;c_1,\cdots,c_n\right)=\sum_{k=1}^{n}c_k\boldsymbol{x}_k\left(t\right)+e^{\boldsymbol{A}t}\int e^{-\boldsymbol{A}t}\boldsymbol{b}\left(t\right)\mathrm{d}t

Variable Coefficients

There is no general method of solving homogeneous equations with variable coefficients,

x=A(t)x\boldsymbol{x}'=\boldsymbol{A}\left(t\right)\boldsymbol{x}

However, if we are given a fundamental system of solution F={x1,,xn}\mathcal{F}=\left\{\boldsymbol{x}_1,\cdots,\boldsymbol{x}_n\right\}, then we can find a particular solution to the inhomogeneous equation

x=A(t)x+b(t)\begin{equation}\boldsymbol{x}'=\boldsymbol{A}\left(t\right)\boldsymbol{x}+\boldsymbol{b}\left(t\right)\end{equation}

by method of variation of parameter.

Let

xp=k=1nck(t)xk(t)\boldsymbol{x}_p=\sum_{k=1}^{n}c_k\left(t\right)\boldsymbol{x}_k\left(t\right)

and plug into equation (7), and get

ddtxp=k=1n(ck(t)xk(t)+ck(t)xk(t))=A(t)xp+b(t) \frac{\mathrm{d}}{\mathrm{d}t}\boldsymbol{x}_p=\sum_{k=1}^n{\left( c_k'\left( t \right) \boldsymbol{x}_k\left( t \right) +c_k\left( t \right) \boldsymbol{x}_k'\left( t \right) \right)}=\boldsymbol{A}\left( t \right) \boldsymbol{x}_p+\boldsymbol{b}\left( t \right) 

leading to

k=1nck(t)xk(t)=b(t) \sum_{k=1}^n{c_k'\left( t \right) \boldsymbol{x}_k\left( t \right)}=\boldsymbol{b}\left( t \right) 

Let X(t)=(x1(t),,xn(t))\boldsymbol{X}\left(t\right)=\left(\boldsymbol{x}_1\left(t\right),\cdots,\boldsymbol{x}_n\left(t\right)\right) be the fundamental matrix, and c(t)=(c1(t)c2(t)cn(t))\displaystyle \boldsymbol{c}\left( t \right) =\left( \begin{array}{c} c_1\left( t \right)\\ c_2\left( t \right)\\ \vdots\\ c_n\left( t \right)\\ \end{array} \right)  then the previous equation is indeed the following linear system of equation

Xc=b\boldsymbol{X}\boldsymbol{c}'=\boldsymbol{b}

Using Cramer’s rule, we can find the solution to ck(t)c_k'\left(t\right).

ck(t)=detX(k)(t)detX(t)=W(k)(t)W(t)c_k'\left( t \right) =\frac{\det \boldsymbol{X}^{\left( k \right)}\left( t \right)}{\det \boldsymbol{X}\left( t \right)}=\frac{W^{\left( k \right)}\left( t \right)}{W\left( t \right)}

where

W(t)=det(x1(t),,xn(t))W\left(t\right)=\det \left(\boldsymbol{x}_1\left(t\right),\cdots,\boldsymbol{x}_n\left(t\right)\right)

is called the Wronskian.

Categories: Mathematics