Linear System of ODEs Theory

8 minute read ·

Published: October 25, 2023

An important kind of equation to learn is those linear equations, which are those can be written in the form of

Ly=b

for some linear differential operator $L$ and unknown $y$ . We begin by the example of second order homogeneous linear equations.

Second Order Linear Equations

Given a linear, second order, homogeneous ODE

\begin{equation}y''+p\left(t\right)y'+q\left(t\right)y=0\end{equation}

Now, we are going to show the following two things:

If $y_1,y_2$ are solution to (1), then $y=c_1y_1+c_2y_2$ is also a solution to (1) for some constants $c_1,c_2\in \mathbb{R}$ .

If $y_1$ and $y_2$ are two linearly independent solutions to (1), then all solution to (1) have the form $y=c_1y_1+c_2y_2$ for some constants $c_1,c_2\in\mathbb{R}$ .

The first one relies on the linearity of differential operator

L=\frac{\mathrm{d}^2}{\mathrm{d}t}+p\left(t\right)\frac{\mathrm{d}}{\mathrm{d}t}+q\left(t\right)

We can show that this operator is linear by observing that

$L\left(cy\right)=cL\left(y\right)$ for any constant $c$ .

$L\left(y_1+y_2\right)=L\left(y_1\right)+L\left(y_2\right)$ .

Then, for any constant $c_1,c_2$ , we have

L\left(c_1y_1+c_2y_2\right)=c_1L\left(y_1\right)+c_2L\left(y_2\right)=0

and the first one is proved.

The second one relies the Picard-Lindelof theorem also known as existence and uniqueness theorem:

Let $x_0\in\Omega$ , where $\Omega\subset\mathbb{R}^n$ is open and let $t_0\in I$ , where $I\subset\mathbb{R}$ be an interval.

Suppose $F:\Omega\times I\to\mathbb{R}^n$ is continuous in $t$ and Lipschitz continuous in $x$ , i.e. there exist a constant $L>0$ , such that for all $x,y\in\Omega$ and all $t\in I$ ,

\left\|F\left(x,t\right)-F\left(y,t\right)\right\|\le L\left\|x-y\right\|

Then, the initial value problem

\begin{align}x'=F\left(x,t\right) &&x\left(t_0\right)=x_0\end{align}

has one and only one solution in some $t$ interval containing $t_0$ .

An outline of the proof is as follows:

We can always construct a solution by using an iterative approximation method known as Picard iteration, which works as follows:

First, a solution to (2) will have to satisfy the integral equation

x\left(t\right)=x_0+\int_{t_0}^tF\left(x\left(s\right),s\right)\mathrm{d}s

Then, by setting $x^{(0)}\left(t\right)=x_0$ , and

x^{\left(k+1\right)}\left(t\right)=x_0+\int_{t_0}^tF\left(x^{\left(k\right)}\left(s\right),s\right)\mathrm{d}s

yields a sequence of function $x^{(k)}\left(t\right)$ . Then, under some condition on function $F$ , we can use Contraction Mapping Principle to show that the sequence converges to a unique function $x\left(t\right)$ .

Algebraic Structure of Linear Equation

An interesting question to consider in the proof is that why is there exactly two independent solutions to a second linear order equation? A simple answer to this might be that since it contain second order derivative, when we solve it we shall integrate twice and each gives an integrating constant $c$ which makes there are two constants $c_1,c_2$ to determine.

In fact, if we apply some knowledge from linear algebra, we may easy to think of the solutions as a vector space, known as solution space which happens to be 2 dimensional. And the general case would be, for a linear equation of order $n$ , the dimension of its solution space is exactly $n$ . Now, we want to prove this.

First notice that given an explicit ODE of order $n$ ,

\begin{equation}x^{\left(n\right)}\left(t\right)=f\left(x,x',x'',\dots,x^{\left(n-1\right)},t\right)\end{equation}

and set new variables

\begin{align*} x_1=x,&& x_2=x',&& x_3=x'',&& \cdots,&& x_n=x^{\left(n-1\right)} \end{align*}

Equation (3) can be written as a linear system of equations of order 1,

\left( \begin{array}{c} x_1'\\ x_2'\\ \vdots\\ x_n'\\\end{array} \right) =\left( \begin{array}{c} x_2\\ x_3\\ \vdots\\ f\left( x_1,x_2,\dots ,x_n,t \right)\\\end{array} \right)

or, simply denoted as

\boldsymbol{x}'=\boldsymbol{F}\left(\boldsymbol{x},t\right)

Therefore, for any linear equation of order $n$ , it is always equivalent to a system of $n$ first order linear equation.

Thence, we only need to consider the first order linear system of equations,

\boldsymbol{x}'=\boldsymbol{A}\left(t\right)\boldsymbol{x}+\boldsymbol{b}\left(t\right)

where $\boldsymbol{b}:\mathbb{R}\to\mathbb{R}^n$ , and $\boldsymbol{A}:\mathbb{R}\to\mathbb{R}^n\times\mathbb{R}^n$ .

Consider the associated homogeneous equation

\begin{equation}\boldsymbol{x}'=\boldsymbol{A}\left(t\right)\boldsymbol{x}\end{equation}

Notice that given any two solution $\boldsymbol{x}_1,\boldsymbol{x}_2$ , and any number $\lambda,\mu\in\mathbb{R}$ , their linear combination $\lambda\boldsymbol{x}_1+\mu\boldsymbol{x}_2$ is also a solution.

Suppose that a basis of vectors $\mathcal{B}=\left\{\boldsymbol{b}_1,\cdots,\boldsymbol{b}_n\right\}$ , of $\mathbb{R}^n$ is given and fix $t_0\in \mathbb{R}$ . Denote by $\boldsymbol{x}_k\left(t\right)$ the unique solution (theorem of Picard-Lindelof) to the initial value problem

\begin{align} \boldsymbol{x}_k'=\boldsymbol{A}\left(t\right)\boldsymbol{x}_k, && \boldsymbol{x}_k\left(t_0\right)=\boldsymbol{b}_k \end{align}

and denote

\mathcal{F}=\left\{\boldsymbol{x}_1,\cdots,\boldsymbol{x}_n\right\}

Then, by superposition principle, any element of $\mathrm{span}\mathcal{F}$ is a solution to (4)

Conversely, let $\boldsymbol{x}$ be any solution to (4), and suppose that $\boldsymbol{x}\left(t_0\right)=\boldsymbol{x}_0\in\mathbb{R}^n$ . We have

\boldsymbol{x}\left(t_0\right)=\boldsymbol{x}_0=\sum_{k=1}^{n}\lambda_k\boldsymbol{b}_k=\sum_{k=1}^{n}\lambda_k\boldsymbol{x}_k\left(t_0\right)

for some numbers $\lambda_1,\dots,\lambda_n\in\mathbb{R}$ . Therefore, by theorem of Picard-Lindelof,

\boldsymbol{x}=\sum_{k=1}^n\lambda_k\boldsymbol{x}_k

This shows that the solution space is given by $\mathrm{span}\mathcal{F}$ .

In addition, we have,

\forall t, \sum_{k=1}^n{\lambda _k}\boldsymbol{x}_k\left( t \right) =0\Rightarrow \sum_{k=1}^n{\lambda _k}\boldsymbol{x}_k\left( t_0 \right) =\sum_{k=1}^n{\lambda _k}\boldsymbol{b}_k=0\Rightarrow \lambda _k=0

which means $\mathcal{F}$ is a basis.

Now, let’s explain why a second order linear equation will have exactly two independent solutions. For equation (1), let $x_1=y,x_2=y'$ , and denote $\boldsymbol{x}=\left(x_1,x_2\right)$ , it is equivalent to

\boldsymbol{x}'=\left( \begin{matrix} 0& 1\\ -p\left( t \right)& -q\left( t \right)\\\end{matrix} \right) \boldsymbol{x}+\left( \begin{array}{c} 0\\ g\left( t \right)\\\end{array} \right)

which is a first order linear system of equation in two dimension.

Solution to Linear System of Equations

Homogeneous, Constant Coefficients

First, consider the most basic case of linear system of equations, that is a homogeneous one with constant coefficients:

\begin{align}\boldsymbol{x}'=\boldsymbol{A}\boldsymbol{x},&&\boldsymbol{x}\left(0\right)=\boldsymbol{x}_0 \end{align}

where $\boldsymbol{x}:\mathbb{R}\to\mathbb{R}^n$ and $\boldsymbol{A}\in \mathcal{M}\left(n\times n;\mathbb{R}\right)$ .

With proper definition, it can be shown that the solution to the initial value problem (6) is

\boldsymbol{x}\left(t\right)=e^{\boldsymbol{A}t}\boldsymbol{x}_0

where

e^{\boldsymbol{A}t}=\boldsymbol{I}+\sum_{k=1}^{\infty}{\frac{\boldsymbol{A}^kt^k}{k!}}

Now, the problem is how to calculate this matrix exponential. Recall from linear algebra that diagonal matrix, or Jordan matrix, are relative easy to calculate the matrix exponential. Fortunately, we have the Jordan Normal Form theorem which states that every complex matrix is similar to a Jordan matrix, i.e., for any matrix $\boldsymbol{A}$ , there exists a matrix $\boldsymbol{U}$ and a Jordan matrix $\boldsymbol{J}$ such that

\boldsymbol{A}=\boldsymbol{U}\boldsymbol{J}\boldsymbol{U}^{-1}

Then, we can easily see that

\boldsymbol{A}^{k}=\boldsymbol{U}\boldsymbol{J}^k\boldsymbol{U}^{-1}

and then

e^{\boldsymbol{A}t}=\boldsymbol{I}+\sum_{k=1}^{\infty}{\frac{\boldsymbol{A}^kt^k}{k!}}=\boldsymbol{UIU}^{-1}+\boldsymbol{U}\sum_{k1}^{\infty}{\frac{\boldsymbol{J}^kt^k}{k!}}\boldsymbol{U}^{-1}=\boldsymbol{U}e^{\boldsymbol{J}t}\boldsymbol{U}^{-1}

The problem now becomes calculating $\displaystyle e^{\boldsymbol{J}t}$ .

Let

\boldsymbol{J}=\left( \begin{matrix} \boldsymbol{J}_{k_1}\left( \lambda _1 \right)& & 0\\ & \ddots& \\ 0& & \boldsymbol{J}_{k_m}\left( \lambda _m \right)\\\end{matrix} \right)

be a Jordan matrix, where $\boldsymbol{J}_{k_i}\left(\lambda_i\right)$ is a Jordan block of size $k_i$ and diagonal element $\lambda_i$ , i.e.,

\boldsymbol{J}_{k_i}\left( \lambda _i \right) =\left( \begin{matrix} \lambda _i& 1& 0\\ & \ddots& 1\\ 0& & \lambda _i\\ \end{matrix} \right)

It’s easy to show that

e^{\boldsymbol{J}t}=\left( \begin{matrix} e^{\boldsymbol{J}_{k_1}\left( \lambda _1 \right)}& & 0\\ & \ddots& \\ 0& & e^{\boldsymbol{J}_{k_m}\left( \lambda _m \right)}\\\end{matrix} \right)

Hence, it’s sufficient to calculate the exponential of Jordan blocks. Notice that

\boldsymbol{J}_k\left(\lambda\right)=\lambda\boldsymbol{I}+\boldsymbol{N}

where $\boldsymbol{N}$ is some nilpotent matrix whose power will vanish after finite many times, and that unit matrix commutes with any other matrix in terms of multiplication, i.e., $\boldsymbol{I}\boldsymbol{N}=\boldsymbol{N}\boldsymbol{I}$ . Therefore,

e^{\boldsymbol{J}_k\left( \lambda \right) t}=e^{\mathrm{diag}\left( \lambda \right) t}e^{Nt}=\left( \begin{matrix} e^{\lambda t}& & 0\\ & \ddots& \\ 0& & e^{\lambda t}\\ \end{matrix} \right) \left( \boldsymbol{I}+\sum_{k=1}^p{\frac{\boldsymbol{N}^kt^k}{k!}} \right)

However, the above calculation seems too complicated that it would be impractical. If we trace back what we are actually doing, we know that we are actually finding a basis of the solution space.

From previous discussion of the structure of the solution, we know immediately that for any basis $\mathcal{B}=\left\{\boldsymbol{b}_1,\cdots,\boldsymbol{b}_n\right\}$ of $\mathbb{R}^n$ , the system of functions $\mathcal{F}=\left\{e^{\boldsymbol{A}t}\boldsymbol{b}_1,\cdots,e^{\boldsymbol{A}t}\boldsymbol{b}_n\right\}$ is a fundamental system of solution. We collect them into one matrix and denoted as $\boldsymbol{X}$ , and is also called the fundamental matrix.

It turns out that if we choose the basis wisely, we don’t even need to calculate the whole $e^{\boldsymbol{A}t}$ to get a fundamental matrix. From theory of Jordan normal form, we know that there exists a basis consists of (generalized) eigenvectors $\boldsymbol{U}=\left\{\boldsymbol{u}_1,\cdots, \boldsymbol{u}_n\right\}$ , such that the Jordan matrix of $\boldsymbol{A}$ is given by $\boldsymbol{J}=\boldsymbol{U}^{-1}\boldsymbol{AU}$ . Using this basis, we have the fundamental matrix simply given by

\boldsymbol{X}\left(t\right)=\boldsymbol{U}e^{\boldsymbol{J}t}

The columns of $\boldsymbol{X}$ will therefore be the fundamental system of solutions.

Inhomogeneous, Constant Coefficients

Next, we solve an inhomogeneous equation with constant coefficients. The general solution is the superposition of a particular solution and the solution to the associated homogeneous equation.

Consider the following linear system of differential equation

\boldsymbol{x}'=\boldsymbol{Ax}+\boldsymbol{b}\left(t\right)

suppose $\mathcal{F}=\left\{\boldsymbol{x}_1,\cdots,\boldsymbol{x}_n\right\}$ is a fundamental system for the associated homogeneous equation, then the general solution is given by

\boldsymbol{x}\left(t;c_1,\cdots,c_n\right)=\sum_{k=1}^{n}c_k\boldsymbol{x}_k\left(t\right)+e^{\boldsymbol{A}t}\int e^{-\boldsymbol{A}t}\boldsymbol{b}\left(t\right)\mathrm{d}t

Variable Coefficients

There is no general method of solving homogeneous equations with variable coefficients,

\boldsymbol{x}'=\boldsymbol{A}\left(t\right)\boldsymbol{x}

However, if we are given a fundamental system of solution $\mathcal{F}=\left\{\boldsymbol{x}_1,\cdots,\boldsymbol{x}_n\right\}$ , then we can find a particular solution to the inhomogeneous equation

\begin{equation}\boldsymbol{x}'=\boldsymbol{A}\left(t\right)\boldsymbol{x}+\boldsymbol{b}\left(t\right)\end{equation}

by method of variation of parameter.

Let

\boldsymbol{x}_p=\sum_{k=1}^{n}c_k\left(t\right)\boldsymbol{x}_k\left(t\right)

and plug into equation (7), and get

\frac{\mathrm{d}}{\mathrm{d}t}\boldsymbol{x}_p=\sum_{k=1}^n{\left( c_k'\left( t \right) \boldsymbol{x}_k\left( t \right) +c_k\left( t \right) \boldsymbol{x}_k'\left( t \right) \right)}=\boldsymbol{A}\left( t \right) \boldsymbol{x}_p+\boldsymbol{b}\left( t \right)

leading to

\sum_{k=1}^n{c_k'\left( t \right) \boldsymbol{x}_k\left( t \right)}=\boldsymbol{b}\left( t \right)

Let $\boldsymbol{X}\left(t\right)=\left(\boldsymbol{x}_1\left(t\right),\cdots,\boldsymbol{x}_n\left(t\right)\right)$ be the fundamental matrix, and $\displaystyle \boldsymbol{c}\left( t \right) =\left( \begin{array}{c} c_1\left( t \right)\\ c_2\left( t \right)\\ \vdots\\ c_n\left( t \right)\\ \end{array} \right)$ then the previous equation is indeed the following linear system of equation

\boldsymbol{X}\boldsymbol{c}'=\boldsymbol{b}

Using Cramer’s rule, we can find the solution to $c_k'\left(t\right)$ .

c_k'\left( t \right) =\frac{\det \boldsymbol{X}^{\left( k \right)}\left( t \right)}{\det \boldsymbol{X}\left( t \right)}=\frac{W^{\left( k \right)}\left( t \right)}{W\left( t \right)}

where

W\left(t\right)=\det \left(\boldsymbol{x}_1\left(t\right),\cdots,\boldsymbol{x}_n\left(t\right)\right)

is called the Wronskian.

Share on

Bluesky Facebook LinkedIn X (formerly Twitter)

You May Also Enjoy

Optimization Part 1 Basics and Least Squares

March 31, 2026 Mathematics

In common engineering problems, the optimization problem is to minimize the residual of a parametric model with respect to some observed data points.Given N...

Occupancy Grid Mapping

March 22, 2026 Robotics

Besides state estimation or localization, which provide a robot with knowledge of where it is, it’s equally important for a mobile robot to perceive its surr...

Lie Theory in Robot Motion

March 10, 2026 Robotics

This note covers some of the fundamental concepts in Lie group and Lie algebra, and their applications to representing rigid body motion in robotics. A group...

Bayes Filtering and State Estimation

March 4, 2026 Robotics

In robot state estimation, the Bayes filter is a probabilistic approach that estimates the state from a sequence of controls and measurements by recursively ...

Guanyu Xu