Skip to content

Introduction to Linear Partial Differential Equations*

This is the note from MNE8108 Engineering Methods in the Department of Mechanical Engineering, CityU (semester A, 2025)

Contents mostly From Advanced Calculus for Applications by Francis Begnaud Hildebrand

Partial differential equations definition

Motivation

Why do we study partial deferential equations (PDEs) and analytic solutions?

We are interested in PDEs because most of mathematical physics is described by such equations. For example, fluids dynamics (and more generally continuous media dynamics), electromagnetic theory, quantum mechanics and etc.

Typically, a given PDE will only be accessible to numerical solution and analytic solutions in a practical or research scenario are often impossible. However, it is vital to understand the general theory in order to conduct a sensible investigation. For example, we may need to understand what type of PDE we have to ensure the numerical solution is valid. Indeed, certain types of equations need appropriate boundary conditions; without a knowledge of the general theory it is possible that the problem may be ill-posed and the solution is erroneous.

Definition

Definition: Partial derivatives

Partial derivatives: The differential (or differential form) of a function f of n independent variables, (x1,x2,,xn), is a linear combination of the basis form (dx1, dx2,, dxn)

df=i=1nfxi dxi=fx1 dx1+fx2 dx2++fxn dxn,

where the partial derivatives are defined by

fxi=limh0f(x1,x2,,xi+h,,xn)f(x1,x2,,xi,,xn)h.

A partial differential equation (PDE) is an equation for some quantity u (dependent variable) which depends on the independent variables x1,x2,x3,,xn,n2, and involves derivatives of u with respect to at least some of the independent variables.

F(x1,,xn,x1u,,xnu,x12u,x1x22u,,x1xnnu)=0.

Linear nth-order PDE

The order of the PDE is the order of the highest (partial) differential coefficient in the equation.

As with ordinary differential equations (ODEs) it is important to be able to distinguish between linear and nonlinear equations.

A linear equation is one in which the equation and any boundary or initial conditions do not include any product of the dependent variables or their derivatives; an equation that is not linear is a nonlinear equation.

ut+cux=0, first order linear PDE (simplest wave equation), 2ux2+2uy2=Φ(x,y), second order linear PDE (Poisson). 

Principle of superposition: A linear equation has the useful property that if u1 and u2 both satisfy the equation then so does αu1+βu2 for any α,βR. This is often used in constructing solutions to linear equations (for example, so as to satisfy boundary or initial conditions; c.f. Fourier series methods). This is not true for nonlinear equations, which helps to make this sort of equations more interesting, but much more difficult to deal with.

Example of Linear PDEs: Wave Equations

Waves on a string, sound waves, waves on stretch membranes, electromagnetic waves, etc.

2ux2=1c22ut2,

or more generally

1c22ut2=2u

where c is a constant (wave speed).

Example of Linear PDEs: Heat Conduction

ut=κ2ux2,

or more generally

ut=κ2u,

or even

ut=(κu)

where κ is a constant (diffusion coefficient or thermometric conductivity). Both those equations (wave and diffusion) are linear equations and involve time ( t ). They require some initial conditions (and possibly some boundary conditions) for their solution.

1st order PDE solution

First Order Linear PDE: Method of Characteristic

We consider linear first order partial differential equation in two independent variables:

(2.1)a(x,y)ux+b(x,y)uy+c(x,y)u=f(x,y),

where a,b,c and f are continuous in some region of the plane and we assume that a(x,y) and b(x,y) are not zero for the same (x,y).

The key to the solution of the equation (2.1) is to find a change of variables (or a change of coordinates)

ξξ(x,y),ηη(x,y)

which transforms (2.1) into the simpler equation

(2.3)wξ+h(ξ,η)w=F(ξ,η)

where w(ξ,η)=u(x(ξ,η),y(ξ,η)).

We shall define this transformation so that it is one-to-one, at least for all (x,y) in some set D of points in the xOy plane. Then, on D we can (in theory) solve for x and y as functions of ξ,η. To ensure that we can do this, we require that the Jacobian of the transformation does not vanish in D :

J=|ξxξyηxηy|=ξxηyξyηx{0,}

for (x,y) in D. We begin looking for a suitable transformation by computing derivatives via the chain rule

ux=wξξx+wηηx and uy=wξξy+wηηy.

We substitute these into equation (2.1) to obtain

a(wξξx+wηηx)+b(wξξy+wηηy)+cw=f.

We can rearrange this as

(2.4)(aξx+bξy)wξ+(aηx+bηy)wη+cw=f.

This is close to the form of equation (2.1) if we can choose ηη(x,y) so that

aηx+bηy=0 for (x,y) in D.

Provided that η/y0 we can express this required property of η as

xηyη=ba.

Then the equation is

(aξx+bξy)wξ+cw=f.

Suppose we can define a new variable (or coordinate) η which satisfies this constraint. What is the equation describing the curves of constant η ? Putting ηη(x,y)=k ( k an arbitrary constant), then

dη=ηx dx+ηy dy=0

implies that dy/dx=xη/yη=b/a. So, the equation η(x,y)=k defines solutions of the ODE

(2.5)dy dx=b(x,y)a(x,y).

Equation (2.5) is called the characteristic equation of the linear equation (2.1). Its solution can be written in the form F(x,y,η)=0 (where η is the constant of integration) and defines a family of curves in the plane called characteristics or characteristic curves of (2.1). (More on characteristics later.) Characteristics represent curves along which the independent variable η of the new coordinate system (ξ,η) is constant.

So, we have made the coefficient of w/η vanish in the transformed equation (2.4), by choosing ηη(x,y), with η(x,y)=k an equation defining the solution of the characteristic equation (2.5). We can now choose ξ arbitrarily (or at least to suit our convenience), providing we still have J0. An obvious choice is

ξξ(x,y)=x

Then

J=|10ηxηy|=ηy,

and we have already assumed this non-zero.

Now we see from equation (2.4) that this change of variables,

ξ=x,ηη(x,y)

transforms equation (2.1) to

α(x,y)wξ+c(x,y)w=f(x,y)

where α=aξ/x+bξ/y. To complete the transformation to the form of equation (2.3), we first write α(x,y),c(x,y) and f(x,y) in terms of ξ and η to obtain

A(ξ,η)wξ+C(ξ,η)w=ρ(ξ,η)

Finally, restricting the variables to a set in which A(ξ,η)0 we have

wξ+CAw=ρA,

which is in the form of (2.3) with

h(ξ,η)=C(ξ,η)A(ξ,η) and F(ξ,η)=ρ(ξ,η)A(ξ,η)

The point of this transformation is that we can solve equation (2.3). Think of

wξ+h(ξ,η)w=F(ξ,η)

as a linear first order ordinary differential equation in ξ, with η carried along as a parameter. Thus we use an integrating factor method

eh(ξ,η)dξwξ+h(ξ,η)eh(ξ,η)dξw=F(ξ,η)eh(ξ,η)dξξ(eh(ξ,η)dξw)=F(ξ,η)eh(ξ,η)dξ

Now we integrate with respect to ξ. Since η is being carried as a parameter, the constant of integration may depend on η

eh(ξ,η)dξw=F(ξ,η)eh(ξ,η)dξ dξ+g(η)

in which g is an arbitrary differentiable function of one variable. Now the general solution of the transformed equation is

w(ξ,η)=eh(ξ,η)dξF(ξ,η)eh(ξ,η)dξ dξ+g(η)eh(ξ,η)dξ

We obtain the general form of the original equation by substituting back ξ(x,y) and η(x,y) to get

u(x,y)=eα(x,y)[β(x,y)+g(η(x,y))].

Example: The method of characteristic

Example: Consider the constant coefficient equation

aux+buy+cu=0

where a,b,cR. Assume a0, the characteristic equation is

dy/dx=b/a

with general solution defined by the equation

bxay=k,k constant .

So the characteristics of the PDE are the straight line graphs of bxay=k and we make the transformation with

ξ=x,η=bxay

Using the substitution we find the equation transforms to

wξ+caw=0

The integrating factor method gives

ξ(ecξ/aw)=0

and integrating with respect to ξ gives

ecξ/aw=g(η)

where g is any differentiable function of one variable. Then

w=g(η)ecξ/a

and in terms of x and y we back transform

u(x,y)=g(bxay)ecx/a

Example 1: More specific Linear first order PDE

Consider

2ux+3uy+8u=0.

The characteristic equation is

dy dx=32

and the characteristics are the straight line graphs 3x2y=c. Hence we take η=3x2y and ξ=x.

(We can see that an η and ξ cross only once they are independent, i.e. J0;η and ξ have been properly chosen.)

This gives the solution

u(x,y)=e4xg(3x2y)

where g is a differentiable function defined over the real line. Simply specifying the solution at a given point (as in ODEs) does not uniquely determine g; we need to take a curve of initial conditions.

Suppose we specify values of u(x,y) along a curve Γ in the plane. For example, let's choose Γ as the x-axis and gives values of u(x,y) at points on Γ, say

u(x,0)=sin(x)

Then we need

u(x,0)=e4xg(3x)=sin(x) i.e. g(3x)=sin(x)e4x

and putting t=3x,

g(t)=sin(t/3)e4t/3

This determines g and the solution satisfying the condition u(x,0)=sin(x) on Γ is

u(x,y)=sin(x2y/3)e8y/3

We have determined the unique solution of the PDE with u specified along the x-axis. We do not have to choose an axis - say, along x=y,u(x,y)=u(x,x)=x4. From the general solution this requires,

u(x,x)=e4xg(x)=x4, so g(x)=x4e4x

to give the unique solution

u(x,y)=(3x2y)4e8(xy)

satisfying u(x,x)=x4.

However, not every curve in the plane can be used to determine g. Suppose we choose Γ to be the line 3x2y=1 and prescribe values of u along this line, say

u(x,y)=u(x,(3x1)/2)=x2.

Now we must choose g so that

e4xg(3x(3x1))=x2.

This requires g(1)=x2e4x (for all x ). This is impossible and hence there is no solution taking the value x2 at points (x,y) on this line.

Last, we consider again Γ to be the line 3x2y=1 but choose values of u along this line to be

u(x,y)=u(x,(3x1)/2)=e4x.

Now we must choose g so that

e4xg(3x(3x1))=e4x

This requires g(1)=1, condition satisfied by an infinite number of functions and hence there is an infinite number of solutions taking the values e4x on the line 3x2y=1.

Depending on the initial conditions, the PDE has one unique solution, no solution at all or an infinite number or solutions. The difference is that the x-axis and the line y=x are not the characteristics of the PDE while the line 3x2y=1 is a characteristic.

Example 2

xuxyuy=u with u=x2 on y=x,1y2

Characteristics:

dy dx=yx d(xy)=0xy=c, constant. 

So, take η=xy and ξ=x. Then the equation becomes

xywη+xwξxywη=wξwξw=0ξwξ=0.

Finally the general solution is, w=ξg(η) or equivalently u(x,y)=xg(xy). When y=x with 1y2,u=x2; so x2=xg(x2)g(x)=x and the solution is

u(x,y)=xxy.

This figure presents the characteristic curves given by xy= constant. The red characteristics show the domain where the initial conditions permit us to determine the solution.

Application: Linear Waves

If u is small (i.e. u2u ), then the equation approximate to the linear wave equation

ut+cux=0 with u(x,0)=f(x).

The solution of the equation of characteristics, dx/dt=c, gives the first integral of the PDE, η(x,t)=xct, and then general solution u(x,t)=g(xct), where the function g is determined by the initial conditions. Applying u(x,0)=f(x) we find that the linear wave equation has the solution u(x,t)=f(xct), which represents a wave (unchanging shape) propagating with constant wave speed c.

Note that u is constant where xct= constant, i.e. on the characteristics.

2nd order PDE solution

Second Order Linear PDE

Consider a general second order linear equation in two independent variables

a(x,y)2ux2+2b(x,y)2uxy+c(x,y)2uy2+d(x,y)ux+e(x,y)uy+f(x,y)u=g(x,y);

Recall, for a first order linear and semilinear equation, au/x+bu/y=c, we could define new independent variables, ξ(x,y) and η(x,y) with J=(ξ,η)/(x,y){0,}, to reduce the equation to the simpler form, u/ξ=κ(ξ,η).

For the second order equation, can we also transform the variables from (x,y) to (ξ,η) to put the equation into a simpler form?

As before we compute chain rule derivations

ux=uξξx+uηηx,uy=uξξy+uηηy,2ux2=2uξ2(ξx)2+22uξηξxηx+2uη2(ηx)2+uξ2ξx2+uη2ηx2,2uy2=2uξ2(ξy)2+22uξηξyηy+2uη2(ηy)2+uξ2ξy2+uη2ηy2,2uxy=2uξ2ξxξy+2uξη(ξxηy+ξyηx)+2uη2ηxηy+uξ2ξxy+uη2ηxy.

The equation becomes

(3.1)A2uξ2+2B2uξη+C2uη2+F(uξ,uη,u,ξ,η)=0,

where

A=a(ξx)2+2bξxξy+c(ξy)2,B=aξxηx+b(ξxηy+ξyηx)+cξyηy,C=a(ηx)2+2bηxηy+c(ηy)2.

Equation (3.1) can be simplified if we can choose ξ and η so that some of the coefficients A, B or C are zero. Let us define,

Dξ=ξ/xξ/y and Dη=η/xη/y

then we can write

A=(aDξ2+2bDξ+c)(ξy)2B=(aDξDη+b(Dξ+Dη)+c)ξyηyC=(aDη2+2bDη+c)(ηy)2.

Now consider the quadratic equation Vanish the item: C2uη2 or A2uξ2

(3.2)aD2+2bD+c=0,

whose solution is given by

D=b±b2aca.
  • If the discriminant b2ac0:

equation (3.2) has two distinct roots; so, we can make both coefficients A and C zero if we arbitrarily take the root with the negative sign for Dξ and the one with the positive sign for Dη,

(3.3)Dξ=ξ/xξ/y=bb2acaA=0,Dη=η/xη/y=b+b2acaC=0.

Then, using DξDη=c/a and Dξ+Dη=2b/a we have

B=2a(acb2)ξyηyB0.

Furthermore, if the discriminant b2ac>0:

then Dξ and Dη as well as ξ and η are real. So, we can define two families of one-parameter characteristics of the PDE as the curves described by the equation ξ(x,y)= constant and the equation η(x,y)= constant. Differentiate ξ along the characteristic curves given by ξ= constant,

dξ=ξx dx+ξy dy=0

and make use of (3.3) to find that this characteristics satisfy

(3.4)dy dx=b+b2aca.
ξx dx=ξy dy

Similarly we find that the characteristic curves described by η(x,y)= constant satisfy

(3.5)dy dx=bb2aca.
  • If the discriminant b2ac=0:

equation (3.2) has one unique root and if we take this root for Dξ say, we can make the coefficient A zero,

Dξ=ξ/xξ/y=baA=0

To get η independent of ξ,Dη has to be different from Dξ, so C0 in this case, but B is now given by

B=(abaDη+b(ba+Dη)+c)ξyηy=(b2a+c)ξyηy,

so that B=0. When b2ac=0 the PDE has only one family of characteristic curves, for ξ(x,y)=constant, whose equation is now

(3.6)dy dx=ba.

Classification of 2nd  Order Linear PDE

  1. If b2>ac we can apply the change of variable (x,y)(η,ξ) to transform the original PDE to
2uξη+( lower order terms )=0.

In this case the equation is said to be hyperbolic and has two families of characteristics given by equation (3.4) and equation (3.5).

  1. If b2=ac, a suitable choice for ξ still simplifies the PDE, but now we can choose η arbitrarily - provided η and ξ are independent - and the equation reduces to the form
2uη2+( lower order terms )=0

The equation is said to be parabolic and has only one family of characteristics given by equation (3.6).

  1. If b2<ac we can again apply the change of variables (x,y)(η,ξ) to simplify the equation but now this functions will be complex conjugate. To keep the transformation real, we apply a further change of variables (ξ,η)(α,β) via
α=ξ+η=2(η),β=i(ξη)=2(η), i.e., 2uξη=2uα2+2uβ2 (via the chain rule); 

so, the equation can be reduced to

2uα2+2uβ2+( lower order terms )=0.

In this case the equation is said to be elliptic and has no real characteristics.

Example

  • The wave equation,
2ut2cw22ux2=0,

is hyperbolic ( b2ac=cw2>0 ) and the two families of characteristics are described by dx/dt=±cw i.e. ξ=xcwt and η=x+cwt. So, the equation transforms into its canonical form 2u/ξη=0 whose solutions are waves travelling in opposite direction at speed cw.

  • The diffusion (heat conduction) equation,
2ux21κut=0,

is parabolic ( b2ac=0 ). The characteristics are given by dt/dx=0 i.e. ξ=t= constant.

  • Laplace's equation,
2ux2+2uy2=0,

is elliptic (b2ac=1<0).

Example 1: Reduce to the canonical form

y22ux22xy2uxy+x22uy2=1xy(y3ux+x3uy).

Here a=y2b=xyc=x2} so b2ac=(xy)2x2y2=0 parabolic equation. On ξ= constant,

dy dx=b+b2aca=ba=xyξ=x2+y2.

We can choose η arbitrarily provided ξ and η are independent. We choose η=y. (Exercise, try it with η=x.) Then

ux=2xuξ,uy=2yuξ+uη,2ux2=2uξ+4x22uξ22uxy=4xy2uξ2+2x2uξη,2uy2=2uξ+4y22uξ2+4y2uξη+2uη2

and the equation becomes

2y2uξ+4x2y22uξ28x2y22uξ24x2y2uξη+2x2uξ+4x2y22uξ2+4x2y2uξη+x22uη2=1xy(2xy3uξ+2x3yuξ+x3uη)i.e.2uη21ηuη=0

This has solution

u=f(ξ)+η2g(ξ),

where f and g are arbitrary functions (via integrating factor method), i.e.

u=f(x2+y2)+y2g(x2+y2).

Released under the MIT License.