6.2. Semi-discrete form for diffusion#

In this section we go back to the Poisson equation one last time. We have seen this PDE before in 1D and 2D, but only for steady-state (time-independent) problems. Here we make one more extension by allowing time-dependent behavior to arise. We will see one common way to treat time in FEM problems, by arriving at a semi-discrete system of equations we can solve numerically.

This page also allows you to switch between tensor notation and index notation. Try it out and see how different parts of the formulation become easier to write in one or the other notation.

Strong form equation#

As always, we start the formulation with the PDE in its strong form:

(6.4)#qρcu˙+f=0,withq=κu

where u is our solution field of interest: in heat conduction problems it would represent temperature; in mass diffusion it would be a concentration; in Darcy flow it would be the hydraulic head. Furthermore, q is a gradient-driven flux vector that depends on a diffusivity matrix κ. The new term here is ρcu˙, with ρc being a capacity parameter and u˙ the first time derivative of our solution, i.e. a velocity field.

The problem is completed by Dirichlet, Neumann and initial boundary conditions:

(6.5)#u=gonΓDqn=honΓNu=u0att=0

where g is a prescribed value for the solution field and n is the normal to the surface where Neumann boundary conditions are applied. Since we are solving a time-dependent problem, the third condition fixes an initial field u0 from which we will predict into the future.

As always, we start the formulation with the PDE in its strong form:

(6.6)#qi,iρcu˙+f=0,withqi=κiju,j

where u is our solution field of interest: in heat conduction problems it would represent temperature; in mass diffusion it would be a concentration; in Darcy flow it would be the hydraulic head. Furthermore, qi is a gradient-driven flux vector that depends on a diffusivity matrix κij. The new term here is ρcu˙, with ρc being a capacity parameter and u˙ the first time derivative of our solution, i.e. a velocity field.

The problem is completed by Dirichlet, Neumann and initial boundary conditions:

(6.7)#u=gonΓDqini=honΓNu=u0att=0

where g is a prescribed value for the solution field and ni is the normal to the surface where Neumann boundary conditions are applied. Since we are solving a time-dependent problem, the third condition fixes an initial field u0 from which we will predict into the future.

Weak form#

To get to the weak form we first multiply the PDE by a weight function w and integrate over the domain Ω:

(6.8)#Ωw(q)dΩΩwρcu˙dΩ+ΩwfdΩ=0

where w for this problem is always a scalar field, irrespective of the dimensionality of Ω. The next step is to eliminate the derivative q using the divergence theorem and integration by parts. Here we can directly use the result we already derived in Eq. (1.32):

(6.9)#ΩwqdΩΩwρcu˙dΩ+ΩwfdΩΓNwqndΓ=0

and we see that the derivative has now been transferred to w. This also causes an integral over Γ to appear. From the requirement that w=0 at ΓD, only the integral over ΓN remains. Finally, note that we did not touch the time derivative u˙, which means we are effectively treating u and u˙ as two independent fields.

As a final step, we substitute the Neumann boundary condition of Eq. (6.5) and the constitutive relation q=κu to get to the final weak form:

(6.10)#ΩwκudΩΩwρcu˙dΩ+ΩwfdΩ+ΓhwhdΓ=0

which is valid for any wV, an infinite-dimensional function space. Weak solutions to the PDE are equally valid and exact as solutions to the strong form. The only limitation is that they only obey the original PDE in a distribution sense. This is a sensible tradeoff, in exchange we get a whole new set of solutions that do not need to be twice differentiable.

To get to the weak form we first multiply the PDE by a weight function w and integrate over the domain Ω:

(6.11)#Ωwqi,idΩΩwρcu˙dΩ+ΩwfdΩ=0

where w for this problem is always a scalar field, irrespective of the dimensionality of Ω. The next step is to eliminate the derivative qi,i using the divergence theorem and integration by parts. Here we can directly use the result we already derived in Eq. (1.32):

(6.12)#Ωw,iqidΩΩwρcu˙dΩ+ΩwfdΩΓhwqinidΓ=0

and we see that the derivative has now been transferred to w. This also causes an integral over Γ to appear. From the requirement that w=0 at ΓD, only the integral over ΓN remains. Finally, note that we did not touch the time derivative u˙, which means we are effectively treating u and u˙ as two independent fields.

As a final step, we substitute the Neumann boundary condition of Eq. (6.7) and the constitutive relation qi=κiju,j to get to the final weak form:

(6.13)#Ωw,iκiju,jdΩΩwρcu˙dΩ+ΩwfdΩ+ΓhwhdΓ=0

which is valid for any wV, an infinite-dimensional function space. Weak solutions to the PDE are equally valid and exact as solutions to the strong form. The only limitation is that they only obey the original PDE in a distribution sense. This is a sensible tradeoff, in exchange we get a whole new set of solutions that do not need to be twice differentiable.

Semi-discrete form#

To get to a tractable FEM problem we need to restrict the space V. We do this through the Galerkin method, by defining shape function-based approximations to all of our fields:

(6.14)#uh=Na,u˙h=Na˙,wh=Nc

where the shape function matrices and DOF vectors take the shape:

(6.15)#N=[N1N2Nnn]a={a1a2ann}a˙={a˙1a˙2a˙nn}c={c1c2cnn}

with nn being the number of nodes in our finite element mesh. Now the problem has two DOFs per node, namely a solution value an and a velocity a˙n. We will also need gradients of uh and wh:

(6.16)#uh=Ba,wh=Bc

with:

(6.17)#B=[N1,xN2,xNnn,xN1,yN2,yNnn,y]

We now substitute the approximations above into the weak form. The goal is to reach a so-called semi-discrete form, indicating that only spatial derivatives are discretized while time derivatives remain untouched and are treated as additional DOFs.

Substitution yields:

(6.18)#Ω(Bc)TκBadΩΩ(Nc)TρcNa˙dΩ+Ω(Nc)TfdΩ+Γh(Nc)ThdΓ=0

Making use of Eq. (1.17) we can remove c, a and a˙ from the integrals:

(6.19)#cT(ΩBTκBdΩ)acT(ΩNTρcNdΩ)a˙+cT(ΩNTfdΩ)+cT(ΓhNThdΓ)=0

and cancel out c from all terms. We then gather the expressions into the more compact form:

(6.20)#Ka+Ma˙=f

where the time derivative makes the new term Ma˙ appear. We have then reached the final discretized form and can write down the terms one more time for completeness:

(6.21)#K=ΩBTκBdΩ
(6.22)#M=ΩNTρcNdΩ
(6.23)#f=ΩNTfdΩ+ΓhNThdΓ

To get to a tractable FEM problem we need to restrict the space V. We do this through the Galerkin method, by defining shape function-based approximations to all of our fields:

(6.24)#uh=Nnan,u˙h=Nna˙n,wh=Nmcm

where the shape function matrices and DOF vectors take the shape:

(6.25)#Nn=[N1N2Nnn]an={a1a2ann}a˙n={a˙1a˙2a˙nn}cm={c1c2cnn}

with nn being the number of nodes in our finite element mesh. Now the problem has two DOFs per node, namely a solution value an and a velocity a˙n. We will also need gradients of uh and wh:

(6.26)#u,jh=Nn,jan,w,ih=Nm,icm

with:

(6.27)#Nn,j=[N1,xN2,xNnn,xN1,yN2,yNnn,y]

We now substitute the approximations above into the weak form. The goal is to reach a so-called semi-discrete form, indicating that only spatial derivatives are discretized while time derivatives remain untouched and are treated as additional DOFs.

Substitution yields:

(6.28)#ΩNm,icmκijNn,jandΩΩNmcmρcNna˙ndΩ+ΩNmcmfdΩ+ΓhNmcmhdΓ=0

Making use of Eq. (1.17) we can remove cm, an and a˙n from the integrals:

(6.29)#cm(ΩNm,iκijNn,jdΩ)ancm(ΩNmρcNndΩ)a˙n+cm(ΩNmfdΩ)+cm(ΓhNmhdΓ)=0

and cancel out cm from all terms. We then gather the expressions into the more compact form:

(6.30)#Kmnan+Mmna˙n=fm

where the time derivative makes the new term Mmna˙n appear. We have then reached the final discretized form and can write down the terms one more time for completeness:

(6.31)#Kmn=ΩNm,iκijNn,jdΩ
(6.32)#Mmn=ΩNmρcNndΩ
(6.33)#fm=ΩNmfdΩ+ΓhNmhdΓ

Solving the semi-discrete system#

We have reached a semi-discrete system with nn equations, the number of nodes in the FE mesh. However, we now have twice as many DOFs: solution values a and velocities a˙. The problem seems under-determined, but that is only because we still did not take into account that a and a˙ are of course coupled in time and for now the FEM discretization is only taking care of spatial derivatives. We will address this on the next page.