Thermal Convection¶
Thermal convection describes the a process in which a fluid organizes itself into a structured flow pattern on a macroscopic scale to transport energy. Convection may be mechanically driven by stirring, but more commonly we refer to natural convection in which buoyancy due to a source of heat (and/or compositional variation) induces flow which transports and dissipates this anomalous buoyancy.
The Earth’s interior, on a geological timescale is a highly viscous fluid which is heated from below by heat escaping from the core, and internally by the decay of radioactive elements. In this respect
Description - what is involved …
Goal is to understand finite amplitude convection with complicated rheology and realistic initial, boundary conditions.
Critical Rayleigh Number for a layer¶
Does convection always occur in a layer heated from below ? In principle this would always provide a way to transport additional heat, but how much work would convection have to do in order to transport this extra heat ? One way to determine the answer is to consider small disturbances to a layer with otherwise uniform temperature and see under what conditions the perturbations grow (presumably into fully developed convection). This approach allows us to make linear approximations to the otherwise non-linear equations by dropping the small, high order non-linear terms.
We solve the incompressible flow equations (stream function form, [eq:biharm]{reference-type=”ref” reference=”eq:biharm”}) and energy conservation equation in stream function form:
By substituting throughout for a temperature which is a conductive profile with a small amplitude disturbance, \(\theta \ll 1\)
Remember that the equations are non-dimensional so that the layer depth is one, and the temperature drop is one.
The advection term
is dominated by the \(\partial \psi / \partial x_1\) since all others are the product of small terms. (Since we also know that \(\psi \sim \theta\) from equation ([eq:biharm]{reference-type=”ref” reference=”eq:biharm”})). Therefore the energy conservation equation becomes
which is linear.
Boundary conditions for this problem are zero normal velocity on \(x_2 = 0,1\) which implies \(\psi=0\) at these boundaries. The form of the perturbation is such that \(\theta =0\) on \(x_2 = 0,1\), and we allow free slip along these boundaries such that
when \(x_2 = 0,1\) which implies \(\nabla^2 \psi =0\) there.
Now introduce small harmonic perturbations to the driving terms and assuming a similar (i.e. harmonic) response in the flow. This takes the form
So that we can now separate variables. \(\sigma\) is unknown, however, if \(\sigma < 0\) then the perturbations will decay, whereas if \(\sigma > 0\) they will grow.

Fig. 10 Critical Rayleigh Number determination. A plot of growth rates for harmonic perturbations as a function of wavenumber for different \({\rm Ra}\). The critical value occurs when the maximum of the curve just touches the horizontal axis at zero.¶
Substituting for the perturbations into the biharmonic equation and the linearized energy conservation equation gives
and
Here we have shown and used the fact that
when a function is expanded in the form \(\phi(x,z) = \Phi(z).\sin kx\) – more generally, this is the fourier transform of the Laplacian operator.
Eliminating \(\Psi\) between ([eq:psitheta1]{reference-type=”ref” reference=”eq:psitheta1”}) and ([eq:psitheta2]{reference-type=”ref” reference=”eq:psitheta2”}) gives
This has a solution
which satisfies all the stated boundary conditions and implies
a real function of \(k\) and \(\rm Ra\).
For a given wavenumber, what is the lowest value of \(\rm Ra\) for which perturbations at that wavenumber will grow ?
The absolute minimum value of \({\rm Ra}\) which produces growing perturbations is found by differentiating \({\rm Ra_0}\) with respect to \(k\) and setting equal to zero to find the extremum.
for a wavenumber of
corresponding to a wavelength of 2.828 times the depth of the layer.
Different boundary conditions produce different values of the critical Rayleigh number. If no-slip conditions are used, for example, then the \(\Theta\) solution applied above does not satisfy the boundary conditions. In general, the critical Rayleigh number lies between about 100 and 3000.
Boundary layer theory, Boundary Rayleigh Number¶
Having determined the conditions under which convection will develop, we next consider what can be calculated about fully developed convection – i.e. when perturbations grow well beyond the linearization used to study the onset of instability.
Let’s consider fully developed convection with high Rayleigh number. From observations of real fluids in laboratory situations, it is well known how this looks. High Rayleigh number convection is dominated by the advection of heat. Diffusion is too slow to carry heat far into the fluid before the buoyancy anomaly becomes unstable. This leads to thin, horizontal “boundary layers” where diffusive heat transfer into and out of the fluid occurs. These are separated by approximately isothermal regions in the fluid interior. The horizontal boundary layers are connected by vertical boundary layers which take the form of sheets or cylindrical plumes depending on a number of things including the Rayleigh number. For the time being we consider only the sheet like downwellings since that allows us to continue working in 2D.

Fig. 11 Boundary Layer Theory in its simplest form¶
Boundary layer analysis is a highly sophisticated field, and is used in a broad range of situations where differences in scales between different physical effects produce narrow accommodation zones where the ‘weaker’ term dominates (e.g viscosity in ‘invicid’ flow around an obstacle).
Here we first make a wild stab at an approximate theory describing the heat flow from a layer with a given Rayleigh number. The convective flow is shown in Figure 1{reference-type=”ref” reference=”fig:blt1”} together with our rough sketch of what actually happens.
Assuming the simplified flow pattern of the sketch, steady state, and replacing all derivatives by crude differences we obtain (using a vorticity form)
and
where \(\omega\sim v / d\) from the local rotation interpretation of vorticity and the approximate rigid-body rotation of the core of the convection cell, and \(v/d \sim \kappa / \delta^2\).
This gives
This theory balances diffusion of vorticity and temperature across and out of the boundary layer with advection of each quantity along the boundary layer to maintain a steady state.
The Nusselt number is the ratio of advected heat transport to that purely conducted in the absence of fluid motion, or, using the above approximations,
This latter result being observed in experiments to be in reasonably good agreement with observation.
If we define a boundary Rayleigh number
then the expression for \(\delta\) gives
so the boundary layer does not become more or less stable with increasing Rayleigh number (this is not universal – for internal heating the boundary layer becomes less stable at higher Rayleigh number).

Fig. 12 Boundary Layer Theory which accounts for the thickness variations along the boundary layer as it matures.¶
Another wrinkle can be added to the boundary layer theory by trying to account for the variation in the boundary layer thickness as it moves along the horizontal boundary. This refinement in the theory can account for the form of this thickness, the potential energy change in rising or sinking plumes, and the aspect ratio of the convection (width to height of convection roll) by maximizing Nusselt number as a function of aspect ratio.
Consider the boundary layer to be very thin above the upwelling plume (left side). As it moves to the right, it cools and the depth to any particular isotherm increases (this is clearly seen in the simulation). This can be treated exactly like a one dimensional problem if we work in the Lagrangian frame of reference attached to the boundary layer. That is, take the 1D half-space cooling model and replace the time with \(x_1/v\) (cf. the advection equation in which time and velocity / lengths are mixed).
The standard solution is as follows. Assume a half-space at an intial temperature everywhere of \(T_0\) to which a boundary condition, \(T=T_s\) is applied at \(t=0,x_2=0\).
We solve for \(T(x_2,t)\) by first making a substitution,
which is a dimensionless temperature, into the standard diffusion equation to obtain
The boundary conditions on \(\theta\) are simple:

Fig. 13 Cooling half-space calculation¶
In place of \(t,x_2\), we use the similarity transformation,
which is found (more or less) intuitively. Now we need to substitute
to transform ([eq:difftheta]{reference-type=”ref” reference=”eq:difftheta”}) into
Boundary conditions transform to give
Write \(\phi = d\theta / d\eta\) (for convenience only) to rewrite ([eq:diffode]{reference-type=”ref” reference=”eq:diffode”}) as
This is a standard integral with solution
This latter form is then integrated to give the solution:
Boundary conditions give
Which is the definition of the complementary error function (\(\mbox{\rm erfc}(\eta)\)).
Undoing the remaining substitutions gives
In our original context of the cooling boundary layer, then, the \(T_ s\) is the surface temperature, \(T_0\) is the interior temperature of the convection cell (\(\Delta T /2\)) and \(t \leftarrow x_1/v\). The thickness of the boundary layer is found by assuming it is defined by a characteristic isotherm (doesn’t much matter which). The progression of this isotherm is
or, in the Eulerian frame
Internal Heating¶
The definition of the Rayleigh number when the layer is significantly internally heated is
where \(\sigma\) is a stress, \(d\) is grain size, \(E\) is an activation energy, \(V\) is an activation volume, and \(T\) is absolute temperature. (\(R\) is the universal gas constant).
This translates to a viscosity
In the mantle two forms of creep are possible, dislocation creep with \(n ~ 3.0\), \(m~0\), \(E ~ 430-540 KJ/mol\), \(V ~ 10 - 20 cm^3/mol\); and diffusion creep with \(n ~ 1.0\), \(m~2.5\), \(E ~ 240-300 KJ/mol\), \(V ~ 5-6 cm^3/mol\). This is for olivine — other minerals will produce different results, of course.
Convection with Temperature Dependent Viscosity¶
More sophisticated models included the effect of temperature dependent viscosity as a step towards more realistic simulations. In fact, the opposite was observed: convection with temperature dependent viscosity is a much worse description of the oceanic lithosphere than constant viscosity convection. It may, however, describe Venus rather well.
Theoretical studies of the asymptotic limit of convection in which the viscosity variation becomes very large (comparable to values determined for mantle rocks in laboratory experiments) find that the upper surface becomes entirely stagnant with little or no observable motion. Vigorous convection continues underneath the stagnant layer with very little surface manifestation.
This theoretical work demonstrates that the numerical simulations are producing correct results, and suggests that we should look for physics beyond pure viscous flow in explaining plate motions.
Non-linear Viscosity and localisation¶
Realistic rheological laws show the viscosity may depend upon stress. This makes the problem non-linear since the stress clearly depends upon viscosity. In order to obtain a solution it is necessary to iterate velocity and viscosity until they no longer change.
The obvious association of plate boundaries with earthquake activity suggests that relevant effects are to be found in the brittle nature of the cold plates. Brittle materials have a finite strength and if they are stressed beyond that point they break. This is a familiar enough property of everyday materials, but rocks in the lithosphere are non-uniform, subject to great confining pressures and high temperatures, and they deform over extremely long periods of time. This makes it difficult to know how to apply laboratory results for rock breakage experiments to simulations of the plates.
An ideal, very general rheological model for the brittle lithosphere would incorporate the effects due to small-scale cracks, faults, ductile shear localization due to dynamic recrystalization, anisotropy (… kitchen sink). Needless to say, most attempts to date to account for the brittle nature of the plates have greatly simplified the picture. Some models have imposed weak zones which represent plate boundaries, others have included sharp discontinuities which represent the plate-bounding faults, still others have used continuum methods in which the yield properties of the lithosphere are known but not the geometry of any breaks. Of these approaches, the continuum approach is best able to demonstrate the spectrum of behaviours as convection in the mantle interacts with brittle lithospheric plates. For studying the evolution of individual plate boundaries methods which explicitly include discontinuities work best.
The simplest possible continuum formulation includes a yield stress expressed as an non-linear effective viscosity.
This formulation can be incorporated very easily into the mantle dynamics modeling approach that we have outlined above as it involves making modifications only to the viscosity law. There may be some numerical difficulties, however, as the strongly non-linear rheology can lead to dramatic variations in the viscosity across relatively narrow zones.
Thermochemical Convection¶
The Rayleigh number is defined in terms of thermal buoyancy but other sources of buoyancy are possible in fluids.
For example, in the oceans, dissolved salt makes water heavy. When hot salty water (e.g. the outflows of shallow seas such as the Mediterranean) mixes with cold less salty water, there is a complicated interaction.
This is double diffusive convection and produces remarkable layerings etc since the diffusion coefficients of salt and heat are different by a factor of 100.
In the mantle, bulk chemical differences due to subduction of crustal material can be treated in a similar manner. From the point of view of the diffusion equations, the diffusivity of bulk chemistry in the mantle is tiny (pure advection).
Fluid flows with chemical v. thermal bouyancy are termed thermochemical convection problems.