Click here to view G.U. B.Sc. 3rd yr. examination questions on statistical thermodynamics & data analysis

A Brief Introduction to Statistical Thermodynamics
A Web-Book by Rituraj Kalita, Dept. of Chemistry, Cotton College, Guwahati-781001 (Assam, India)
Preface   Ch. 1   Ch. 2   Ch. 3   Ch. 4   Ch. 5   Ch. 6   Ch. 7   Bibliog.
     Background topics/ vocabulary           General topics           Advanced (avoidable) topics
© 2006. Copyright reserved. The book or any portion of it can't be reproduced/ re-published/ circulated.

Ch. 2: Distributions, Boltzmann Distribution Law, Molecular Partition Function and Expressions for the Thermodynamic Functions
 

2.1 Distributions, Most Probable & Average Distributions, and Distribution Laws:

The ideal (or practically ideal) pure gas system we are mostly interested in (in this book) consists of a set of indistinguishable molecules, and so must be either Bosons obeying B-E statistics or Fermions obeying F-D statistics. However ideal or nearly-ideal gases always obey the restriction gl >> Nl, for which (as under that condition gl + Nl gl Nl gl)
either
[(gl + Nl – 1)!/{( gl – 1)!.Nl!}]  = [{(gl + Nl – 1).(gl + Nl – 2)......(gl + 1).gl}/Nl!]
or
[gl ! / {( gl Nl )! . Nl!}] =
[{gl .(gl – 1).(gl – 2)........(gl Nl +1)}/Nl!]
simply equals / Nl!
So, either Pl [(gl+Nl–1)!/{(gl–1)!.Nl!}] or Pl [gl!/{(glNl)!.Nl!}] equals Pl /Nl! giving Wd = Pl /Nl! which is the characteristic relation
corrected-Boltzmann statistics. This means that ideal or nearly-ideal pure gas systems always obey this statistics.

For any macroscopic system in a given macrostate, there are many possible distributions consistent with the constancy requirements characteristic of a given macrostate, e.g., the constancy requirement Sl Nl = N for the total number N and the requirement Sl Nl el = U for the total molecular energy U (equal to the system internal energy). However these possible distributions have differing probabilities of being observed (depending, obviously, on their pd = Wd/W values). The m. p. distribution has the highest individual probability of being observed. Next to this, (remember we're talking about macroscopic systems) it is only the distributions very similar to the m. p. distribution that have appreciable probabilities of being observed; the distributions significantly differing from the m. p. distribution have so miniscule probabilities of being observed that they are never observed in any observation. [people state this as: the Wd and the pd functions are sharply peaked around the m.p. distribution] So, for macroscopic systems, we say that it is practically the m. p. distribution that is always observed! Averaging over all possible distributions (obviously weighing as per their respective probabilities) and noting that the odd type of distributions with negligible probabilities hardly contribute to this average, we immediately recognize that the average distribution is also practically the m. p. distribution!

A law that gives (for a macroscopic system) the m. p. distribution (and so also gives both the usually observed distribution-set and the average distribution, these three distributions being practically the same as discussed above), is called a distribution law. As a distribution is characterized simply by the populations of all the molecular levels, so a distribution law is naturally an expression for the population Nl of the (arbitrary) molecular level l. An example is the well known Boltzmann distribution law, which will be discussed in detail in the next section.
 

2.2 The Boltzmann Distribution Law and One of Its Derivations:

The Boltzmann distribution law is the most popular distribution law, valid for both Maxwell-Boltzmann and corrected-Boltzmann systems, dictating populations in the m. p. (or in the average) distributions. According to this law, the population Nl of the l-th level (while in the m. p. distribution, which is denoted by *) is given as: Nl* = (N/q) gl exp{(-el / (kT)} [where q is the sum Sl gl exp {-el /(kT)}, referred to as the molecular partition function]. As for a macroscopic system, it is practically the m. p. distribution which is always observed,  the * sign may as well be withdrawn (it is OK if Nl* >> 1) giving Nl = (N/q) gl exp{(-el / (kT)}

An obvious corollary of this law is about the ratio of populations in two levels l΄ & l as:
Nl΄ / Nl  =  (gl΄ / gl) exp{-(el΄ - el)/(kT )}    
[N & q being same for any of the levels, cancels off].
Though derived (in this chapter) for complete molecular levels, both the law and its corollary can be successfully applied for specific-mode energies (e.g., rotational, vibrational etc.) of the molecules, to give populations or population-ratios in specific-mode energy levels (irrespective of other specific-mode energy levels occupied) [see section 3.6].

The thermodynamic probability Wd of the distribution d of molecules in an ideal gas is given  (as per the Corrected Boltzmann Statistics relation) as: Wd = Pl / Nl!
where Nl is the occupation number (population) in the l-th level, gl is its degeneracy (statistical weight), and the product extends over all molecular energy-levels.

The Boltzmann distribution is the m. p. (most probable) distribution, and so for it the probability and the thermodynamic probability are the maximum. However, the gaseous isolated (or just in a definite macrostate) system considered must obey its two obvious restrictions: Sl Nl = N, a constant, and Sl el Nl = U, another constant. So, Lagrange method of undetermined multipliers (for maximization of a function under one or more constraints) must be used during maximization. Now, the two restrictions in the differential form are:
S
l dNl = 0 and Sl el dNl = 0.  For mathematical simplicity, lnWd instead of Wd is maximized (when Wd is maximum, lnWd is also maximum, so there is no problem about that).
So, for the m.p. distribution, Lagrange method1 gives: 
d ln W* + a Sl dNl* b Sl el dNl* = 0  for any set of values of dNl*
(where a and b are the Lagrange undetermined multipliers)
As ln W* = Sl (Nl* ln gl - ln Nl!* ) = Sl (Nl* ln gl - Nl* ln Nl* + Nl*) using the Stirling2 approximation (according to which, for a large natural number y, ln y! = y ln y y)
we get,  d ln W* = Sl (ln gl dNl* - Nl* (1/ Nl*) dNl*  - ln Nl* dNl* + dNl*)
= Sl ln (gl / Nl*) dNl*
So Lagrange method gives:
S
l ln (gl / Nl*) dNl + a Sl dNl* - b Sl el dNl* = 0   (for any set of dNl*)
or, Sl [ ln (gl / Nl*) + a - b el ] dNl* = 0    (for any set of dNl*)

As this relation is true for any set of dNl*, so every coefficient of dNl* is zero3. This means,
ln (gl / Nl*) + a - b e= 0. This gives ln (gl / Nl*) = -a + b e
orgl / Nl* = exp (-a + b el)
or,  Nl* = gl exp (a) exp (-b el)
Summing over both sides for all energy levels and noting that Sl Nl* = N, the total number of molecules, we get
exp (a) = N / Sl [ gl exp (-b el)]. Noting that b could be found to equal 1/(kT), [here k is the Boltzmann constant, and T is the thermodynamic, i.e., absolute-scale temperature; see section 3.9 for a verification of this relation], the Boltzmann distribution law is thus4
        Nl* = (N/q) gl exp {-el /(kT)}  
where the molecular partition function q is defined by q = Sl gl exp {-el /(kT)}
(The summation here is over all the molecular levels l).

However, for greatly occupied levels in any macroscopic system, any observed occupation number Nl is very close to the occupation number Nl* in the m. p. distribution. So, the Boltzmann distribution law may be written in a simplified way as: Nl = (N/q) gl exp (-el/ kT)

Notes: (1) The Lagrange method of undetermined multipliers, useful for maximization of a multi-argument function under inter-argument constraint(s), may be applied for maximization of the function f (y,z) = yz under the constraint y + z = 2 in the following way: Constraint is (dy + dz) = 0. Differential of function is df = y dz + z dy. So, df + l(dy+dz) = 0 => (y+l)dz + (z+l)dy = 0 (for any set of values of dy & dz). So y+l = z+l = 0 => y = z = l => –2l = 2 => l = –1  and so y = z = 1, and maximum value of f = yz is 1.1 = 1. (Here l is a Lagrange multiplier) Cross-check it by maximizing f = y(2–y) in the straight way of making df/dy = 0. (2) Using a scientific calculator, check the validity of the Stirling approximation by finding, say, 63!, then finding ln63!, and then comparing this result with 63.ln6363 (3) If ax+by+cz = 0 is true for any value of the variables x, y, z; then the coefficients a, b and c must be all zero. Isn't it so? Just think about it a little!
(4) This statement of the distribution law is in terms of the m. p. distribution. Another version exists that talks about the average occupation number of a level <Nl>, averaged over all possible distributions for the system (averaged, taking care of the differing probabilities pd of the possible distributions), and finally states: <Nl> = (N/q) gl exp (-el / kT).    (i.e., the average population <Nl> is just the same as the m. p. population Nl*). This version may be stated also in terms of average occupation number <Ns> of a molecular state s as: <Ns> = (N/q) exp(-es/kT). [The per-level form is obtainable just as a corollary of this per-state form (how?).]
(5) Calculated as (N/q) gl exp (-el / kT), this Nl* is obviously, in general, a fraction! So, the m. p. distribution we're calculating is, exactly speaking, a hypothetical m. p. distribution (with fractional populations - an obvious absurdity) closest to the actual m. p. one. (When Nl* is a large number, this error is negligible; we'll always overlook this!)
 

2.3 Functional Form and Significance of the Molecular Partition Function:

The molecular partition function q expressed as a sum of terms over levels or over states (also denoted by the letter z, from the original German name Zustandsumme meaning 'sum over states') is a dimensionless quantity (pure number) of a central importance in corrected-Boltzmann and Maxwell-Boltzmann systems, as most of the thermodynamic functions (e.g., U, P, S) of such systems can be expressed in terms of q (and their value obtained from the knowledge of q). In addition to the sum over levels form
q = Sl gl exp {-el /(kT)} (as previously mentioned), q may also be expressed
in a sum over states form q = Ss exp{-es /(kT)} (where the summation is over all the molecular states s, es being the energy of the molecular state s). The molecular p. f. is a function of only the system temperature T and the system volume V i.e., q = q(T,V) (it is not a function of the number N of constituent molecules). This function is of the simple form
q = V.f (T), where f (T) is a function of temperature T only.

The numerical value of q indicates the pattern of distribution of molecules among the different levels. Generally the energies of the levels are measured with the ground-level energy taken as zero. Under this popular convention, we find that at very low temperatures, q is small and the upper levels are hardly occupied. When the temperature is raised, q increases and the upper levels start getting occupied. When no level other than the ground one is occupied, q has the lowest possible value g0, which is the degeneracy of the ground molecular level (this lowest value of q is exactly encountered at the absolute zero of temperature i.e., at T = 0, though this value may be practically encountered also at somewhat higher temperatures).
 

2.4 An ST-Form of 1st Law of Thermodynamics (for Independent-Particle Systems):

This 1st law tells us that dU = dq + dw, where U is the system (internal) energy, dq is the heat absorbed by the system and dw is the work done on the system. On the other hand, for a system of independent (non-interacting) particles, we have (from U = Sl Nlel) that dU = Sl Nl del  + Sl el dNl . Now, let us consider a situation in which no work is done (dw = 0) but some heat is absorbed (i.e., dU = dq + 0). As no work is done, the volume and/or surface area etc. haven't changed, and so the molecular energy levels remains constant (as known from quantum mechanical considerations). So the changes del in the energy levels el are zero, and so dU = Sl el dNl + 0. This gives the value of dq as: dq = Sl el dNl , and from which it is obvious that dw = dU dq = Sl Nl del.

Thus we get two generalized relations dq = Sl eldNl and dw = Sl Nldel with dU = dq+dw. These relations give the molecular interpretations of heat and work, according to which (for independent-molecule systems): Heat absorbed by the system gets expressed through changes only in the populations of the molecular energy levels; work done on the system gets expressed in changes only in the energies of the molecular energy levels.
 

2.5 Expressions for the Thermodynamic Functions in Terms of q (for Ideal Gases):

2.5.1 Expression for system's internal energy, U:
The defining relation of q is:
q = Sl gl exp {-el /(kT)}. Differentiating q w.r.t. T at constant volume V, and noting (from quantum-mechanics considerations) that at constant volume the degeneracies gl and energies el are constants, we get
(∂q
/∂T)V = Sl gl exp {-el /(kT)}.(-el /k).(-1/T2 )
or, Sl gl el exp {-el /(kT)} = k T2
(∂q/∂T)V
Now, as U =
 Sl Nl el
Applying the Boltzmann distribution law Nl
= (N/q) gl exp {-el /(kT)}, we get
U
=  Sl Nl e = (N/q) Sl gl el exp {-el / (kT)} = (N/q) k T2 (∂q/∂T)V = NkT2 (∂ ln q/∂T)V
So we have this expression for the internal energy U, U = NkT
2 (∂ ln q/∂T)V

2.5.2 Expression for pressure, P:
Using the expression dw =
Sl Nl del  for work done on the system, and noting that for a simple system with only P-V work allowed dw equals (–P dV), we get  –PdV = Sl Nl del
Applying the Boltzmann distribution law Nl
= (N/q) gl exp (-el / kT) for Nl, we get:
 
–PdV = (N/q) Sl gl exp (-el / kT) del
or,
–P = (N/q) Sl gl exp (-el / kT) (del/dV)       or,  Sl gl exp (-el / kT) (del/dV) = –qP/N
Differentiating the expression for q w.r.t. V at constant T, and noting that
(∂el/∂V)T is same as (del/dV), as the molecular level energy-value el (it is dictated by quantum mechanics alone) is independent of temperature T, we get (∂q/∂V)T = Sl gl exp (-el / kT) (∂el/∂V)T.{-1/( kT)}
or, Sl gl exp (-el / kT) (del
/dV) = -kT.(∂q/∂V)T
This gives, –qP/N =
-kT.(∂q/∂V)T , or, P =  (NkT/q).(∂q/∂V)T = NkT (∂ ln q/∂V)T
So we have this expression for pressure, P = NkT (∂ ln q/∂V)T 

2.5.3 Expression for entropy, S:
Using relations dS = dqrev/T and dq =
Sl el dNl* (as the system practically exists in the m.p. distribution) , and noting that for a system of non-reacting pure substance, dq = dqrev , we have:
dS = (1/T).
Sl el dNl* = k Sl (bel).dNl*              [as 1/T = kb]
For the corrected-Boltzmann system of indistinguishable particles, ln (gl / Nl*) = -a + b e
or, bel = ln (gl / Nl*) + a 
So, dS = k Sl {ln (gl /Nl*) + a}.dNl*
 = k Sl ln (gl /Nl).dNl* + k Sl a dNl*
 = k Sl ln (gl /Nl).dNl* + ka Sl dNl* = k Sl ln (gl /Nl*).dNl* + ka dN       [as Sl Nl* = N]
But Sl ln (gl /Nl*) was nothing other than dW*, while for the system in a given macrostate (or for an isolated system) N is constant i.e., dN is zero. This means that dS = k (d lnW*)
giving S = k lnW*.
Note: Mathematics-experts among you may surely utter: where's the constant of integration C [while integrating from dS = k (d lnW*)]? Well, 'it can be shown' that here C has to be zero. [I like to talk about such situations as: whatever can't be shown (by the author, or right now) is stated as 'it can be shown'!]

This approximate form of Boltzmann equation S = k.lnW*, in contrast to the exact Boltzmann equation S = k ln W,  gives an mathematically easy way to calculate the entropy S as:
S = k lnW* = k Sl (Nl* ln gl - ln Nl* ! )
= k Sl (Nl* ln gl - Nl* ln Nl* + Nl*)  = k Sl Nl* ln (gl /Nl*) + k Sj Nl
= k Sl Nl* ln (gl /Nl*) + k N
As  
 Nl* = (N/q) gl exp (-el / kT)      So, gl /Nl* = (q/N).exp(el / kT)
Or,   ln (gl /Nl*) = ln (q/N).+ el / kT
So,   S = k Sl Nl* {ln (q/N).+ el / kT} + k N
= k ln (q/N) Sl Nl* + Sl Nl*el / T + k N = Nk ln(q/N) + U/ T + Nk
So the expression for entropy (for corrected Boltzmann system of indistinguishable particles, e.g., gases) in terms of q, N, T and V is [using the above expression for U]:
S =
Nk ln(q/N) + U/T + Nk  = Nk ln(q/N) + NkT (d lnq/dT)V + Nk 
Note: The two expressions k lnW* and k lnW for S give practically the same value for S simply because both W* and W are so extremely large numbers that the difference between lnW* and lnW is negligible in comparison with either lnW* or lnW, even though W may be several times larger than W* [to get an idea of the situation, try comparing values of, say, ln 1023 and ln (5x1023). W  is unthinkably larger than 1023.] This means we've verified the Bolzmann equation, starting from the thermodynamic definition of entropy!

For the proper M-B system of identical distinguishable particles (e.g., normal modes of vibration in solid as per Einstein theory), the relation S = k lnW* is similarly obtained. However, because of an additional N! factor in the expression of W*, the additional term (k.lnN! = Nk.lnN Nk) appears in S, giving S = (Nk.lnN Nk) + Nk ln(q/N) + U/ T + kN
So the expression for entropy for Maxwell-Boltzmann system of distinguishable particles is
S = Nk ln q + U/T = Nk ln q + NkT (d lnq/dT)V
Note: Don't forget to use only the earlier expression (the blue one) for entropy of gases (not this one!).

2.5.4 Expression for the Helmholtz free energy, A:
Using above relations for U & S, an expression for the Helmholtz Free Energy A (where A = U – TS) for gas-systems may be immediately obtained as: A = U – TS =
U – T
{Nk ln(q/N) + U/T + Nk}  = –NkT ln(q/N) – NkT   i.e., A =  –NkT ln(q/N) – NkT
Note: This expression for A is, obviously, not valid for a system of distinguishable particles.

2.5.5 Expression for the chemical potential per molecule, u:
In statistical thermodynamics we come across frequent references to the chemical potential per molecule in contrast to the chemical potential per mole (m) popular in classical thermodynamics. To distinguish this quantity from m, this author prefers using a different symbol u to indicate this molecular chemical potential. Just as m is defined mathematically by m = (∂A/∂n)V,T = (∂G/∂n)P,T , u is also defined as u = (∂A/∂N)V,T = (∂G/∂N)P,T [obviously, noting that N = n NA, this means that m = u NA (NA is the Avogadro number)].

From the above expression for A we get:
       A =
–NkT ln q + NkT ln N – NkT
So,  u =
 (dA/dN)V,T = –kT ln q + kT ln N + NkT (1/N) – kT
=
–kT ln q + kT ln N + kT – kT = –kT ln (q/N). So, m = uNA –RT ln(q/N)  [as kNA = R]
So we have, 
u = –kT ln (q/N) and m –RT ln(q/N)
Note: These expressions for u and m are, obviously, not valid for a system of distinguishable particles.