Partition function (statistical mechanics)

Revision as of 14:54, 19 November 2016

In physics, a partition function describes the statistical properties of a system in thermodynamic equilibrium. Partition functions are functions of the thermodynamic state variables, such as the temperature and volume. Most of the aggregate thermodynamic variables of the system, such as the total energy, free energy, entropy, and pressure, can be expressed in terms of the partition function or its derivatives.

Each partition function is constructed to represent a particular statistical ensemble (which, in turn, corresponds to a particular free energy). The most common statistical ensembles have named partition functions. The canonical partition function applies to a canonical ensemble, in which the system is allowed to exchange heat with the environment at fixed temperature, volume, and number of particles. The grand canonical partition function applies to a grand canonical ensemble, in which the system can exchange both heat and particles with the environment, at fixed temperature, volume, and chemical potential. Other types of partition functions can be defined for different circumstances; see partition function (mathematics) for generalizations. The partition function has many physical meanings, as discussed in Meaning and significance.

Canonical partition function

Definition

Assume that a thermodynamically large system is in thermal contact with the environment at temperature T, and that both the volume of the system and the number of constituent particles are fixed. Such a system is described by the canonical ensemble. Let us label with s = 1, 2, 3, ... the exact states (microstates) that the system can occupy, and denote the total energy of the system when it is in microstate s as E_s. Generally, these microstates can be regarded as analogous to discrete quantum states of the system.

The canonical partition function is

Z=\sum _{s}\mathrm {e} ^{-\beta E_{s}},

where the "inverse temperature", β, is conventionally defined as

\beta \equiv {\frac {1}{k_{B}T}}

with k_B denoting Boltzmann's constant. The exponential factor exp(−βE_s) is known as the Boltzmann factor.
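A minimal numerical sketch of this definition, assuming a hypothetical three-state system whose energies are chosen purely for illustration (the function name partition_function is not from the source):

```python
import math

K_B = 1.380649e-23  # Boltzmann constant in J/K (exact in the 2019 SI)


def partition_function(energies, temperature):
    """Return Z = sum_s exp(-beta * E_s) for microstate energies given in joules."""
    beta = 1.0 / (K_B * temperature)  # inverse temperature
    return sum(math.exp(-beta * e) for e in energies)


# Hypothetical energies 0, 1 and 2 in units of k_B * (300 K).
unit = K_B * 300.0
Z = partition_function([0.0, unit, 2.0 * unit], temperature=300.0)
print(Z)  # equals 1 + e^-1 + e^-2 up to rounding
```

Each term in the sum is one Boltzmann factor, so states far above k_B T in energy contribute very little to Z.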

In systems with multiple quantum states s sharing the same E_s, it is said that the energy levels of the system are degenerate. In the case of degenerate energy levels, we can write the partition function in terms of the contribution from energy levels (indexed by j) as follows:

Z=\sum _{j}g_{j}\cdot \mathrm {e} ^{-\beta E_{j}},

where g_j is the degeneracy factor, or number of quantum states s which have the same energy level defined by E_j = E_s.
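The equivalence of the sum over states and the sum over levels can be checked with a short sketch. The state energies below are hypothetical and use units in which k_B = 1; the helper names are illustrative only.

```python
import math
from collections import Counter


def z_from_states(state_energies, beta):
    """Sum one Boltzmann factor per microstate."""
    return sum(math.exp(-beta * e) for e in state_energies)


def z_from_levels(state_energies, beta):
    """Group microstates into levels E_j with degeneracy g_j, then sum g_j * exp(-beta * E_j)."""
    degeneracy = Counter(state_energies)  # maps each distinct E_j to its g_j
    return sum(g * math.exp(-beta * e) for e, g in degeneracy.items())


# Hypothetical spectrum: one state at 0, three at 1.0, two at 2.5.
state_energies = [0.0, 1.0, 1.0, 1.0, 2.5, 2.5]
beta = 0.7
print(z_from_states(state_energies, beta), z_from_levels(state_energies, beta))
```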

The above treatment applies to quantum statistical mechanics, where a physical system inside a finite-sized box will typically have a discrete set of energy eigenstates, which we can use as the states s above. In classical statistical mechanics, it is not really correct to express the partition function as a sum of discrete terms, as we have done. In classical mechanics, the position and momentum variables of a particle can vary continuously, so the set of microstates is actually uncountable. In this case we must describe the partition function using an integral rather than a sum. For instance, the partition function of a gas of N identical classical particles is

Z={\frac {1}{N!h^{3N}}}\int \,\exp[-\beta H(p_{1}\cdots p_{N},x_{1}\cdots x_{N})]\;d^{3}p_{1}\cdots d^{3}p_{N}\,d^{3}x_{1}\cdots d^{3}x_{N}

where

p_i indicate particle momenta

x_i indicate particle positions

d³ is a shorthand notation serving as a reminder that the p_i and x_i are vectors in three-dimensional space, and

H is the classical Hamiltonian.

The reason for the factorial factor N! is discussed below. The constant factor h^{3N} in the denominator is included because, unlike the discrete form, the integral shown above is not dimensionless. Dividing it by h^{3N}, where h is some quantity with units of action (usually taken to be Planck's constant), makes Z a dimensionless quantity. For simplicity, we will use the discrete form of the partition function in this article; the results apply equally well to the continuous form.
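As a brief worked example of the integral form, consider the standard case of an ideal monatomic gas, assuming the Hamiltonian contains only kinetic energy, $H=\sum _{i}p_{i}^{2}/2m$ , and that the particles are confined to a volume V (the symbols m, V and λ_th are introduced here for illustration). Each position integral contributes a factor V, and each momentum integral is a Gaussian:

\int \exp \left(-{\frac {\beta p^{2}}{2m}}\right)d^{3}p=\left({\frac {2\pi m}{\beta }}\right)^{3/2}

so that

Z={\frac {V^{N}}{N!\,h^{3N}}}\left({\frac {2\pi m}{\beta }}\right)^{3N/2}={\frac {1}{N!}}\left({\frac {V}{\lambda _{\rm {th}}^{3}}}\right)^{N},\qquad \lambda _{\rm {th}}\equiv {\frac {h}{\sqrt {2\pi mk_{B}T}}}

where λ_th is the thermal de Broglie wavelength. The result is dimensionless, as required.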

In quantum mechanics, the partition function can be more formally written as a trace over the state space (which is independent of the choice of basis):

Z=\operatorname {tr} (\mathrm {e} ^{-\beta {\hat {H}}}),

where Ĥ is the quantum Hamiltonian operator. The exponential of an operator can be defined using the exponential power series. The classical form of Z is recovered when the trace is expressed in terms of coherent states ^[1] and when quantum-mechanical uncertainties in the position and momentum of a particle are regarded as negligible. Formally, one inserts under the trace for each degree of freedom the identity:

{\boldsymbol {1}}=\int |x,p\rangle \,\langle x,p|~{\frac {dx\,dp}{h}}

where |x, p⟩ is a normalised Gaussian wavepacket centered at position x and momentum p. Thus,

Z=\int \operatorname {tr} \left(\mathrm {e} ^{-\beta {\hat {H}}}|x,p\rangle \,\langle x,p|\right){\frac {dx\,dp}{h}}=\int \langle x,p|\mathrm {e} ^{-\beta {\hat {H}}}|x,p\rangle ~{\frac {dx\,dp}{h}}

A coherent state is an approximate eigenstate of both operators ${\hat {x}}$ and ${\hat {p}}$ , hence also of the Hamiltonian Ĥ, with errors of the size of the uncertainties. If Δx and Δp can be regarded as zero, the action of Ĥ reduces to multiplication by the classical Hamiltonian, and Z reduces to the classical configuration integral.

Consider a system S embedded into a heat bath B. Let the total energy of both systems be E. Let p_i denote the probability that the system S is in microstate i with energy E_i. According to the fundamental postulate of statistical mechanics (which states that all attainable microstates of a system are equally probable), the probability p_i will be proportional to the number of microstates in the total closed system where S is in microstate i with energy E_i. Equivalently, p_i will be proportional to the number of microstates of the heat bath B with energy E - E_i:

${\begin{aligned}p_{i}\propto \Omega \left(E-E_{i}\right)\\\end{aligned}}$

The number of microstates of the heat bath at a given energy E is denoted by Ω(E). Assuming that the heat bath's internal energy is much larger than the energy of S (E ≫ E_i), we can Taylor expand ln Ω to first order in E_i and use the thermodynamic relation $\partial S_{B}/\partial E=1/T$ , where $S_{B}=k_{B}\ln \Omega$ is the entropy of the bath:

${\begin{aligned}k_{B}\ln \Omega \left(E-E_{i}\right)&\approx k_{B}\ln \Omega \left(E\right)-{\frac {\partial \left(k_{B}\ln \Omega \left(E\right)\right)}{\partial E}}E_{i}\\&=k_{B}\ln \Omega \left(E\right)-{\frac {\partial S_{B}}{\partial E}}E_{i}\\&=k_{B}\ln \Omega \left(E\right)-{\frac {E_{i}}{T}}\\\Rightarrow \Omega \left(E-E_{i}\right)&\approx \Omega \left(E\right)e^{-{\frac {E_{i}}{k_{B}T}}}\\\Rightarrow p_{i}&\propto e^{-{\frac {E_{i}}{k_{B}T}}}.\end{aligned}}$

The last step uses the fact that Ω(E) does not depend on i, so it can be absorbed into the constant of proportionality.

Since the total probability to find the system in some microstate (the sum of all p_i) must be equal to 1, we can define the partition function as the normalisation constant:

${\begin{aligned}Z&:=\sum _{i}e^{-\beta E_{i}}\end{aligned}}$

Meaning and significance

It may not be obvious why the partition function, as we have defined it above, is an important quantity. First, let us consider what goes into it. The partition function is a function of the temperature T and the microstate energies E₁, E₂, E₃, etc. The microstate energies are determined by other thermodynamic variables, such as the number of particles and the volume, as well as microscopic quantities like the mass of the constituent particles. This dependence on microscopic variables is the central point of statistical mechanics. With a model of the microscopic constituents of a system, one can calculate the microstate energies, and thus the partition function, which will then allow us to calculate all the other thermodynamic properties of the system.

The partition function can be related to thermodynamic properties because it has a very important statistical meaning. The probability P_s that the system occupies microstate s is

P_{s}={\frac {1}{Z}}\mathrm {e} ^{-\beta E_{s}}.

Thus, as shown above, the partition function plays the role of a normalizing constant (note that it does not depend on s), ensuring that the probabilities sum up to one:

\sum _{s}P_{s}={\frac {1}{Z}}\sum _{s}\mathrm {e} ^{-\beta E_{s}}={\frac {1}{Z}}Z=1.

This is the reason for calling Z the "partition function": it encodes how the probabilities are partitioned among the different microstates, based on their individual energies. The letter Z stands for the German word Zustandssumme, "sum over states". The usefulness of the partition function stems from the fact that it can be used to relate macroscopic thermodynamic quantities to the microscopic details of a system through the derivatives of its partition function.
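The normalizing role of Z can be illustrated with a small sketch, assuming a hypothetical three-level system in units where k_B = 1 (the function name is illustrative):

```python
import math


def boltzmann_probabilities(energies, beta):
    """Return P_s = exp(-beta * E_s) / Z for each microstate."""
    weights = [math.exp(-beta * e) for e in energies]
    z = sum(weights)  # the partition function normalizes the weights
    return [w / z for w in weights]


probs = boltzmann_probabilities([0.0, 1.0, 2.0], beta=1.0)
print(probs, sum(probs))
```

Lower-energy microstates receive larger probabilities, and the ratio of any two probabilities depends only on their energy difference.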

Calculating the thermodynamic total energy

In order to demonstrate the usefulness of the partition function, let us calculate the thermodynamic value of the total energy. This is simply the expected value, or ensemble average for the energy, which is the sum of the microstate energies weighted by their probabilities:

\langle E\rangle =\sum _{s}E_{s}P_{s}={\frac {1}{Z}}\sum _{s}E_{s}e^{-\beta E_{s}}=-{\frac {1}{Z}}{\frac {\partial }{\partial \beta }}Z(\beta ,E_{1},E_{2},\cdots )=-{\frac {\partial \ln Z}{\partial \beta }}

or, equivalently,

\langle E\rangle =k_{B}T^{2}{\frac {\partial \ln Z}{\partial T}}.
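The identity between the weighted average and the derivative of ln Z can be checked numerically. The sketch below assumes a hypothetical spectrum in units where k_B = 1 and approximates the derivative with a central finite difference; it is an illustration, not a recommended way to compute energies in practice.

```python
import math


def log_z(energies, beta):
    return math.log(sum(math.exp(-beta * e) for e in energies))


def mean_energy_direct(energies, beta):
    """Probability-weighted average of the microstate energies."""
    weights = [math.exp(-beta * e) for e in energies]
    return sum(e * w for e, w in zip(energies, weights)) / sum(weights)


def mean_energy_from_log_z(energies, beta, h=1e-5):
    """Approximate -d(ln Z)/d(beta) by a central difference with step h."""
    return -(log_z(energies, beta + h) - log_z(energies, beta - h)) / (2 * h)


energies = [0.0, 0.3, 1.2, 2.0]  # hypothetical
beta = 1.1
print(mean_energy_direct(energies, beta), mean_energy_from_log_z(energies, beta))
```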

Incidentally, one should note that if the microstate energies depend on a parameter λ in the manner

E_{s}=E_{s}^{(0)}+\lambda A_{s}\qquad {\mbox{for all}}\;s

then the expected value of A is

\langle A\rangle =\sum _{s}A_{s}P_{s}=-{\frac {1}{\beta }}{\frac {\partial }{\partial \lambda }}\ln Z(\beta ,\lambda ).

This provides us with a method for calculating the expected values of many microscopic quantities. We add the quantity artificially to the microstate energies (or, in the language of quantum mechanics, to the Hamiltonian), calculate the new partition function and expected value, and then set λ to zero in the final expression. This is analogous to the source field method used in the path integral formulation of quantum field theory.
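A sketch of this procedure, assuming hypothetical values for the unperturbed energies E_s^(0) and for the quantity A_s, in units where k_B = 1. The derivative with respect to λ is taken by a central difference around λ = 0.

```python
import math


def log_z(e0, a, beta, lam):
    """ln Z for energies E_s = E0_s + lam * A_s."""
    return math.log(sum(math.exp(-beta * (e + lam * x)) for e, x in zip(e0, a)))


def average_direct(e0, a, beta):
    """<A> computed directly from the Boltzmann probabilities at lam = 0."""
    weights = [math.exp(-beta * e) for e in e0]
    return sum(x * w for x, w in zip(a, weights)) / sum(weights)


def average_from_source(e0, a, beta, h=1e-5):
    """<A> = -(1/beta) * d(ln Z)/d(lam), evaluated at lam = 0."""
    derivative = (log_z(e0, a, beta, h) - log_z(e0, a, beta, -h)) / (2 * h)
    return -derivative / beta


e0 = [0.0, 1.0, 2.0]   # hypothetical unperturbed energies
a = [1.0, -1.0, 0.5]   # hypothetical values of the observable A in each state
beta = 0.8
print(average_direct(e0, a, beta), average_from_source(e0, a, beta))
```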

Derivation

There are multiple approaches to deriving the partition function. The following derivation follows the information-theoretic maximum entropy approach of Jaynes.

According to the second law of thermodynamics, a system assumes a configuration of maximum entropy at thermodynamic equilibrium. We seek to find a probability distribution of states $p_{i}$ which maximizes the entropy

S=-k_{B}\sum _{i}p_{i}\ln(p_{i})

subject to two physical constraints:

1. The probabilities of all states add to unity.

\sum _{i}p_{i}=1

2. In the canonical ensemble, the average energy is fixed.

\langle E\rangle =\sum _{i}p_{i}E_{i}=U

Using the method of Lagrange multipliers, we form the Lagrangian ${\mathcal {L}}$ :

{\mathcal {L}}=-k_{B}\sum _{i}p_{i}\ln(p_{i})+\lambda _{1}\left(\sum _{i}p_{i}-1\right)+\lambda _{2}\left(\sum _{i}p_{i}E_{i}-U\right)

To obtain $\lambda _{2}$ , we differentiate ${\mathcal {L}}$ with respect to the average energy $U$ and identify the result with the thermodynamic relation $\partial S/\partial U=1/T$ , which follows from the first law of thermodynamics at fixed volume and particle number:

{\frac {\partial {\mathcal {L}}}{\partial U}}=-\lambda _{2}={\frac {1}{T}}

Setting the derivative of ${\mathcal {L}}$ with respect to each $p_{i}$ to zero, with $\lambda _{2}=-1/T$ , leads to:

0={\frac {\partial {\mathcal {L}}}{\partial p_{i}}}=-k_{B}\ln(p_{i})-k_{B}+\lambda _{1}-{\frac {E_{i}}{T}}

Isolating for $p_{i}$ yields:

{\begin{aligned}p_{i}&=\mathrm {e} ^{{\frac {1}{k_{B}}}(-k_{B}+\lambda _{1}-{\frac {E_{i}}{T}})}\\&=\mathrm {e} ^{{\frac {1}{k_{B}}}(-k_{B}+\lambda _{1})}\mathrm {e} ^{-\beta E_{i}}\end{aligned}}

where $\beta :={\frac {1}{k_{B}T}}$ . To obtain $\lambda _{1}$ , we substitute the probability into the first constraint:

{\begin{aligned}1&=\sum _{i}p_{i}\\&=\mathrm {e} ^{{\frac {1}{k_{B}}}(-k_{B}+\lambda _{1})}\sum _{i}\mathrm {e} ^{-\beta E_{i}}\end{aligned}}

\mathrm {e} ^{{\frac {1}{k_{B}}}(-k_{B}+\lambda _{1})}={\frac {1}{\sum _{i}\mathrm {e} ^{-\beta E_{i}}}}

We now define the partition function:

Z=\sum _{i}\mathrm {e} ^{-\beta E_{i}}

Rewriting $p_{i}$ in terms of $Z$ gives:

{\begin{aligned}p_{i}&=\mathrm {e} ^{{\frac {1}{k_{B}}}(-k_{B}+\lambda _{1})}\mathrm {e} ^{-\beta E_{i}}\\&={\frac {1}{Z}}\mathrm {e} ^{-\beta E_{i}}\end{aligned}}

Rewriting $U$ in terms of $Z$ gives:

{\begin{aligned}U&=\sum _{i}p_{i}E_{i}\\&={\frac {1}{Z}}\sum _{i}\mathrm {e} ^{-\beta E_{i}}E_{i}\end{aligned}}

Finally, rewriting $S$ in terms of $Z$ gives:

{\begin{aligned}S&=-k_{B}\sum _{i}p_{i}\ln(p_{i})\\&={\frac {k_{B}}{Z}}\sum _{i}\mathrm {e} ^{-\beta E_{i}}(\beta E_{i}+\ln(Z))\\&={\frac {U}{T}}+k_{B}\ln(Z)\end{aligned}}
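The maximum-entropy property can be illustrated numerically. The sketch below assumes a hypothetical three-level system with energies 0, 1, 2 and units where k_B = 1. It perturbs the Boltzmann distribution along a direction that preserves both constraints (normalization and mean energy) and shows that the entropy decreases.

```python
import math


def entropy(p):
    """Gibbs entropy -sum p ln p, in units of k_B."""
    return -sum(x * math.log(x) for x in p if x > 0)


def mean_energy(p, energies):
    return sum(x * e for x, e in zip(p, energies))


energies = [0.0, 1.0, 2.0]  # hypothetical
beta = 1.0
weights = [math.exp(-beta * e) for e in energies]
z = sum(weights)
p_boltzmann = [w / z for w in weights]

# For these energies the direction (1, -2, 1) changes neither sum(p) nor sum(p * E).
direction = [1.0, -2.0, 1.0]


def perturbed(eps):
    return [p + eps * d for p, d in zip(p_boltzmann, direction)]


for eps in (-0.02, 0.0, 0.02):
    print(eps, entropy(perturbed(eps)))
```

The entropy printed for eps = 0 is the largest of the three, consistent with the Boltzmann distribution being the constrained maximum.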

Relation to thermodynamic variables

In this section, we will state the relationships between the partition function and the various thermodynamic parameters of the system. These results can be derived using the method of the previous section and the various thermodynamic relations.

As we have already seen, the thermodynamic energy is

\langle E\rangle =-{\frac {\partial \ln Z}{\partial \beta }}.

The variance in the energy (or "energy fluctuation") is

\langle (\Delta E)^{2}\rangle \equiv \langle (E-\langle E\rangle )^{2}\rangle ={\frac {\partial ^{2}\ln Z}{\partial \beta ^{2}}}.

The heat capacity is

C_{v}={\frac {\partial \langle E\rangle }{\partial T}}={\frac {1}{k_{B}T^{2}}}\langle (\Delta E)^{2}\rangle .
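The relation between heat capacity and energy fluctuations can be checked with a short sketch, assuming a hypothetical three-level spectrum and units where k_B = 1. The temperature derivative of ⟨E⟩ is approximated by a central difference.

```python
import math


def moments(energies, beta):
    """Return (<E>, variance of E) under the Boltzmann distribution."""
    weights = [math.exp(-beta * e) for e in energies]
    z = sum(weights)
    mean = sum(e * w for e, w in zip(energies, weights)) / z
    mean_sq = sum(e * e * w for e, w in zip(energies, weights)) / z
    return mean, mean_sq - mean * mean


def heat_capacity_numeric(energies, temperature, h=1e-5):
    """Approximate d<E>/dT by a central difference (k_B = 1)."""
    e_plus, _ = moments(energies, 1.0 / (temperature + h))
    e_minus, _ = moments(energies, 1.0 / (temperature - h))
    return (e_plus - e_minus) / (2 * h)


energies = [0.0, 1.0, 3.0]  # hypothetical
T = 1.5
_, variance = moments(energies, 1.0 / T)
c_v_from_fluctuations = variance / T**2
print(c_v_from_fluctuations, heat_capacity_numeric(energies, T))
```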

The entropy is

S\equiv -k_{B}\sum _{s}P_{s}\ln P_{s}=k_{B}(\ln Z+\beta \langle E\rangle )={\frac {\partial }{\partial T}}(k_{B}T\ln Z)=-{\frac {\partial A}{\partial T}}

where A is the Helmholtz free energy defined as A = U − TS, where U = ⟨E⟩ is the total energy and S is the entropy, so that

A=\langle E\rangle -TS=-k_{B}T\ln Z.
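These relations can be verified for a small example. The sketch assumes a hypothetical three-level system in units where k_B = 1, and compares the Gibbs entropy computed from the probabilities with the expression in terms of ln Z.

```python
import math

energies = [0.0, 0.5, 2.0]  # hypothetical
T = 0.8
beta = 1.0 / T

weights = [math.exp(-beta * e) for e in energies]
z = sum(weights)
probs = [w / z for w in weights]

U = sum(p * e for p, e in zip(probs, energies))   # <E>
S_gibbs = -sum(p * math.log(p) for p in probs)     # -sum P ln P
S_from_z = math.log(z) + beta * U                  # ln Z + beta <E>
A = -T * math.log(z)                               # Helmholtz free energy

print(S_gibbs, S_from_z, A, U - T * S_gibbs)
```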

Partition functions of subsystems

Suppose a system is subdivided into N sub-systems with negligible interaction energy, that is, we can assume the particles are essentially non-interacting. If the partition functions of the sub-systems are ζ₁, ζ₂, ..., ζ_N, then the partition function of the entire system is the product of the individual partition functions:

Z=\prod _{j=1}^{N}\zeta _{j}.

If the sub-systems have the same physical properties, then their partition functions are equal, ζ₁ = ζ₂ = ... = ζ, in which case

Z=\zeta ^{N}.

However, there is a well-known exception to this rule. If the sub-systems are actually identical particles, in the quantum mechanical sense that they are impossible to distinguish even in principle, the total partition function must be divided by N! (N factorial):

Z={\frac {\zeta ^{N}}{N!}}.

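This is to ensure that we do not "over-count" the number of microstates: configurations that differ only by a permutation of identical particles are the same physical state. While this may seem like a strange requirement, it is necessary to preserve the existence of a thermodynamic limit for such systems. Without it the entropy would not be extensive, a difficulty known as the Gibbs paradox.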
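For distinguishable, non-interacting sub-systems, the product rule Z = ζ^N can be checked by brute-force enumeration of the joint microstates. The sketch below assumes hypothetical two-level sub-systems and units where k_B = 1; it does not model the identical-particle case.

```python
import itertools
import math


def single_z(levels, beta):
    """Partition function of one sub-system."""
    return sum(math.exp(-beta * e) for e in levels)


def composite_z(levels, n, beta):
    """Sum over every joint microstate of n distinguishable sub-systems.

    With negligible interaction, the total energy is the sum of the parts.
    """
    return sum(
        math.exp(-beta * sum(config))
        for config in itertools.product(levels, repeat=n)
    )


levels = [0.0, 1.0]  # hypothetical two-level sub-system
beta = 1.3
n = 4
print(composite_z(levels, n, beta), single_z(levels, beta) ** n)
```

The enumeration visits 2^4 = 16 joint states here; the factorized form avoids that exponential cost.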

Grand canonical partition function

We can define a grand canonical partition function for a grand canonical ensemble, which describes the statistics of a constant-volume system that can exchange both heat and particles with a reservoir. The reservoir has a constant temperature T, and a chemical potential μ.

The grand canonical partition function, denoted by ${\mathcal {Z}}$ , is the following sum over microstates

{\mathcal {Z}}(\mu ,V,T)=\sum _{i}\exp((N_{i}\mu -E_{i})/k_{B}T).

Here, each microstate is labelled by $i$ , and has total particle number $N_{i}$ and total energy $E_{i}$ . This partition function is closely related to the grand potential, $\Phi _{\rm {G}}$ , by the relation

-k_{B}T\ln {\mathcal {Z}}=\Phi _{\rm {G}}=\langle E\rangle -TS-\mu \langle N\rangle .

This can be contrasted to the canonical partition function above, which is related instead to the Helmholtz free energy.

It is important to note that the number of microstates in the grand canonical ensemble may be much larger than in the canonical ensemble, since here we consider not only variations in energy but also in particle number. Again, the utility of the grand canonical partition function is that it is related to the probability that the system is in state $i$ :

p_{i}={\frac {1}{\mathcal {Z}}}\exp((N_{i}\mu -E_{i})/k_{B}T).

An important application of the grand canonical ensemble is in deriving exactly the statistics of a non-interacting many-body quantum gas (Fermi–Dirac statistics for fermions, Bose–Einstein statistics for bosons), but it is much more generally applicable than that. The grand canonical ensemble may also be used to describe classical systems, or even interacting quantum gases.
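As an illustration of how these statistics arise, the sketch below applies the grand canonical probabilities to a single orbital of energy ε, assuming units where k_B = 1 and hypothetical parameter values with μ < ε. A fermionic orbital holds 0 or 1 particles; a bosonic orbital holds any number, and the sum is truncated at a large occupation as a numerical approximation.

```python
import math


def mean_occupation(eps, mu, beta, max_n):
    """<n> for one orbital whose microstates are occupations n = 0..max_n."""
    weights = [math.exp(beta * n * (mu - eps)) for n in range(max_n + 1)]
    grand_z = sum(weights)  # grand partition function of the single orbital
    return sum(n * w for n, w in enumerate(weights)) / grand_z


eps, mu, beta = 1.0, 0.5, 2.0  # hypothetical values, mu < eps
fermi = mean_occupation(eps, mu, beta, max_n=1)
bose = mean_occupation(eps, mu, beta, max_n=200)  # truncated geometric series

print(fermi, 1.0 / (math.exp(beta * (eps - mu)) + 1.0))  # Fermi-Dirac form
print(bose, 1.0 / (math.exp(beta * (eps - mu)) - 1.0))   # Bose-Einstein form
```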

The grand partition function is sometimes written (equivalently) in terms of alternate variables as^[2]

{\mathcal {Z}}(z,V,T)=\sum _{N_{i}}z^{N_{i}}Z(N_{i},V,T),

where $z\equiv \exp(\mu /k_{B}T)$ is known as the absolute activity (or fugacity) and $Z(N_{i},V,T)$ is the canonical partition function.
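The equivalence of the two forms can be checked on a small model. The sketch assumes three hypothetical non-interacting fermionic orbitals in units where k_B = 1, enumerates all occupation patterns, and compares the direct sum over microstates with the fugacity expansion.

```python
import itertools
import math

orbital_energies = [0.0, 0.4, 1.1]  # hypothetical
beta, mu = 1.5, 0.2
z = math.exp(beta * mu)  # fugacity

# Each microstate is a tuple of orbital occupations (0 or 1).
occupations = list(itertools.product([0, 1], repeat=len(orbital_energies)))


def energy(occ):
    return sum(n * e for n, e in zip(occ, orbital_energies))


# Direct sum over microstates of exp((N*mu - E) / (k_B T)).
grand_direct = sum(
    math.exp(beta * (mu * sum(occ) - energy(occ))) for occ in occupations
)

# Canonical partition function Z(N) for each particle number N.
canonical = {}
for occ in occupations:
    n = sum(occ)
    canonical[n] = canonical.get(n, 0.0) + math.exp(-beta * energy(occ))

grand_from_fugacity = sum(z**n * z_n for n, z_n in canonical.items())
print(grand_direct, grand_from_fugacity)
```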

References

[1] J. R. Klauder, B.-S. Skagerstam, Coherent States: Applications in Physics and Mathematical Physics, World Scientific, 1985, pp. 71–73.
[2] Baxter, Rodney J. (1982). Exactly solved models in statistical mechanics. Academic Press Inc. ISBN 9780120831807.

Huang, Kerson, "Statistical Mechanics", John Wiley & Sons, New York, 1967.
A. Isihara, "Statistical Physics", Academic Press, New York, 1971.
Kelly, James J, (Lecture notes)
L. D. Landau and E. M. Lifshitz, "Statistical Physics, 3rd Edition Part 1", Butterworth-Heinemann, Oxford, 1996.
Vu-Quoc, L., Configuration integral (statistical mechanics), 2008.

