In classical statistical mechanics, the H-theorem, introduced by Ludwig Boltzmann in 1872, describes the tendency of the quantity H (defined below) to decrease in a nearly ideal gas of molecules. As this quantity H was meant to represent the entropy of thermodynamics (with the opposite sign, so that a decrease in H corresponds to an increase in entropy), the H-theorem was an early demonstration of the power of statistical mechanics: it claimed to derive the second law of thermodynamics, a statement about fundamentally irreversible processes, from reversible microscopic mechanics.
The H-theorem is a natural consequence of the kinetic equation derived by Boltzmann that has come to be known as Boltzmann's equation. The H-theorem has led to considerable discussion about its actual implications, with major themes being:
- What is entropy? In what sense does Boltzmann's quantity H correspond to the thermodynamic entropy?
- Are the assumptions (such as the Stosszahlansatz described below) behind Boltzmann's equation too strong? When are these assumptions violated?
Definition and meaning of Boltzmann's H
The H value is determined from the function f(E,t) dE, which is the energy distribution function of molecules at time t. The value f(E,t) dE is the number of molecules that have kinetic energy between E and E + dE. H itself is defined as:
$$H(t) = \int_0^\infty f(E,t) \left[ \ln\left( \frac{f(E,t)}{\sqrt{E}} \right) - 1 \right] dE \qquad (1)$$
For an isolated ideal gas (with fixed total energy and fixed total number of particles), the function H is at a minimum when the particles have a Maxwell–Boltzmann distribution; if the molecules of the ideal gas are distributed in some other way (say, all having the same kinetic energy), then the value of H will be higher. Boltzmann's H-theorem, described in the next section, shows that when collisions between molecules are allowed, such distributions are unstable and tend irreversibly towards the minimum value of H (that is, towards the Maxwell–Boltzmann distribution).
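The claim that the Maxwell–Boltzmann distribution minimizes H can be checked numerically. The following is a minimal sketch, not taken from the source: it assumes the definition H = ∫ f (ln(f/√E) − 1) dE, normalizes f to one particle, and sets kT = 1. The comparison distribution `narrower` is a hypothetical choice (a gamma distribution with the same mean energy 3kT/2), and the integration limits and step count are arbitrary.

```python
import math

def boltzmann_h(f, e_max=60.0, steps=200_000):
    """Midpoint-rule estimate of H = integral of f(E) * (ln(f(E)/sqrt(E)) - 1) dE."""
    de = e_max / steps
    total = 0.0
    for i in range(steps):
        e = (i + 0.5) * de              # midpoints avoid E = 0
        fe = f(e)
        if fe > 0.0:                    # f * ln(f) tends to 0 as f tends to 0
            total += fe * (math.log(fe / math.sqrt(e)) - 1.0) * de
    return total

KT = 1.0  # temperature in energy units (assumed value)

def maxwell_boltzmann(e):
    """Maxwell-Boltzmann energy distribution, normalized to one particle."""
    return 2.0 / math.sqrt(math.pi) * KT ** -1.5 * math.sqrt(e) * math.exp(-e / KT)

def narrower(e):
    """Gamma distribution (shape 3, scale KT/2): same mean energy 3*KT/2."""
    theta = KT / 2.0
    return e ** 2 * math.exp(-e / theta) / (2.0 * theta ** 3)

h_mb = boltzmann_h(maxwell_boltzmann)
h_other = boltzmann_h(narrower)
print("H (Maxwell-Boltzmann):", h_mb)
print("H (comparison)       :", h_other)
```

Both distributions carry the same particle number and the same total energy, so the comparison respects the constraints of an isolated gas; under these assumptions the Maxwell–Boltzmann value comes out lower.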
(Note on notation: Boltzmann originally used the letter E for quantity H; most of the literature after Boltzmann uses the letter H as here. Boltzmann also used the symbol x to refer to the kinetic energy of a particle.)
Boltzmann's H theorem
Boltzmann considered what happens during the collision between two particles. It is a basic fact of mechanics that in the elastic collision between two particles (such as hard spheres), the energy transferred between the particles varies depending on initial conditions (angle of collision, etc.).
Boltzmann made a key assumption known as the Stosszahlansatz (molecular chaos assumption), that during any collision event in the gas, the two particles participating in the collision have 1) independently chosen kinetic energies from the distribution, 2) independent velocity directions, 3) independent starting points. Under these assumptions, and given the mechanics of energy transfer, the energies of the particles after the collision will obey a certain new random distribution that can be computed.
Considering repeated uncorrelated collisions, between any and all of the molecules in the gas, Boltzmann constructed his kinetic equation (Boltzmann's equation). From this kinetic equation, a natural outcome is that the continual process of collision causes the quantity H to decrease until it has reached a minimum.
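The decrease of H under repeated uncorrelated collisions can be illustrated with a small Monte Carlo experiment. This is only a sketch under stated assumptions, not Boltzmann's kinetic equation itself: particles have unit mass, collision partners are picked uniformly at random (which builds in the molecular chaos assumption), each elastic collision scatters the relative velocity into a random direction, and H is estimated from an energy histogram with a hypothetical bin width of 0.1. All function names are illustrative.

```python
import math
import random

def random_unit_vector(rng):
    """Uniformly distributed direction on the unit sphere."""
    z = rng.uniform(-1.0, 1.0)
    phi = rng.uniform(0.0, 2.0 * math.pi)
    r = math.sqrt(1.0 - z * z)
    return (r * math.cos(phi), r * math.sin(phi), z)

def h_from_energies(energies, bin_width=0.1):
    """Histogram estimate of H: sum over bins of n * (ln(n / (dE * sqrt(E))) - 1)."""
    counts = {}
    for e in energies:
        b = int(e / bin_width)
        counts[b] = counts.get(b, 0) + 1
    h = 0.0
    for b, n in counts.items():
        e_mid = (b + 0.5) * bin_width
        h += n * (math.log(n / (bin_width * math.sqrt(e_mid))) - 1.0)
    return h

def simulate(n_particles=2000, sweeps=10, seed=0):
    rng = random.Random(seed)
    speed = math.sqrt(2.0 * 1.05)       # every particle starts with energy 1.05
    vel = [tuple(speed * c for c in random_unit_vector(rng))
           for _ in range(n_particles)]

    def energies():
        return [0.5 * (vx * vx + vy * vy + vz * vz) for vx, vy, vz in vel]

    history = [h_from_energies(energies())]
    for _ in range(sweeps):
        for _ in range(n_particles):
            i, j = rng.sample(range(n_particles), 2)   # uncorrelated partners
            vi, vj = vel[i], vel[j]
            cm = [(a + b) / 2.0 for a, b in zip(vi, vj)]
            g = math.sqrt(sum((a - b) ** 2 for a, b in zip(vi, vj)))
            n = random_unit_vector(rng)                # new relative direction
            vel[i] = tuple(c + 0.5 * g * d for c, d in zip(cm, n))
            vel[j] = tuple(c - 0.5 * g * d for c, d in zip(cm, n))
        history.append(h_from_energies(energies()))
    return history, sum(energies())

h_history, total_energy = simulate()
print("H at start:", h_history[0], " H at end:", h_history[-1])
```

Each collision conserves the pair's momentum and kinetic energy, so the total energy stays fixed while the energy histogram spreads out from its initial spike. Because the estimate is statistical, individual steps may show small fluctuations; only the overall trend is meaningful.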
Impact
Although Boltzmann's H-theorem turned out not to be the absolute proof of the second law of thermodynamics as originally claimed (see Criticisms below), the H-theorem led Boltzmann in the last years of the 19th century to more and more probabilistic arguments about the nature of thermodynamics. The probabilistic view of thermodynamics culminated in 1902 with Josiah Willard Gibbs's statistical mechanics for fully general systems (not just gases), and the introduction of generalized statistical ensembles.
The kinetic equation and in particular Boltzmann's molecular chaos assumption inspired a whole family of Boltzmann equations that are still used today to model the motions of particles, such as the electrons in a semiconductor. In many cases the molecular chaos assumption is highly accurate, and the ability to discard complex correlations between particles makes calculations much simpler.
Criticism of the H-theorem and exceptions
There are several notable reasons, described below, why the H-theorem, at least in its original 1872 form, is not completely rigorous. As Boltzmann would eventually go on to admit, the arrow of time in the H-theorem is not in fact purely mechanical, but really a consequence of assumptions about initial conditions.
Soon after Boltzmann published his H theorem, Johann Josef Loschmidt objected that it should not be possible to deduce an irreversible process from time-symmetric dynamics and a time-symmetric formalism. If H decreases over time in one state, then there must be a matching reversed state where H increases over time (Loschmidt's paradox). The explanation is that Boltzmann's equation is based on the assumption of "molecular chaos", i.e., that it follows from, or at least is consistent with, the underlying kinetic model that the particles be considered independent and uncorrelated. It turns out that this assumption breaks time-reversal symmetry in a subtle sense, and therefore begs the question. Once the particles are allowed to collide, their velocity directions and positions do in fact become correlated (although these correlations are encoded in an extremely complex manner). This shows that an ongoing assumption of independence is not consistent with the underlying particle model.
Boltzmann's reply to Loschmidt was to concede the possibility of these states, while noting that they were so rare and unusual as to be impossible in practice. Boltzmann would go on to sharpen this notion of the "rarity" of states, resulting in his famous entropy formula of 1877 (see Boltzmann's entropy formula).
As a demonstration of Loschmidt's paradox, a famous modern counterexample (not to Boltzmann's original gas-related H-theorem, but to a closely related analogue) is the phenomenon of spin echo. In the spin echo effect, it is physically possible to induce time reversal in an interacting system of spins.
An analogue to Boltzmann's H for the spin system can be defined in terms of the distribution of spin states in the system. In the experiment, the spin system is initially perturbed into a non-equilibrium state (high H), and, as predicted by the H theorem, the quantity H soon decreases to the equilibrium value. At some point, a carefully constructed electromagnetic pulse is applied that reverses the motions of all the spins. The spins then undo the time evolution from before the pulse, and after some time H actually increases away from equilibrium (once the evolution has completely unwound, H decreases once again to the minimum value). In some sense, the time-reversed states noted by Loschmidt turned out to be not completely impractical.
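The reversal described above can be imitated with a toy model. The sketch below is an idealization and not the experiment from the source: the spins are non-interacting, each precesses at its own randomly drawn rate, the pulse is modeled as an instantaneous sign flip of every phase, and the net transverse magnetization (rather than an explicit H) serves as the measure of distance from equilibrium. The function name and all parameter values are hypothetical.

```python
import cmath
import random

def echo_signal(n_spins=2000, tau=10.0, dt=0.5, seed=1):
    """Net transverse magnetization |<exp(i*phase)>| sampled every dt up to 2*tau."""
    rng = random.Random(seed)
    freqs = [rng.gauss(0.0, 1.0) for _ in range(n_spins)]  # spread of precession rates
    phases = [0.0] * n_spins                               # all spins start aligned
    steps = int(round(tau / dt))
    signal = []
    for step in range(2 * steps + 1):
        m = abs(sum(cmath.exp(1j * p) for p in phases)) / n_spins
        signal.append(m)
        if step == steps:
            phases = [-p for p in phases]                  # idealized reversing pulse
        phases = [p + w * dt for p, w in zip(phases, freqs)]
    return signal

signal = echo_signal()
print("start:", signal[0], " at pulse:", signal[20], " at echo:", signal[40])
```

The signal starts near 1, decays towards 0 as the spins dephase, and returns to nearly 1 at time 2τ, when every spin has retraced its own accumulated phase. In this simplified picture the apparent relaxation is undone because no information was actually lost.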
In 1896, Ernst Zermelo noted a further problem with the H theorem, which was that if the system's H is at any time not a minimum, then by Poincaré recurrence, the non-minimal H must recur (though after some extremely long time). Boltzmann admitted that these recurring rises in H technically would occur, but pointed out that, over long times, the system spends only a tiny fraction of its time in one of these recurring states.
Fluctuations of H in small systems
Since H is a mechanically defined variable that is not conserved, it will, like any other such variable (pressure, etc.), show thermal fluctuations. This means that H regularly shows spontaneous increases from the minimum value. Technically this is not an exception to the H theorem, since the H theorem was only intended to apply to a gas with a very large number of particles. These fluctuations are only perceptible when the system is small.
If H is interpreted as entropy as Boltzmann intended, then this can be seen as a manifestation of the fluctuation theorem.
Connection to information theory
H is a forerunner of Shannon's information entropy. Claude Shannon denoted his measure of information entropy H after the H-theorem. The article on Shannon's information entropy contains an explanation of the discrete counterpart of the quantity H, known as the information entropy or information uncertainty (with a minus sign). By extending the discrete information entropy to the continuous information entropy, also called differential entropy, one obtains the expression in Eq. (1), and thus a better feel for the meaning of H.
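The sign relation between the discrete counterpart of H and the information entropy can be made concrete in a few lines. This is a minimal sketch using natural logarithms; the function names are illustrative and the example distributions are arbitrary.

```python
import math

def shannon_entropy(probs):
    """Information entropy -sum(p * ln p), with 0 * ln 0 taken as 0."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)

def discrete_h(probs):
    """Discrete counterpart of H: sum(p * ln p), the negative of the entropy."""
    return -shannon_entropy(probs)

uniform = [0.25, 0.25, 0.25, 0.25]
peaked = [0.97, 0.01, 0.01, 0.01]
print(shannon_entropy(uniform), discrete_h(uniform))
print(shannon_entropy(peaked), discrete_h(peaked))
```

The uniform distribution has the largest entropy (ln 4 for four outcomes) and therefore the smallest H, mirroring the way the equilibrium distribution minimizes Boltzmann's H.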
The H-theorem's connection between information and entropy plays a central role in a recent controversy called the black hole information paradox.
Tolman's H-theorem
Tolman's 1938 book "The Principles of Statistical Mechanics" dedicates a whole chapter to the study of Boltzmann's H theorem, and its extension in the generalized classical statistical mechanics of Gibbs. A further chapter is devoted to the quantum mechanical version of the H-theorem.
Starting with a function f that defines the number of molecules in a small region of phase space denoted by $\delta q_1 \cdots \delta p_r$, Tolman offers the following equations for the definition of the quantity H in Boltzmann's original H theorem.
$$H = \sum_i f_i \ln f_i \, \delta q_1 \cdots \delta p_r$$
Here we sum over the regions into which phase space is divided, indexed by i.
This relation can also be written in integral form.
$$H = \int \cdots \int f \ln f \, dq_1 \cdots dp_r$$
H can also be written in terms of the number of molecules present in each of the cells, $n_i = f_i \, \delta q_1 \cdots \delta p_r$.
$$H = \sum_i n_i \ln n_i + \text{constant}$$
An additional way to calculate the quantity H is:
$$H = -\ln P + \text{constant}$$
where P is the probability of finding a system chosen at random from the specified microcanonical ensemble. It can finally be written as:
$$H = -\ln G + \text{constant}$$
where G is the number of classical states.
The quantity H can also be defined as the integral over velocity space:
$$H = \int P(v) \ln P(v) \, d^3v = \langle \ln P \rangle$$
where P(v) is the probability distribution.
Using the Boltzmann equation one can prove that H can only decrease.
For a system of N statistically independent particles, H is related to the thermodynamic entropy S through:
$$S = -N k H$$
where k is Boltzmann's constant, so, according to the H-theorem, S can only increase.
In quantum statistical mechanics (which is the quantum version of classical statistical mechanics), the H-function is the function:
$$H = \sum_i p_i \ln p_i$$
where summation runs over all possible distinct states of the system, and $p_i$ is the probability that the system could be found in the i-th state.
This is closely related to the entropy formula of Gibbs,
$$S = -k \sum_i p_i \ln p_i$$
and we shall (following e.g., Waldram (1985), p. 39) proceed using S rather than H.
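First, differentiating with respect to time gives
$$\frac{dS}{dt} = -k \sum_i \left( \frac{dp_i}{dt} \ln p_i + \frac{dp_i}{dt} \right) = -k \sum_i \frac{dp_i}{dt} \ln p_i$$
(using the fact that $\sum_i dp_i/dt = 0$, since $\sum_i p_i = 1$).
Now Fermi's golden rule gives a master equation for the average rate of quantum jumps from state α to β; and from state β to α. (Of course, Fermi's golden rule itself makes certain approximations, and the introduction of this rule is what introduces irreversibility. It is essentially the quantum version of Boltzmann's Stosszahlansatz.) For an isolated system the jumps will make contributions
$$\frac{dp_\alpha}{dt} = \sum_\beta \nu_{\alpha\beta} (p_\beta - p_\alpha), \qquad \frac{dp_\beta}{dt} = \sum_\alpha \nu_{\alpha\beta} (p_\alpha - p_\beta)$$
where the reversibility of the dynamics ensures that the same transition constant $\nu_{\alpha\beta}$ appears in both expressions. Substituting these into the expression for dS/dt and symmetrizing over α and β gives
$$\frac{dS}{dt} = \frac{1}{2} k \sum_{\alpha,\beta} \nu_{\alpha\beta} (\ln p_\beta - \ln p_\alpha)(p_\beta - p_\alpha)$$
But the two brackets will have the same sign, so each contribution to dS/dt cannot be negative. Therefore
$$\Delta S \geq 0$$
for an isolated system.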
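The monotonic growth of S under a master equation with symmetric transition constants can be checked numerically. The following sketch assumes a hypothetical three-state system with arbitrarily chosen symmetric rates `nu`, sets k = 1, and integrates dp_α/dt = Σ_β ν_αβ (p_β − p_α) with simple Euler steps; none of these choices come from the source.

```python
import math

def entropy(p, k=1.0):
    """Gibbs entropy S = -k * sum(p_i * ln p_i), with 0 * ln 0 taken as 0."""
    return -k * sum(x * math.log(x) for x in p if x > 0.0)

def evolve(p, nu, dt, steps):
    """Euler-integrate dp_a/dt = sum_b nu[a][b] * (p[b] - p[a]), recording S."""
    p = list(p)
    n = len(p)
    history = [entropy(p)]
    for _ in range(steps):
        dp = [sum(nu[a][b] * (p[b] - p[a]) for b in range(n)) for a in range(n)]
        p = [x + dt * d for x, d in zip(p, dp)]
        history.append(entropy(p))
    return p, history

# Symmetric transition constants: nu[a][b] == nu[b][a] (illustrative values).
nu = [[0.0, 1.0, 0.2],
      [1.0, 0.0, 0.5],
      [0.2, 0.5, 0.0]]

p_final, s_history = evolve([1.0, 0.0, 0.0], nu, dt=0.01, steps=2000)
print("final probabilities:", p_final)
print("S start:", s_history[0], " S end:", s_history[-1], " ln 3:", math.log(3))
```

Starting from a system known to be in one state (S = 0), the probabilities relax towards the uniform distribution and S rises towards ln 3. With a sufficiently small time step each Euler update is a doubly stochastic mixing of the probabilities, so the recorded entropy should never decrease beyond rounding error.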
Gibbs' H-theorem
Josiah Willard Gibbs described another way in which the entropy of a microscopic system would tend to increase over time. Later writers have called this "Gibbs' H-theorem" as its conclusion resembles that of Boltzmann's. Gibbs himself never called it an H-theorem, and in fact his definition of entropy, and his mechanism of increase, are very different from Boltzmann's. This section is included for historical completeness.
The setting of Gibbs' entropy production theorem is in ensemble statistical mechanics, and the entropy quantity is the Gibbs entropy (information entropy) defined in terms of the probability distribution for the entire state of the system. This is in contrast to Boltzmann's H defined in terms of the distribution of states of individual molecules, within a specific state of the system.
Gibbs considered the motion of an ensemble which initially starts out confined to a small region of phase space, meaning that the state of the system is known with fair precision though not quite exactly (low Gibbs entropy). The evolution of this ensemble over time proceeds according to Liouville's equation. For almost any kind of realistic system, the Liouville evolution tends to "stir" the ensemble over phase space, a process analogous to the mixing of a dye in an incompressible fluid. After some time, the ensemble appears to be spread out over phase space, although it is actually a finely striped pattern, with the total volume of the ensemble (and its Gibbs entropy) conserved. Liouville's equation is guaranteed to conserve Gibbs entropy since there is no random process acting on the system; in principle, the original ensemble can be recovered at any time by reversing the motion.
The critical point of the theorem is thus: If the fine structure in the stirred-up ensemble is very slightly blurred, for any reason, then the Gibbs entropy increases, and the ensemble becomes an equilibrium ensemble. As to why this blurring should occur in reality, there are a variety of suggested mechanisms. For example, one suggested mechanism is that the phase space is coarse-grained for some reason (analogous to the pixelization in the simulation of phase space shown in the figure). For any required finite degree of fineness the ensemble becomes "sensibly uniform" after a finite time. Or, if the system experiences a tiny uncontrolled interaction with its environment, the sharp coherence of the ensemble will be lost. Edwin Thompson Jaynes argued that the blurring is subjective in nature, simply corresponding to a loss of knowledge about the state of the system. In any case, however it occurs, the Gibbs entropy increase is irreversible provided the blurring cannot be reversed.
The exactly evolving entropy, which does not increase, is known as fine-grained entropy. The blurred entropy is known as coarse-grained entropy. Leonard Susskind analogizes this distinction to the notion of the volume of a fibrous ball of cotton: On one hand the volume of the fibers themselves is constant, but in another sense there is a larger coarse-grained volume, corresponding to the outline of the ball.
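The difference between fine-grained and coarse-grained entropy can be illustrated with a toy phase space. The sketch below is an assumption-laden stand-in for Liouville evolution, not Gibbs' own construction: it uses Arnold's cat map on the unit square (an area-preserving, mixing map), represents the ensemble by a finite sample of points, and coarse-grains with a hypothetical 10 × 10 grid of cells.

```python
import math
import random

def coarse_entropy(points, cells=10):
    """Entropy -sum(P * ln P) of the cell occupation probabilities."""
    counts = {}
    for x, y in points:
        key = (int(x * cells), int(y * cells))
        counts[key] = counts.get(key, 0) + 1
    n = len(points)
    return -sum(c / n * math.log(c / n) for c in counts.values())

def cat_map(point):
    """Arnold's cat map: area-preserving (determinant 1) and strongly mixing."""
    x, y = point
    return ((x + y) % 1.0, (x + 2.0 * y) % 1.0)

def run(n_points=5000, iterations=10, seed=2):
    rng = random.Random(seed)
    # Ensemble initially confined to a small corner of phase space.
    pts = [(rng.uniform(0.0, 0.1), rng.uniform(0.0, 0.1)) for _ in range(n_points)]
    history = [coarse_entropy(pts)]
    for _ in range(iterations):
        pts = [cat_map(p) for p in pts]
        history.append(coarse_entropy(pts))
    return history

entropy_history = run()
print("coarse-grained entropy:", entropy_history[0], "->", entropy_history[-1])
print("maximum for 100 cells :", math.log(100))
```

The map itself conserves phase-space area, so the fine-grained volume of the ensemble is unchanged; only the entropy computed from cell occupations grows, approaching its maximum ln 100 once the stretched and folded ensemble has visited every cell roughly equally.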
Gibbs' entropy increase mechanism solves some of the technical difficulties found in Boltzmann's H-theorem: the Gibbs entropy does not fluctuate, nor does it exhibit Poincaré recurrence, and so the increase in Gibbs entropy, when it occurs, is irreversible as expected from thermodynamics. The Gibbs mechanism also applies equally well to systems with very few degrees of freedom, such as the single-particle system shown in the figure. To the extent that one accepts that the ensemble becomes blurred, then, Gibbs' approach is a cleaner proof of the second law of thermodynamics.
Notes
- L. Boltzmann, "Weitere Studien über das Wärmegleichgewicht unter Gasmolekülen." Sitzungsberichte Akademie der Wissenschaften 66 (1872): 275-370.
English translation: Boltzmann, L. (2003). "Further Studies on the Thermal Equilibrium of Gas Molecules". The Kinetic Theory of Gases. History of Modern Physical Sciences 1. pp. 262–349. doi:10.1142/9781848161337_0015. ISBN 978-1-86094-347-8.
- J. Uffink, "Compendium of the foundations of classical statistical physics." (2006)
- Rothstein, J. (1957). "Nuclear Spin Echo Experiments and the Foundations of Statistical Mechanics". American Journal of Physics 25 (8): 510–511. doi:10.1119/1.1934539.
- Gleick 2011
- Tolman 1938 pg. 135 formula 47.5
- Tolman 1938 pg. 135 formula 47.6
- Tolman 1938 pg. 135 formula 47.7
- Tolman 1938 pg. 135 formula 47.8
- Tolman 1938 pg. 136 formula 47.9
- Tolman 1938 pg. 460 formula 104.7
- Chapter XII, from Gibbs, Josiah Willard (1902). Elementary Principles in Statistical Mechanics. New York: Charles Scribner's Sons.
- Tolman, R. C. (1938). The Principles of Statistical Mechanics. Dover Publications. ISBN 9780486638966.
- Jaynes, E. T. (1965). "Gibbs vs Boltzmann Entropies". American Journal of Physics 33: 391.
- Leonard Susskind, Statistical Mechanics Lecture 7 (2013). Video at YouTube.
References
- Lifshitz, E. M.; Pitaevskii, L. P. (1981). Physical Kinetics. Course of Theoretical Physics 10 (3rd ed.). Pergamon. ISBN 0-08-026480-8.
- Waldram, J. R. (1985). The Theory of Thermodynamics. Cambridge University Press. ISBN 0-521-28796-0.
- Tolman, R. C. (1938). The Principles of Statistical Mechanics. Oxford University Press.
- Gull, S. F. (1989). "Some Misconceptions about Entropy". In Buck, B.; Macaulay, V. A. Maximum Entropy in Action. Oxford University Press (published 1991). ISBN 0-19-853963-0. Retrieved 2012-02-05.
- Reif, F. (1965). Fundamentals of Statistical and Thermal Physics. McGraw-Hill. ISBN 978-0-07-051800-1.
- Gleick, J. (2011). The Information: A History, a Theory, a Flood. Random House Digital. ISBN 978-0-375-42372-7.
- Badino, M. (2011). "Mechanistic Slumber vs. Statistical Insomnia: The early history of Boltzmann's H-theorem (1868–1877)". European Physical Journal H. Bibcode:2011EPJH...36..353B. doi:10.1140/epjh/e2011-10048-5.