|Renormalization and regularization|
In theoretical physics, the renormalization group (RG) refers to a mathematical apparatus that allows systematic investigation of the changes of a physical system as viewed at different distance scales. In particle physics, it reflects the changes in the underlying force laws (codified in a quantum field theory) as the energy scale at which physical processes occur varies, energy/momentum and resolution distance scales being effectively conjugate under the uncertainty principle (cf. Compton wavelength).
A change in scale is called a "scale transformation". The renormalization group is intimately related to "scale invariance" and "conformal invariance", symmetries in which a system appears the same at all scales (so-called self-similarity). (Note that scale transformations are a strict subset of conformal transformations, in general, the latter including additional symmetry generators associated with special conformal transformations.)
As the scale varies, it is as if one is changing the magnifying power of a notional microscope viewing the system. In so-called renormalizable theories, the system at one scale will generally be seen to consist of self-similar copies of itself when viewed at a smaller scale, with different parameters describing the components of the system. The components, or fundamental variables, may relate to atoms, elementary particles, atomic spins, etc. The parameters of the theory typically describe the interactions of the components. These may be variable "couplings" which measure the strength of various forces, or mass parameters themselves. The components themselves may appear to be composed of more of the self-same components as one goes to shorter distances.
For example, in quantum electrodynamics (QED), an electron appears to be composed of electrons, positrons (anti-electrons) and photons, as one views it at higher resolution, at very short distances. The electron at such short distances has a slightly different electric charge than does the "dressed electron" seen at large distances, and this change, or "running," in the value of the electric charge is determined by the renormalization group equation.
The idea of scale transformations and scale invariance is old in physics. Scaling arguments were commonplace for the Pythagorean school, Euclid and up to Galileo. They became popular again at the end of the 19th century, perhaps the first example being the idea of enhanced viscosity of Osborne Reynolds, as a way to explain turbulence.
The renormalization group was initially devised in particle physics, but nowadays its applications extend to solid-state physics, fluid mechanics, cosmology and even nanotechnology. An early article by Ernst Stueckelberg and André Petermann in 1953 anticipates the idea in quantum field theory. Stueckelberg and Petermann opened the field conceptually. They noted that renormalization exhibits a group of transformations which transfer quantities from the bare terms to the counterterms. They introduced a function h(e) in quantum electrodynamics (QED), which is now called the beta function (see below).
Murray Gell-Mann and Francis E. Low in 1954 restricted the idea to scale transformations in QED, which are the most physically significant, and focused on asymptotic forms of the photon propagator at high energies. They determined the variation of the electromagnetic coupling in QED, by appreciating the simplicity of the scaling structure of that theory. They thus discovered that the coupling parameter g(μ) at the energy scale μ is effectively given by the group equation
for some function G (unspecified—nowadays called Wegner's scaling function) and a constant d, in terms of the coupling g(M) at a reference scale M.
Gell-Mann and Low realized in these results that the effective scale can be arbitrarily taken as μ, and can vary to define the theory at any other scale:
The gist of the RG is this group property: as the scale μ varies, the theory presents a self-similar replica of itself, and any scale can be accessed similarly from any other scale, by group action, a formal transitive conjugacy of couplings in the mathematical sense (Schröder's equation).
On the basis of this (finite) group equation and its scaling property, Gell-Mann and Low could then focus on infinitesimal transformations, and invented a computational method based on a mathematical flow function ψ(g) = G d/(∂G/∂g) of the coupling parameter g, which they introduced. Like the function h(e) of Stueckelberg and Petermann, their function determines the differential change of the coupling g(μ) with respect to a small change in energy scale μ through a differential equation, the renormalization group equation:
The modern name is also indicated, the beta function, introduced by C. Callan and K. Symanzik in the early 1970s. Since it is a mere function of g, integration in g of a perturbative estimate of it permits specification of the renormalization trajectory of the coupling, that is, its variation with energy, effectively the function G in this perturbative approximation. The renormalization group prediction (cf Stueckelberg-Petermann and Gell-Mann-Low works) was confirmed 40 years later at the LEP accelerator experiments: the fine structure "constant" of QED was measured to be about 1/127 at energies close to 200 GeV, as opposed to the standard low-energy physics value of 1/137. (Early applications to quantum electrodynamics are discussed in the influential book of Nikolay Bogolyubov and Dmitry Shirkov in 1959.)
The renormalization group emerges from the renormalization of the quantum field variables, which normally has to address the problem of infinities in a quantum field theory (although the RG exists independently of the infinities). This problem of systematically handling the infinities of quantum field theory to obtain finite physical quantities was solved for QED by Richard Feynman, Julian Schwinger and Sin-Itiro Tomonaga, who received the 1965 Nobel prize for these contributions. They effectively devised the theory of mass and charge renormalization, in which the infinity in the momentum scale is cut off by an ultra-large regulator, Λ (which could ultimately be taken to be infinite — infinities reflect the pileup of contributions from an infinity of degrees of freedom at infinitely high energy scales.). The dependence of physical quantities, such as the electric charge or electron mass, on the scale Λ is hidden, effectively swapped for the longer-distance scales at which the physical quantities are measured, and, as a result, all observable quantities end up being finite, instead, even for an infinite Λ. Gell-Mann and Low thus realized in these results that, while, infinitesimally, a tiny change in g is provided by the above RG equation given ψ(g), the self-similarity is expressed by the fact that ψ(g) depends explicitly only upon the parameter(s) of the theory, and not upon the scale μ. Consequently, the above renormalization group equation may be solved for (G and thus) g(μ).
A deeper understanding of the physical meaning and generalization of the renormalization process, which goes beyond the dilation group of conventional renormalizable theories, came from condensed matter physics. Leo P. Kadanoff's paper in 1966 proposed the "block-spin" renormalization group. The blocking idea is a way to define the components of the theory at large distances as aggregates of components at shorter distances.
This approach covered the conceptual point and was given full computational substance in the extensive important contributions of Kenneth Wilson. The power of Wilson's ideas was demonstrated by a constructive iterative renormalization solution of a long-standing problem, the Kondo problem, in 1974, as well as the preceding seminal developments of his new method in the theory of second-order phase transitions and critical phenomena in 1971. He was awarded the Nobel prize for these decisive contributions in 1982.
Meanwhile, the RG in particle physics had been reformulated in more practical terms by C. G. Callan and K. Symanzik in 1970. The above beta function, which describes the "running of the coupling" parameter with scale, was also found to amount to the "canonical trace anomaly", which represents the quantum-mechanical breaking of scale (dilation) symmetry in a field theory. (Remarkably, quantum mechanics itself can induce mass through the trace anomaly and the running coupling.) Applications of the RG to particle physics exploded in number in the 1970s with the establishment of the Standard Model.
In 1973, it was discovered that a theory of interacting colored quarks, called quantum chromodynamics had a negative beta function. This means that an initial high-energy value of the coupling will eventuate a special value of μ at which the coupling blows up (diverges). This special value is the scale of the strong interactions, μ = ΛQCD and occurs at about 200 MeV. Conversely, the coupling becomes weak at very high energies (asymptotic freedom), and the quarks become observable as point-like particles, in deep inelastic scattering, as anticipated by Feynman-Bjorken scaling. QCD was thereby established as the quantum field theory controlling the strong interactions of particles.
Momentum space RG also became a highly developed tool in solid state physics, but its success was hindered by the extensive use of perturbation theory, which prevented the theory from reaching success in strongly correlated systems. In order to study these strongly correlated systems, variational approaches are a better alternative. During the 1980s some real-space RG techniques were developed in this sense, the most successful being the density-matrix RG (DMRG), developed by S. R. White and R. M. Noack in 1992.
The conformal symmetry is associated with the vanishing of the beta function. This can occur naturally if a coupling constant is attracted, by running, toward a fixed point at which β(g) = 0. In QCD, the fixed point occurs at short distances where g → 0 and is called a (trivial) ultraviolet fixed point. For heavy quarks, such as the top quark, it is calculated that the coupling to the mass-giving Higgs boson runs toward a fixed non-zero (non-trivial) infrared fixed point.
In string theory conformal invariance of the string world-sheet is a fundamental symmetry: β = 0 is a requirement. Here, β is a function of the geometry of the space-time in which the string moves. This determines the space-time dimensionality of the string theory and enforces Einstein's equations of general relativity on the geometry. The RG is of fundamental importance to string theory and theories of grand unification.
It is also the modern key idea underlying critical phenomena in condensed matter physics. Indeed, the RG has become one of the most important tools of modern physics. It is often used in combination with the Monte Carlo method.
This section introduces pedagogically a picture of RG which may be easiest to grasp: the block spin RG, devised by Leo P. Kadanoff in 1966.
Consider a 2D solid, a set of atoms in a perfect square array, as depicted in the figure.
Assume that atoms interact among themselves only with their nearest neighbours, and that the system is at a given temperature T. The strength of their interaction is quantified by a certain coupling J. The physics of the system will be described by a certain formula, say the hamiltonian H(T,J).
Now proceed to divide the solid into blocks of 2×2 squares; we attempt to describe the system in terms of block variables, i.e., variables which describe the average behavior of the block. Further assume that, by some lucky coincidence, the physics of block variables is described by a formula of the same kind, but with different values for T and J : H(T',J'). (This isn't exactly true, in general, but it is often a good first approximation.)
Perhaps, the initial problem was too hard to solve, since there were too many atoms. Now, in the renormalized problem we have only one fourth of them. But why stop now? Another iteration of the same kind leads to H(T",J"), and only one sixteenth of the atoms. We are increasing the observation scale with each RG step.
Of course, the best idea is to iterate until there is only one very big block. Since the number of atoms in any real sample of material is very large, this is more or less equivalent to finding the long range behaviour of the RG transformation which took (T,J) → (T',J') and (T',J')→ (T",J"). Often, when iterated many times, this RG transformation leads to a certain number of fixed points.
To be more concrete, consider a magnetic system (e.g., the Ising model), in which the J coupling denotes the trend of neighbour spins to be parallel. The configuration of the system is the result of the tradeoff between the ordering J term and the disordering effect of temperature.
For many models of this kind there are three fixed points:
- T=0 and J → ∞. This means that, at the largest size, temperature becomes unimportant, i.e., the disordering factor vanishes. Thus, in large scales, the system appears to be ordered. We are in a ferromagnetic phase.
- T → ∞ and J → 0. Exactly the opposite; here, temperature dominates, and the system is disordered at large scales.
- A nontrivial point between them, T = Tc and J = Jc. In this point, changing the scale does not change the physics, because the system is in a fractal state. It corresponds to the Curie phase transition, and is also called a critical point.
So, if we are given a certain material with given values of T and J, all we have to do in order to find out the large scale behaviour of the system is to iterate the pair until we find the corresponding fixed point.
In more technical terms, let us assume that we have a theory described by a certain function of the state variables and a certain set of coupling constants . This function may be a partition function, an action, a Hamiltonian, etc. It must contain the whole description of the physics of the system.
Now we consider a certain blocking transformation of the state variables , the number of must be lower than the number of . Now let us try to rewrite the function only in terms of the . If this is achievable by a certain change in the parameters, , then the theory is said to be renormalizable.
For some reason, most fundamental theories of physics such as quantum electrodynamics, quantum chromodynamics and electro-weak interaction, but not gravity, are exactly renormalizable. Also, most theories in condensed matter physics are approximately renormalizable, from superconductivity to fluid turbulence.
The change in the parameters is implemented by a certain beta function: , which is said to induce a renormalization flow (or RG flow) on the -space. The values of under the flow are called running couplings.
As was stated in the previous section, the most important information in the RG flow are its fixed points. The possible macroscopic states of the system, at a large scale, are given by this set of fixed points.
Since the RG transformations in such systems are lossy (i.e.: the number of variables decreases - see as an example in a different context, Lossy data compression), there need not be an inverse for a given RG transformation. Thus, in such lossy systems, the renormalization group is, in fact, a semigroup.
Relevant and irrelevant operators, universality classes
Consider a certain observable A of a physical system undergoing an RG transformation. The magnitude of the observable as the length scale of the system goes from small to large may be: (a) always increasing, (b) always decreasing or (c) other. In the first case, the observable is said to be a relevant observable; in the second, irrelevant and in the third, marginal.
A relevant observable is needed to describe the macroscopic behaviour of the system; an irrelevant observable is not. Marginal observables may or may not need be taken into account. A remarkable broad fact is that most observables are irrelevant, i.e., the macroscopic physics is dominated by only a few observables in most systems. As an example, in microscopic physics, to describe a system consisting of a mole of carbon-12 atoms we need of the order of 1023 (Avogadro's number) variables, while to describe it as a macroscopic system (12 grams of carbon-12) we only need a few.
Before Wilson's RG approach, there was an astonishing empirical fact to explain: the coincidence of the critical exponents (i.e., the exponents of the reduced-temperature dependence of several quantities near a second order phase transition) in very disparate phenomena, such as magnetic systems, superfluid transition (Lambda transition), alloy physics, etc. Thus, in general, thermodynamic features of a system near a phase transition depend only on a small number of variables, such as dimensionality and symmetry, but are insensitive to details of the underlying microscopic properties of the system.
This coincidence of critical exponents for ostensibly quite different physical systems is called universality−−and is now successfully explained by the RG: essentially by showing that the differences among all such phenomena are, in fact, traceable to such irrelevant observables, while the relevant observables are shared in common.
Renormalization groups, in practice, come in two main "flavours". The Kadanoff picture explained above refers mainly to the so-called real-space RG.
Momentum-space RG on the other hand, has a longer history despite its relative subtlety. It can be used for systems where the degrees of freedom can be cast in terms of the Fourier modes of a given field. The RG transformation proceeds by integrating out a certain set of high-momentum (large-wavenumber) modes. Since large wavenumbers are related to short-length scales, the momentum-space RG results in an essentially analogous coarse-graining effect as with real-space RG.
Momentum-space RG is usually performed on a perturbation expansion. The validity of such an expansion is predicated upon the actual physics of a system being close to that of a free field system. In this case, one may calculate observables by summing the leading terms in the expansion. This approach has proved successful for many theories, including most of particle physics, but fails for systems whose physics is very far from any free system, i.e., systems with strong correlations.
As an example of the physical meaning of RG in particle physics, consider an overview of charge renormalization in quantum electrodynamics (QED). Suppose we have a point positive charge of a certain true (or bare) magnitude. The electromagnetic field around it has a certain energy, and thus may produce some pairs of (e.g.) electrons-positrons, which will be annihilated very quickly. But, in their short life, the electron will be attracted by the charge, and the positron will be repelled. Since this happens continuously, these pairs are effectively screening the charge from abroad. Thus, the measured strength of the charge will depend on how close to our probes it may enter. Hence a dependence of a certain coupling constant (here, the electric charge) with distance scale.
Momentum and length scales are related inversely, according to the de Broglie relation: the higher the energy or momentum scale we may reach, the lower the length scale we may probe and resolve. Therefore, the momentum-space RG practitioners sometimes declaim to integrate out high momenta or high energy from their theories.
Exact renormalization group equations
An exact renormalization group equation (ERGE) is one that takes irrelevant couplings into account. There are several formulations.
The Wilson ERGE is the simplest conceptually, but is practically impossible to implement. Fourier transform into momentum space after Wick rotating into Euclidean space. Insist upon a hard momentum cutoff, p2 ≤ Λ2 so that the only degrees of freedom are those with momenta less than Λ. The partition function is
For any positive Λ' less than Λ, define SΛ' (a functional over field configurations φ whose Fourier transform has momentum support within p2 ≤ Λ' 2) as
In fact, this transformation is transitive. If you compute SΛ′ from SΛ and then compute SΛ″ from SΛ′, this gives you the same Wilsonian action as computing SΛ″ directly from SΛ.
The Polchinski ERGE involves a smooth UV regulator cutoff. Basically, the idea is an improvement over the Wilson ERGE. Instead of a sharp momentum cutoff, it uses a smooth cutoff. Essentially, we suppress contributions from momenta greater than Λ heavily. The smoothness of the cutoff, however, allows us to derive a functional differential equation in the cutoff scale Λ. As in Wilson's approach, we have a different action functional for each cutoff energy scale Λ. Each of these actions are supposed to describe exactly the same model which means that their partition functionals have to match exactly.
In other words, (for a real scalar field; generalizations to other fields are obvious),
and ZΛ is really independent of Λ! We have used the condensed deWitt notation here. We have also split the bare action SΛ into a quadratic kinetic part and an interacting part Sint Λ. This split most certainly isn't clean. The "interacting" part can very well also contain quadratic kinetic terms. In fact, if there is any wave function renormalization, it most certainly will. This can be somewhat reduced by introducing field rescalings. RΛ is a function of the momentum p and the second term in the exponent is
When , RΛ(p)/p2 is essentially 1. When , RΛ(p)/p2 becomes very very huge and approaches infinity. RΛ(p)/p2 is always greater than or equal to 1 and is smooth. Basically, this leaves the fluctuations with momenta less than the cutoff Λ unaffected but heavily suppresses contributions from fluctuations with momenta greater than the cutoff. This is obviously a huge improvement over Wilson.
The condition that
can be satisfied by (but not only by)
The effective average action ERGE involves a smooth IR regulator cutoff. The idea is to take all fluctuations right up to an IR scale k into account. The effective average action will be accurate for fluctuations with momenta larger than k. As the parameter k is lowered, the effective average action approaches the effective action which includes all quantum and classical fluctuations. In contrast, for large k the effective average action is close to the "bare action". So, the effective average action interpolates between the "bare action" and the effective action.
For a real scalar field, one adds an IR cutoff
to the action S, where Rk is a function of both k and p such that for , Rk(p) is very tiny and approaches 0 and for , . Rk is both smooth and nonnegative. Its large value for small momenta leads to a suppression of their contribution to the partition function which is effectively the same thing as neglecting large scale fluctuations.
One can use the condensed deWitt notation
for this IR regulator.
where J is the source field. The Legendre transform of Wk ordinarily gives the effective action. However, the action that we started off with is really S[φ]+1/2 φ⋅Rk⋅φ and so, to get the effective average action, we subtract off 1/2 φ⋅Rk⋅φ. In other words,
can be inverted to give Jk[φ] and we define the effective average action Γk as
is the ERGE which is also known as the Wetterich equation.
As there are infinitely many choices of Rk, there are also infinitely many different interpolating ERGEs. Generalization to other fields like spinorial fields is straightforward.
Although the Polchinski ERGE and the effective average action ERGE look similar, they are based upon very different philosophies. In the effective average action ERGE, the bare action is left unchanged (and the UV cutoff scale—if there is one—is also left unchanged) but the IR contributions to the effective action are suppressed whereas in the Polchinski ERGE, the QFT is fixed once and for all but the "bare action" is varied at different energy scales to reproduce the prespecified model. Polchinski's version is certainly much closer to Wilson's idea in spirit. Note that one uses "bare actions" whereas the other uses effective (average) actions.
- Quantum triviality
- Renormalization with reference to perturbation theory, associated to momentum-space RG.
- Scale invariance
- Schröder's equation
- Regularization (physics)
- Density matrix renormalization group
- Functional renormalization group
- Critical phenomena
- Universality (dynamical systems)
- Pedagogical site on Scaling
- Stueckelberg, E.C.G. and André Petermann (1953). "La renormalisation des constants dans la theorie de quanta", Helv. Phys. Acta 26, 499.
- Gell-Mann, M.; Low, F.E. (1954). "Quantum Electrodynamics at Small Distances". Physical Review. 95 (5): 1300–1312. Bibcode:1954PhRv...95.1300G. doi:10.1103/PhysRev.95.1300.
- Curtright, T.L.; Zachos, C.K. (March 2011). "Renormalization Group Functional Equations". Physical Review D. 83 (6): 065019. arXiv:. Bibcode:2011PhRvD..83f5019C. doi:10.1103/PhysRevD.83.065019.
- N.N. Bogoliubov, D.V. Shirkov (1959): The Theory of Quantized Fields. New York, Interscience.
- L.P. Kadanoff (1966): "Scaling laws for Ising models near ", Physics 2, 263.
- K.G. Wilson(1975): The renormalization group: critical phenomena and the Kondo problem, Rev. Mod. Phys. 47, 4, 773.
- Callan, C. (1970). "Broken Scale Invariance in Scalar Field Theory". Physical Review D. 2 (8): 1541. Bibcode:1970PhRvD...2.1541C. doi:10.1103/PhysRevD.2.1541.; Symanzik, K. (1970). "Small distance behaviour in field theory and power counting". Communications in Mathematical Physics. 18 (3): 227–246. Bibcode:1970CMaPh..18..227S. doi:10.1007/BF01649434.
- Shankar, R. (1994). "Renormalization-group approach to interacting fermions". Reviews of Modern Physics. 66: 129. arXiv:. Bibcode:1994RvMP...66..129S. doi:10.1103/RevModPhys.66.129. (For nonsubscribers see arXiv:cond-mat/9307009.)
- L.Ts. Adzhemyan, T.L. Kim, M.V. Kompaniets, V.K. Sazonov, Renormalization group in the infinite-dimensional turbulence: determination of the RG-functions without renormalization constants, Nanosystems: physics, chemistry, mathematics August 2015, Volume 6, Issue 4, pp 461
- Callaway, David J.E.; Petronzio, Roberto (1984). "Determination of critical points and flow diagrams by Monte Carlo renormalization group methods". Physics Letters B. 139 (3): 189–194. Bibcode:1984PhLB..139..189C. doi:10.1016/0370-2693(84)91242-5. ISSN 0370-2693.
- A superb, albeit technical, exposition by J. Zinn-Justin (2010) is the classic Scholarpedia article "Critical Phenomena: field theoretical approach". For example, for Ising-like systems with a ℤ2 symmetry or, more generally, for models with an O(N) symmetry, the Gaussian (free) fixed point is long-distance stable above space dimension four, marginally stable in dimension four, and unstable below dimension four. See Quantum triviality.
- Michael Fisher (1974), "The renormalization group in the theory of critical behavior", Rev. Mod. Phys. 46, 597.
Pedagogical and historical reviews
- S.R. White (1992): Density matrix formulation for quantum renormalization groups, Phys. Rev. Lett. 69, 2863. The most successful variational RG method.
- N. Goldenfeld (1993): Lectures on phase transitions and the renormalization group. Addison-Wesley.
- D.V. Shirkov (1999): Evolution of the Bogoliubov Renormalization Group. arXiv.org:hep-th/9909024. A mathematical introduction and historical overview with a stress on group theory and the application in high-energy physics.
- B. Delamotte (2004): A hint of renormalization. American Journal of Physics, Vol. 72, No. 2, pp. 170\u2013184, February 2004. A pedestrian introduction to renormalization and the renormalization group. For nonsubscribers see arXiv.org:hep-th/0212049
- H.J. Maris, L.P. Kadanoff (1978): Teaching the renormalization group. American Journal of Physics, June 1978, Volume 46, Issue 6, pp. 652-657. A pedestrian introduction to the renormalization group as applied in condensed matter physics.
- K. Huang (2013): A Critical History of Renormalization. arXiv:1310.5533
- Shirkov, D. V. (2001-08-31). "Fifty years of the renormalization group". CERN Courier. Retrieved 2008-11-12.
- T. D. Lee; Particle physics and introduction to field theory, Harwood academic publishers, 1981, [ISBN 3-7186-0033-1]. Contains a Concise, simple, and trenchant summary of the group structure, in whose discovery he was also involved, as acknowledged in Gell-Mann and Low's paper.
- L.Ts.Adzhemyan, N.V.Antonov and A.N.Vasiliev; The Field Theoretic Renormalization Group in Fully Developed Turbulence; Gordon and Breach, 1999. [ISBN 90-5699-145-0].
- Vasil'ev, A.N.; The field theoretic renormalization group in critical behavior theory and stochastic dynamics; Chapman & Hall/CRC, 2004. [ISBN 9780415310024] (Self-contained treatment of renormalization group applications with complete computations);
- Zinn-Justin, Jean (2002). Quantum field theory and critical phenomena, Oxford, Clarendon Press (2002), ISBN 0-19-850923-5 (an exceptionally solid and thorough treatise on both topics);
- Zinn-Justin, Jean: Renormalization and renormalization group: From the discovery of UV divergences to the concept of effective field theories, in: de Witt-Morette C., Zuber J.-B. (eds), Proceedings of the NATO ASI on Quantum Field Theory: Perspective and Prospective, June 15–26, 1998, Les Houches, France, Kluwer Academic Publishers, NATO ASI Series C 530, 375-388 (1999) [ISBN ]. Full text available in PostScript.
- Kleinert, H. and Schulte Frohlinde, V; Critical Properties of φ4-Theories, World Scientific (Singapore, 2001); Paperback ISBN 981-02-4658-7. Full text available in PDF.