Ising model

From Wikinfo

Jump to: navigation, search


Search for "Ising_model" on Wikipedia  • Wikimedia Commons • Wiktionary • Wikiquote • Wikibooks • Mediawiki Wikia • Wikitravel • Google Advanced Search • Yahoo Advanced Search • WorldCat Advanced Search • Amazon • Recent NY Times • Older NY Times.

For criticism see Criticism of Ising_model

The Ising model, named after the physicist Ernst Ising, is a mathematical model in statistical mechanics. It has since been used to model diverse phenomena in which bits of information, interacting in pairs, produce collective effects.

Contents

Definition

The Ising model is defined on a discrete collection of variables called spins, which can take on the value 1 or −1. The spins <math>S_i</math> interact in pairs, with energy that has one value when the two spins are the same, and a second value when the two spins are different.

Energy function

The energy of the Ising model is defined to be:

<math>

E = - \sum_{ij} J_{ij} S_i S_j \,</math>

where the sum counts each pair of spins only once. Notice that the product of spins is either +1 if the two spins are the same (aligned), or −1 if they are different (anti-aligned). J is half the difference in energy between the two possibilities. Magnetic interactions seek to align spins relative to one another. Spins become randomized when thermal energy is greater than the strength of the interaction.

For each pair, if

<math>J_{ij} > 0 </math> the interaction is called ferromagnetic
<math>J_{ij} < 0 </math> the interaction is called antiferromagnetic
<math>J_{ij} = 0 </math> the spins are noninteracting

A ferromagnetic interaction tends to align spins, and an antiferromagnetic tends to antialign them.

The spins can be thought of as living on a graph, where each node has exactly one spin, and each edge connects two spins with a nonzero value of J. If all the Js are equal, it is convenient to measure energy in units of J. Then a model is completely specified by the graph and the sign of J.

Simple examples

The antiferromagnetic one-dimensional Ising model has the energy function:

<math>

E = \sum_{i} S_{i} S_{i+1} \,</math> where i runs over all the integers. This links each pair of nearest neighbors.

The ferromagnetic two-dimensional Ising model on a square lattice is a collection of spins <math>S_{i,j}</math> on each node (i,j) of a two dimensional square lattice and the Energy is:

<math>

E= - \sum_{ij} (S_{i,j} S_{i,j+1} + S_{i,j} S_{i+1,j}). \,</math>

Notice that the sum links every site to its right-neighbor and its down-neighbor. In this way, every edge is only counted once.

The mean field Ising model is the Ising model on a complete graph, where all the nodes are connected to all the other nodes:

<math>

E = - \sum_{i<j} S_i S_j \,.</math>

Magnetic field

The energy of the Ising model may be modified to bias the entire system. Normally, an Ising model is completely symmetric under interchange of + and −. A magnetic field <math> h_i </math> may be added to the energy, and it breaks the symmetry, The full energy function is:

<math>

E= - \frac{1}{2} \sum_{\langle i,j\rangle} J_{ij} S_i S_j - \sum_i h_i S_i \,</math>

where the brackets indicate that i and j index neighboring positions on the graph.

Statistics

The model is a statistical model, so the energy is really the logarithm of the probability. The probability of each configuration of spins is the Boltzmann distribution with inverse temperature β.

<math>

P(S) \propto e^{-\beta E} \,.</math>

To actually generate configurations using this probability distribution is conceptually easiest using the Metropolis Algorithm:

  1. Pick a spin at random and calculate the contribution to the energy involving this spin.
  2. flip the value of the spin and calculate the new contribution.
  3. if the new energy is less, keep the flipped value.
  4. if the new energy is more, only keep with probability <math>e^{-\beta \Delta E}</math>
  5. Repeat

The change in energy <math>\Delta E</math> only depends on the value of the spin and its nearest graph neighbors. So if the graph is not too connected, the algorithm is fast. This process will eventually produce a pick from the distribution.

Questions

The interesting statistical questions to ask are all in the limit of large numbers of spins:

  1. In a typical configuration, are most of the spins +1 or −1, or are they split equally?
  2. If a spin at any given position is 1, what is the probability that the spin at position j is also 1?
  3. If β is changed, is there a phase transition?
  4. If J is random, how many different configurations are there at any given inverse temperature?
  5. On a lattice, what is the fractal dimension of the shape of a large cluster of +1 spins?

General discussion

File:Ising-tartan.png
Visualization of the translation-invariant probability measure of the one-dimensional Ising model.

In his 1925 PhD thesis, Ising solved the model for the 1D case. In one dimension, the solution admits no phase transition. On the basis of this result, he incorrectly concluded that his model does not exhibit phase behaviour in any dimension.

Most numerical solutions use the Metropolis-Hastings algorithm run inside a Monte Carlo loop. Depending on the complexity only adjacent vertices can be taken into account or for long-range models other vertices can be included.

The Ising model undergoes a phase transition between an ordered and a disordered phase in 2 dimensions or more. In 2 dimensions, the Ising model has a strong/weak duality (between high temperatures and low ones) called the Kramers-Wannier duality. The fixed point of this duality is at the second-order phase transition temperature.

While the Ising model is a simplified microscopic description of ferromagnetism, it is still extremely important because of the universality of the continuum limit. Universality means that the fluctuations near the phase transition are described by a continuum field with a free energy or Lagrangian which is a function of the field values. Just as there are many ways to discretize a differential equation, all of which give the same answer when the lattice spacing is small, there are many different discrete models that have the exact same critical behavior, because they have the same continuum limit.

The experimentally observed critical fluctuations of ferromagnets near the Curie point and of fluids at the vapor/liquid critical point are described exactly by the critical fluctuations of the Ising model. The same is true for the simplest statistical models in three dimensions whose fluctuations can be described by a single scalar field, the local magnetization in a near-critical magnet or the local density in a near-critical fluid. All these systems have fluctuating clusters whose fractal scaling laws and long distance correlation functions are quantitatively predicted by the model.

Apart from the continuum limit, many discrete systems can be mapped exactly or approximately to the Ising system. The grand canonical ensemble formulation of the lattice gas model, for example, can be mapped exactly to the canonical ensemble formulation of the Ising model. The mapping allows one to exploit simulation and analytical results of the Ising model to answer questions about the related models.

The Ising model on a two dimensional square lattice with no magnetic field was analytically solved in 1944 by Lars Onsager. Onsager showed that the correlation functions and free energy of the Ising model are determined by a noninteracting lattice fermion. The 3D Ising model does not have a representation in terms of free fields.

In 2000, Sorin Istrail established that computing the free energy of the Ising model on an arbitrary sublattice of a three dimensional square lattice is computationally intractable.[1] While this means that it is impossible to efficiently compute all possible thermodynamic quantities with arbitrary external fields, it does not mean that the critical exponents or spin-spin correlations cannot be computed near criticality. In particular, Istrail's proof is valid in four or higher dimensions, where the model is also exactly solvable near criticality, since its correlation functions at long distances are those of a free scalar field.

The Ising model can be thought of as a Markov random field on a square grid, where the maximal graph cliques are edges (i.e. pairs of neighboring verticies).

Historical significance

One of Democritus' arguments in support of atomism was that atoms naturally explain the sharp phase boundaries observed in materials, like when ice melts to water or water turns to steam. His idea was that small changes in the atomic scale properties would lead to big changes in their aggregate statistical behavior. Others believed that matter is inherently continuous, not atomic, and that the large-scale properties of matter are not reducible to basic atomic properties.

While the laws of chemical binding made it clear to nineteenth century chemists that atoms were real, among physicists the debate continued well into the early twentieth century. Atomists, notably James Clerk Maxwell and Ludwig Boltzmann, applied Hamilton's formulation of Newton's laws to large systems, and found that the statistical behavior of the atoms correctly describe room temperature gases. But classical statistical mechanics did not account for all of the properties of liquids and solids, nor of gases at low temperature.

Once modern quantum mechanics was formulated, atomism was no longer in conflict with experiment, but this did not lead to a universal acceptance of statistical mechanics, which went beyond atomism. Josiah Willard Gibbs had given a complete formalism to reproduce the laws of thermodynamics from the laws of mechanics. But from the 19th century, when statistical mechanics was considered dubious, many faulty arguments survived. The lapses in intuition mostly stemmed from the fact that the limit of an infinite statistical system has many zero-one laws which are absent in finite systems: an infinitesimal change in a parameter can lead to big differences in the statistical behavior, as Democritus expected.

No phase transitions in finite volume

In the early part of the twentieth century, some believed that the partition function could never describe a phase transition, based on the following argument:

  1. The partition function is a sum of <math>\scriptstyle e^{-\beta E}</math> over all configurations.
  2. the exponential function is everywhere analytic as a function <math>\beta</math>.
  3. the sum of analytic things is analytic.

But the logarithm of the partition function is not analytic as a function of the temperature near a phase transition, so the theory doesn't work.

This argument works for a finite sum of exponentials, and correctly establishes that there are no singularities in the free energy of a system of a finite size. For systems which in the thermodynamic limit, for infinite systems, the infinite sum can lead to singularities. The convergence to the thermodynamic limit is fast, so that the phase behavior is apparent already on a relatively small lattice, even though the singularities are smoothed out by the system's finite size.

This was first established by Rudolph Peierls in the Ising model.

Peierls droplets

Shortly after Lenz and Ising constructed the Ising model, Peierls was able to explicitly show that a phase transition occurs in two dimensions.

To do this, he compared the high-temperature and low temperature limits. At infinite temperature, zero beta, all configurations have equal probability. Each spin is completely independent of any other, and if typical configurations at infinite temperature are plotted so that plus/minus are represented by black and white, they look like television snow. For high, but not infinite temperature, there are small correlations between neighboring positions, the snow tends to clump a little bit, but the screen stays random looking, and there is no net excess of black or white.

A quantitative measure of the excess is the magnetization, which is the average value of the spin:

<math>

M= {1\over N} \sum_{i=0}^{N-1} S_i </math>

A bogus argument analogous to the argument in the last section now establishes that the magnetization in the Ising model is always zero.

  1. Every configurations of spin has equal energy to the configuration with all spins flipped.
  2. So for every configuration with magnetization M there is a configuration with magnetization -M with equal probability
  3. So the magnetization is zero.

As before, this only proves that the magnetization is zero at any finite volume. For an infinite system, fluctuations might not be able to push the system from a mostly-plus state to a mostly minus with any nonzero probability.

For very high temperatures, the magnetization is zero, as it is at infinite temperature. To see this, note that if spin A has only a small correlation <math>\epsilon</math> with spin B, and B is only weakly correlated with C, but C is otherwise independent of A, the amount of correlation of A and C goes like <math>\epsilon^2</math>. For two spins separated by distance L, the amount of correlation goes as <math>\epsilon^L</math> but if there is more than one path by which the correlations can travel, this amount is enhanced by the number of paths.

The number of paths of length L on a square lattice in d dimensions:

<math>

N(L) = (2d)^L \,</math> since there are 2d choices for where to go at each step.

A bound on the total correlation is given by the contribution to the correlation by summing over all paths linking two points, which is bounded above by the sum over all lengths paths of length L divided by the :

<math>

\sum_L (2d)^L (\epsilon)^L \,</math> which goes to zero when <math>\epsilon</math> is small.

At low temperatures, infinite beta, the configurations are near the lowest energy configuration, the one where all the spins are plus or all the spins are minus. Peierls asked whether it is statistically possible at low temperature, starting with all the spins minus, to fluctuate to a state where most of the spins are plus. For this to happen, droplets of plus spin must be able to congeal to make the plus state.

The energy of a droplet of plus spins in a minus background is proportional to the perimeter of the droplet L, where plus spins and minus spins neighbor each other. For a droplet with perimeter L, the area is somewhere between <math>(L-2)/2</math> (the straight line) and <math>(L/4)^2</math>(the square box). The probability cost for introducing a droplet is the factor:

<math>

e^{-\beta L} \,</math> but this contributes to the partition function multiplied by the total number of droplets with perimeter L, which is less than the total number of paths of length L:

<math>

N(L)< 4^{2L} \,</math> So that the total spin contribution from droplets, even overcounting by allowing each site to have a separate droplet, is bounded above by:

<math>

\sum_L L^2 4^{-2L} e^{-4\beta L} \,</math>

which goes to zero at large <math>\beta</math>. For <math>\beta</math> sufficiently large, this exponentially suppresses long loops, so that they cannot occur, and the magnetization never fluctuates too far from −1.

So Peierls established that the magnetization in the Ising model eventually defines superselection sectors, separated domains which are not linked by finite fluctuations.

Kramers-Wannier duality

Kramers and Wannier were able to show that the High temperature expansion and the Low temperature expansion of the model are equal up to an overall rescaling of the free energy. This allowed the phase transition point in the two-dimensional model to be determined exactly (under the assumption that there is a unique critical point).

Yang Lee zeros

After Onsager's solution, Yang and Lee investigated the way in which the partition function becomes singular as the temperature approaches the critical temperature.

Applications

Magnetism

The original motivation for the model was the phenomenon of ferromagnetism. Iron is magnetic; once it is magnetized it stays magnetized for a long time compared to any atomic time.

Since the 19th century, it was clear that magnetic fields are due to currents in matter, and Ampère postulated that permanent magnets are caused by permanent atomic currents. The motion of classical charged particles could not explain permanent currents though, as shown by Larmor. In order to have ferromagnetism, the atoms must have permanent magnetic moments which are not due to the motion of classical charges.

Once the electron's spin was discovered, it was clear that the magnetism should be due to a large number of electrons spinning in the same direction. It was natural to ask how the electrons all know which direction to spin, because the electrons on one side of a magnet don't directly interact with the electrons on the other side. They can only influence their neighbors. The Ising model was designed to investigate whether a large fraction of the electrons could be made to spin in the same direction using only local forces.

Lattice gas

The Ising model can be reinterpreted as a statistical model for the motion of atoms. Since the kinetic energy doesn't depend on the position only on the momentum, the statistics of the positions only depends on the potential energy, the thermodynamics of the gas only depends on the potential energy for each configuration of atoms.

A coarse model is to make space-time a lattice and imagine that each position either contains an atom or it doesn't. The space of configuration is that of independent bits <math>B_i</math>, where each bit is either 0 or 1 depending on whether the position is occupied or not. An attractive interaction reduces the energy of two nearby atoms. If the attraction is only between nearest neighbors, the energy is reduced by <math>- 4J B_i B_j</math> for each occupied neighboring pair.

The density of the atoms can be controlled by adding a chemical potential, which is a multiplicative probability cost for adding one more atom. A multiplicative factor in probability can be reinterpreted as an additive term in the logarithm – the energy. The extra energy of a configuration with N atoms is changed by <math>\mu N</math>. The probability cost of one more atom is a factor of <math>\exp{(-\beta\mu)}</math>.

So the energy of the lattice gas is:

<math>

E = - \frac{1}{2} \sum_{\langle i,j \rangle} 4 J B_i B_j + \sum_i \mu B_i \,</math>

Rewriting the bits in terms of spins, <math>B_i = (S_i + 1)/2 </math>.

<math>

E = - \frac{1}{2} \sum_{\langle i,j \rangle} J S_i S_j - \frac{1}{2} \sum_i (4 J - \mu) S_i \,</math>

For lattices where every site has an equal number of neighbors, this is the Ising model with a magnetic field <math> h = (z J - \mu)/2 </math>, where <math>z</math> is the number of neighbors.

Pairwise correlated bits

The activity of neurons in the brain can be modelled statistically. Each neuron at any time is either active + or inactive −. The active neurons are those that send an action potential down the axon in any given time window, and the inactive ones are those that do not. Because the neural activity at any one time is modelled by independent bits, Hopfield suggested that a dynamical Ising model would provide a first approximation to a neural network which is capable of learning.[2]

Following the general approach of Jaynes [3][4], a recent interpretation of Schneidman, Berry, Segev and Bialek [5], is that the Ising model is useful for any model of neural function, because a statistical model for neural activity should be chosen using the principle of maximum entropy. Given a collection of neurons, a statistical model which can reproduce the average firing rate for each neuron introduces a Lagrange multiplier for each neuron:

<math>

E = - \sum_i h_i S_i </math> But the activity of each neuron in this model is statistically independent. To allow for pair correlations, when one neuron tends to fire (or not to fire) along with another, introduce pair-wise lagrange multipliers:

<math>

E= - \frac{1}{2} \sum_{ij} J_{ij} S_i S_j - \sum_i h_i S_i </math> This energy function only introduces probability biases for a spin having a value and for a pair of spins having the same value. Higher order correlations are unconstrained by the multipliers. An activity pattern sampled from this distribution requires the largest number of bits to store in a computer, in the most efficient coding scheme imaginable, as compared with any other distribution with the same average activity and pairwise correlations. This means that Ising models are relevant to any system which is described by bits which are as random as possible, with constraints on the pairwise correlations and the average number of 1s, which frequently occurs in both the physical and social sciences.

Critical behavior

One dimension – independent spin flips

The energy of the one dimensional ferromagnetic Ising model is:

<math>

- \sum_i S_i S_{i+1} \,.</math>

Where <math>i</math> runs from <math>0</math> to <math>L</math>, the length of the line. The energy of the lowest state is <math>-L</math>, when all the spins are the same. For any other configuration, the extra energy is equal to the number of sign changes as you scan the configuration from left to right.

If we designate the number of sign changes in a configuration as <math>k</math>, the difference in energy from the lowest energy state is <math>2k</math>. Since the energy is additive in the number of flips, the probability <math>p</math> of having a spin-flip at each position is independent. The ratio of the probability of finding a flip to the probability of not finding one is the Boltzmann factor:

<math>

{p \over 1-p} = e^{-2\beta} \,.</math>

The problem is reduced to independent biased coin tosses. This essentially completes the mathematical description.

From the description in terms of independent tosses, the statistics of the model for long lines can be understood. The line splits into domains. Each domain is of average length <math>\exp(2\beta)</math>. The length of a domain is distributed exponentially, since there is a constant probability at any step of encountering a flip. The domains never become infinite, so a long system is never magnetized. Each step reduces the correlation between a spin and its neighbor by an amount proportional to <math>p</math>, so the correlations fall off exponentially.

<math>

\langle S_i S_j \rangle \,\propto\, e^{-p|i-j|} \,.</math>

The partition function is the volume of configurations, each configuration weighted by its Boltzmann weight. Since each configuration is described by a the sign-changes, the Partition function factorizes:

<math>

Z = \sum_{\mathrm{configs}} e^{\sum_k S_i} = \prod_k (1 + p ) = (1+p)^L \,.</math>

The logarithm divided by <math>L</math> is the free energy density:

<math>

\beta f = \log(1+p) = \log\left( 1 + {e^{-2\beta}\over 1+e^{-2\beta}} \right) \,.</math>

which is analytic away from <math>\beta=\infty</math>. A sign of a phase transition is a non-analytic free energy, so the one dimensional model does not have a phase transition.

Infinite dimensions – mean field

The behavior of an Ising model on a fully connected graph may be completely understood by mean field theory. This type of description is appropriate to very high dimensional square lattices, because then each site has a very large number of neighbors.

The idea is that if each spin is connected to a large number of spins, only the average number of + spins to − spins is important, since the fluctuations about this mean will be small. The mean field H is the average fraction of spins which are + minus the average fraction of spins which are -. The energy cost of flipping a single spin in the mean field H is <math> \pm</math> 2 JNH. It is convenient to redefine J to absorb the factor N, so that the limit <math>\scriptstyle N\rightarrow \infty</math> is smooth. In terms of the new J, the energy cost for flipping a spin is <math>\pm</math> 2 JH.

This energy cost gives the ratio of probability p that the spin is + to the probability 1 − p that the spin is −. This ratio is the Boltzmann factor.

<math>

{p\over 1-p} = e^{-2\beta JH} \,</math>

so that

<math>

p = {1 \over 1 + e^{-2\beta JH} } \,</math>

The mean value of the spin is given by averaging 1 and −1 with the weights p and 1 − p, so the mean value is 2p − 1. But this average is the same for all spins, and is therefore equal to H: <math> H = 2p - 1 = { 1 - e^{-2\beta JH} \over 1 + e^{-2\beta JH}} = \tanh (\beta JH) \,</math>

The solutions to this equation are the possible consistent mean fields. For <math>\beta J <1 </math> there is only the one solution at H = 0. For bigger values of β there are three solutions, and the solution at H = 0 is unstable.

The instability means that increasing the mean field above zero a little bit produces a statistical fraction of spins which are + which is bigger than the value of the mean field. So a mean field which fluctuates above zero will produce an even greater mean field, and will eventually settle at the stable solution. This means that for temperatures below the critical value <math>\beta J=1 </math> the mean field Ising model undergoes a phase transition in the limit of large N.

Above the critical temperature, fluctuations in H are damped because the mean field restores the fluctuation to zero field. Below the critical temperature, the mean field is driven to a new equilibrium value, which is either the positive H or negative H solution to the equation.

For <math>\beta J = 1+\varepsilon</math>, just below the critical temperature, the value of H can be calculated from the Taylor expansion of the Hyperbolic tangent:

<math>

H = \tanh(\beta J H) = (1+\epsilon)H - {(1+\epsilon)^3H^3\over 3} \,</math>

dividing by H to discard the unstable solution at H = 0, the stable solutions are:

<math>

H = \sqrt{3\varepsilon} \,</math>

The spontaneous magnetization H grows near the critical point as the square root of the change in temperature. This is true whenever H can be calculated from the solution of an analytic equation which is symmetric between positive and negative values, which led Landau to suspect that all Ising type phase transitions in all dimensions should follow this law.

The mean field exponent is universal because changes in the character of solutions of analytic equations are always described by catastrophes in the Taylor series, which is a polynomial equation. By symmetry, the equation for H must only have odd powers of H on the right hand side. Changing β should only smoothly change the coefficients. The transition happens when the coefficient of H on the right hand side is 1. Near the transition:

<math>

H = {\partial (\beta F) \over \partial h} = (1+A\epsilon) H + B H^3 + \cdots \,</math>

Whatever A and B are, so long as neither of them is tuned to zero, the sponetaneous magnetization will grow as the square root of ε. This argument can only fail if the free energy <math>\beta F</math> is either non-analytic or non-generic at the exact β where the transition occurs.

But the spontaneous magnetization in magnetic systems and the density in gasses near the critical point are measured very accuratedly. The density and the magnetization in three dimensions have the same power-law dependence on the temperature near the critical point, but the behavior from experiments is:

<math>

H \propto \epsilon^{0.308} \,</math>

The exponent is also universal, it is the same in the Ising model as in the experimental magnet and gas, but it is not equal to the mean field value. This was a great surprise.

This is also true in two dimensions, where

<math>

H\propto \varepsilon^{0.125} \,</math>

But there it was not a surprise, because it was predicted by Onsager.

Two dimensions – Onsager's solution

The partition function of the Ising model in two dimensions on a square lattice can be mapped to a two-dimensional free fermion. This allows the specific heat to be calculated exactly. Onsager obtained the following analytical expression for the magnetization as a function of temperature:

<math>

M = \left(1-\left[\sinh\left(\log(1+\sqrt{2})\frac{T_c}{T}\right)\right]^{-4}\right)^{\frac{1}{8}} \,</math>

Transfer matrix

Start with an analogy with quantum mechanics. The Ising model on a long periodic lattice has a partition function

<math>

\sum_S \exp\biggl(\sum_{ij} S_{i,j} S_{i,j+1} + S_{i,j} S_{i+1,j}\biggr) \,</math>

Think of the i direction as space, and the j direction as time. This is an independent sum over all the values that the spins can take at each time slice. This is a type of path integral, it is the sum over all spin histories.

A path integral can be rewritten as a Hamiltonian evolution. The Hamiltonian steps through time by performing a unitary rotation between time <math>t</math> and time <math>t+\Delta t</math>:

<math> U = e^{i H \Delta t}

\,</math>

The product of the U matrices, one after the other, is the total time evolution operator, which is the path integral we started with.

<math> U^N = (e^{i H \Delta t})^N = \int DX e^{iL}

\,</math>

where N is the number of time slices. The sum over all paths is given by a product of matrices, each matrix element is the transition probability from one slice to the next.

Similarly, one can divide the sum over all partition function configurations into slices, where each slice is the one-dimensional configuration at time 1. This defines the transfer matrix:

<math>

T_{C_1 C_2} \,</math>

The configurations in each slice is a one dimensional collection of spins. At each time slice, T has matrix elements between two configurations of spins, one in the immediate future and one in the immediate past. These two configurations are C1 and C2, and they are all one dimensional spin configurations. We can think of the vector space that T acts on as all complex linear combinations of these. Using quantum mechanical notation:

<math>

|A\rangle = \sum_S A(S) |S\rangle \,</math>

where each basis vector <math>|S\rangle</math> is a spin configuration of a one-dimensional Ising model.

Like the Hamiltonian, the transfer matrix acts on all linear combinations of states. The partition function is a matrix function of T, which is defined by the sum over all histories which come back to the original configuration after N steps:

<math>

Z= \mathrm{tr}(T^N) \,</math>

Since this is a matrix equation, it can be evaluated in any basis. So if we can diagonalize the matrix T, we can find Z.

T in terms of Pauli matrices

The contribution to the partition function for each past/future pair of configurations on a slice is the sum of two terms. There is the number of spin flips in the past slice and there is the number of spin flips between the past and future slice. Define an operator on configurations which flips the spin at site i:

<math>

\sigma^x_i \,</math>

In the usual Ising basis, acting on any linear combination of past configurations, it produces the same linear combination but with the spin at position i of each basis vector flipped.

Define a second operator which multiplies the basis vector by +1 and −1 according to the spin at position i:

<math>

\sigma^z_i \,</math>

T can be written in terms of these:

<math>

\sum_i A \sigma^x_i + B \sigma^z_i \sigma^z_{i+1} \,</math>

where A and B are constants which are to be determined so as to reproduce the partition function. The interpretation is that the statistical configuration at this slice contributes according to both the number of spin flips in the slice, and whether or not the spin at position i' has flipped.

Spin flip creation and annihilation operators

Just as in the one dimensional case, we will shift attention from the spins to the spin-flips. The <math>\sigma_z</math> term in T counts the number of spin flips, which we can write in terms of spin-flip creation and annihilation operators:

<math> \sum C \psi^\dagger_i \psi_i \,</math>

The first term flips a spin, so depending on the basis state it either:

  1. moves a spin-flip one unit to the right
  2. moves a spin-flip one unit to the left
  3. produces two spin-flips on neighboring sites
  4. destroys two spin-flips on neighboring sites.

Writing this out in terms of creation and annihilation operators:

<math> \sigma^x_i = D {\psi^\dagger}_i \psi_{i+1} + D^* {\psi^\dagger}_i \psi_{i-1} + C\psi_i \psi_{i+1} + C^* {\psi^\dagger}_i {\psi^\dagger}_{i+1}

\,</math>

Ignore the constant coefficients, and focus attention on the form. They are all quadratic. Since the coefficients are constant, this means that the T matrix can be diagonalized by Fourier transforms.

Carrying out the diagonalization produces the Onsager free energy.

Dimensions 3 and above – free field

In any dimension, the Ising model can be productively described by a locally varying mean field. The field is defined as the average spin value over a large region, but not so large so as to include the entire system. The field still has slow variations from point to point, as the averaging volume moves. These fluctuations in the field are described by a continuum field theory in the infinite system limit.

Local field

The field H is defined as the long wavelength Fourier components of the spin variable, in the limit that the wavelengths are long. There are many ways to take the long wavelength average, depending on the details of how high wavelengths are cut off. The details are not too important, since the goal is to find the statistics of H and not the spins. Once the correlations in H are known, the long-distance correlations between the spins will be proportional to the long-distance correlations in H.

For any value of the slowly varying field H, the free energy (log-probability) is a local analytic function of H and its gradients. The free energy F(H) is defined to be the sum over all Ising configurations which are consistent with the long wavelength field. Since H is a coarse description, there are many Ising configurations consistent with each value of H, so long as not too much exactness is required for the match.

Since the allowed range of values of the spin in any region only depend on the values of H within one averaging volume from that region, the free energy contribution from each region only depends on the value of H there and in the neighboring regions. So F is a sum over all regions of a local contribution, which only depends on H and its derivatives.

By symmetry in H, only even powers contribute. By reflection symmetry on a square lattice, only even powers of gradients contribute. Writing out the first few terms in the free energy:

<math>

\beta F = \int d^dx A H^2 + \sum_{i=1}^{d} Z_i (\partial_i H)^2 + \lambda H^4 ... \,</math>

On a square lattice, symmetries guarantee that the coefficients <math>Z_i</math> of the derivative terms are all equal. But even for an anisotropic Ising model, where the Z's in different directions are different, the fluctuations in H are isotropic in a coordinate system where the different directions of space are rescaled.

On any lattice, the derivative term <math>\scriptstyle Z_{ij} \partial_i H \partial_j H </math> is a positive definite quadratic form, and can be used to define the metric for space. So any translationally invariant Ising model is rotationally invariant at long distances, in coordinates that make <math>Z_{ij}=\delta_{ij}</math>. Rotational symmetry emerges spontaneously at large distances just because there aren't very many low order terms. At higher order multicritical points, this accidental symmetry is lost.

Since <math>\beta F</math> is a function of a slowly spatially varying field. The probability of any field configuration is:

<math>

P(H) \propto e^{ - \int d^dx AH^2 + Z |\nabla H|^2 + \lambda H^4 } \,</math>

The statistical average of any product of H's is equal to:

<math>

\langle H(x_1) H(x_2)....H(x_n) \rangle = { \int DH P(H) H(x_1) H(x_2) ... H(x_n) \over \int DH P(H) } \,</math>

The denominator in this expression is called the partition function, and the integral over all possible values of H is a statistical path integral. It integrates <math>\exp(\beta F)</math> over all values of H, over all the long wavelength fourier components of the spins. F is a Euclidean Lagrangian for the field H, the only difference between this and the quantum field theory of a scalar field is that all the derivative terms enter with a positive sign, and there is no overall factor of i.

<math>

Z = \int DH e ^ { \int d^dx A H^2 + Z |\nabla H|^2 + \lambda H^4} \,</math>

Dimensional analysis

The form of F can be used to predict which terms are most important by dimensional analysis. Dimensional analysis is not completely straightforward, because the scaling of H needs to be determined.

In the generic case, choosing the scaling law for H is easy, the only term that contributes is the first one,

<math>

F = \int d^dx A H^2 \,</math>

This term is the most significant, but it gives trivial behavior. This form of the free energy is ultralocal, meaning that it is a sum of an independent contribution from each point. This is like the spin-flips in the one-dimensional Ising model. Every value of H at any point fluctuates completely independently of the value at any other point.

The scale of the field can be redefined to absorb the coefficient A, and then it is clear that A only determines the overall scale of fluctuations. The ultralocal model describes the long wavelength high temperature behavior of the Ising model, since in this limit the fluctuation averages are independent from point to point.

To find the critical point, lower the temperature. As the temperature goes down, the fluctuations in H go up because the fluctuations are more correlated. This means that the average of a large number of spins does not become small as quickly as if they were uncorrelated, because they tend to be the same. This corresponds to decreasing A in the system of units where H does not absorb A. The phase transition can only happen when the subleading terms in F can contribute, but since the first term dominates at long distances, the coefficient A must be tuned to zero. This is the location of the critical point:

<math>

F= \int d^dx t H^2 + \lambda H^4 + Z (\nabla H)^2 \,</math>

Where t is a parameter which goes through zero at the transition.

Since t is vanishing, fixing the scale of the field using this term makes the other terms blow up. Once t is small, the scale of the field can either be set to fix the coefficient of the <math>H^4</math> term or the <math>\scriptstyle (\nabla H)^2</math> term to 1.

Continued at Ising model, part 2

See also

Footnotes

  1. ^ "Three-dimensional proof for Ising model impossible". Sandia National Laboratories. 200-05-20. http://www.sandia.gov/LabNews/LN04-21-00/sorin_story.html. Retrieved on 2006-10-14. 
  2. ^ J. J. Hopfield (1982). "Neural networks and physical systems with emergent collective computational abilities". Proceedings of the National Academy of Sciences of the USA 79 no. 8: 2554–2558. DOI:10.1073/pnas.79.8.2554. PMID 6953413. 
  3. ^ Jaynes, E. T. (1957). "Information Theory and Statistical Mechanics". Physical Review 106,: 620. DOI:10.1103/PhysRev.106.620. 
  4. ^ Jaynes, Edwin T. (1957). "Information Theory and Statistical Mechanics II". Physical Review 108: 71. DOI:10.1103/PhysRev.108.171. 
  5. ^ Elad Schneidman, Michael J. Berry, Ronen Segev and William Bialek (2006). "Weak pairwise correlations imply strongly correlated network states in a neural population". Nature 440: 1007–1012. DOI:10.1038/nature04701. 

References

External links

This page uses content from Wikipedia. The original article was at Ising model.
The list of authors can be seen in the page history. The text of this Wikinfo article is available under the GNU Free Documentation License and the Creative Commons Attribution-Share Alike 3.0 license.

English | Română | edit