Learning Objectives
Magnetism is the place where the abstract ideas of Chapter 4 (Pauli exclusion, exchange–correlation) finally show up as a number you can measure with a SQUID. By the end of this section you should be able to:
- Explain in one sentence why magnetism is fundamentally quantum-mechanical — and why the Bohr–van Leeuwen theorem says no classical solid can be magnetic.
- Translate between three vocabularies that describe the same physics: atomic spin , magnetic moment , and the spin density .
- Recognise the four canonical magnetic orderings — paramagnetic, ferromagnetic (FM), antiferromagnetic (AFM), and ferrimagnetic (FiM) — by inspection of a unit cell, and predict which one a Heisenberg Hamiltonian will favour from the sign of .
- Read a spin-polarised band structure or DOS plot fluently: identify the exchange splitting , compute the net moment as , and apply the Stoner criterion to predict ferromagnetism in a metal.
- Set up a spin-polarised VASP calculation correctly: the meaning of ISPIN, MAGMOM, NUPDOWN, LORBIT, and LMAXMIX; and how to seed an AFM unit cell so the SCF does not collapse to FM.
One-line preview: a magnetic crystal is what happens when the Pauli exclusion principle, the Coulomb interaction, and the translation symmetry of a lattice conspire to make it cheaper for some electrons to align their spins than to antialign them. Everything else — Heisenberg , exchange splitting, Stoner enhancement, MAGMOM in VASP — is bookkeeping on top of that one idea.
Where Magnetism Comes From
It is worth pausing on a deep and counterintuitive fact: classical physics has no room for permanent magnetism in a solid. The Bohr–van Leeuwen theorem (1911, 1921) shows that any ensemble of classical charged particles in thermal equilibrium has zero average magnetic moment, no matter what magnetic field you apply. Lorentz forces do no work, the partition function is independent of the vector potential, and the magnetic susceptibility is identically zero. Yet a refrigerator magnet sticks. Where does the moment come from?
It comes from quantum mechanics, in two distinct steps. First, every electron carries an intrinsic magnetic moment with and . That is a relativistic effect; it does not exist in the classical world. Second, when you put many electrons into the same crystal, the Pauli exclusion principle forbids two of them from occupying the same spatial orbital with the same spin. This forces the wavefunction to be antisymmetric, which in turn lowers the Coulomb repulsion between same-spin electrons (they cannot get close to each other). Aligning spins becomes energetically favourable — even though the magnetic dipole–dipole interaction is far too weak to do that on its own.
The energy scales tell the story
Magnetic dipole–dipole interaction between two electrons one ångström apart: (sub-Kelvin). Coulomb repulsion between those same electrons: . The exchange interaction — the part of that depends on whether the spins are parallel or antiparallel — is typically – eV, giving Curie temperatures of 100–1000 K. Magnetism is Coulomb physics in disguise, brokered by Pauli.
This is why the Heisenberg Hamiltonian we will write down in a moment looks like a spin-spin interaction even though there is no such fundamental force. The term is an effective description of the spin-dependent part of the Coulomb energy, integrated out from the underlying many-electron wavefunction. The number is what survives.
Spin as a Second Quantum Label
Section 5.1 introduced the band index and the Brillouin-zone wavevector as the two labels of a Bloch state. There is a third label, hidden in plain sight: spin. A complete Bloch state is with . Inside a non-magnetic, non-relativistic crystal the two spin channels are degenerate by time-reversal symmetry — every band carries two electrons, one up and one down, and we can ignore the label. The moment we break that degeneracy, by either spontaneous magnetism or an external field, the third label comes alive.
For collinear magnetism (every spin pointing along a single global axis, conventionally ) the bookkeeping simplifies dramatically: the Hamiltonian decomposes into two independent blocks, one for each spin channel, and we have two Kohn–Sham equations instead of one,
with two distinct effective potentials and . The whole magnetic story is contained in the fact that, in a magnetic ground state, those two potentials are not equal. They differ by precisely the exchange–correlation field :
That is the entire content of spin-density-functional theory: promote the density to a pair , let the exchange–correlation functional depend on both, and you automatically get spin-split Kohn–Sham potentials. The local magnetisation is then the difference,
and the total moment per unit cell is the integral
where is the total number of electrons in spin channel per unit cell. The headline number a VASP calculation prints — "total moment = 2.20 μ_B/cell" — is exactly this integral.
| Quantity | Symbol | Where it lives in DFT | Where in VASP |
|---|---|---|---|
| Spin-up density | n↑(r) | First density channel | CHGCAR (channel 1) |
| Spin-down density | n↓(r) | Second density channel | CHGCAR (channel 2) |
| Spin density / magnetisation | m(r) = n↑ − n↓ | Difference of channels | CHGCAR.spin (post-processed) |
| Total moment per cell | μ = μ_B (N↑ − N↓) | Integral of m(r) | OUTCAR — 'tot' column |
| Per-atom moment | μ_i = ∫_PAW-sphere m(r) dr | Projected onto atomic spheres | OUTCAR — 'magnetization (x)' table |
| Exchange-correlation field | B_xc(r) | Functional derivative of E_xc | Internal — not written |
A useful mental anchor
For a textbook ferromagnet, both bands and density of states come in two colours. Imagine the non-magnetic DOS coloured grey; turning on spin polarisation draws two copies — one shifted down (majority spin) and one shifted up (minority spin). The integrated occupied-state difference between the two copies is the moment. We will see this exactly in the interactive a few sections down.
The Exchange Interaction and the Heisenberg Model
Take two electrons, one in orbital on site A and one in on site B. Pauli forbids them from occupying the same spin-orbital, so the two-electron wavefunction must be antisymmetric. There are two ways to achieve that:
- Singlet (antiparallel spins). Symmetric spatial part, antisymmetric spin part. The electrons can sit on the same atom for an instant (virtual hopping is allowed), which costs a Coulomb-repulsion penalty but lowers the kinetic energy by through second-order perturbation theory.
- Triplet (parallel spins). Antisymmetric spatial part, symmetric spin part. The two electrons cannot occupy the same orbital — the wavefunction vanishes when (the "Fermi hole"). No virtual hopping, no kinetic-energy gain from , but also no Coulomb penalty — parallel spins effectively avoid each other, which lowers the direct Coulomb integral by an amount called the exchange integral .
The energy difference between the two configurations defines the exchange coupling . Through the Heitler–London or Hubbard analyses one finds the compact result
which is a beautiful little equation. The first term is the direct exchange — it favours parallel spins (FM tendency). The second term is the kinetic exchange — it favours antiparallel spins (AFM tendency). For most insulating magnets the kinetic term wins and the ground state is AFM; for many metallic 3d ferromagnets (Fe, Co, Ni) the direct term wins. The sign and magnitude of tell you which ordering will appear.
Once you have computed the per-bond (in DFT this is done by mapping total-energy differences between magnetic configurations onto a model — see "Energy Comparison" below), the magnetic crystal is described by the Heisenberg Hamiltonian:
with a sign convention where favours parallel spins (FM) and favours antiparallel (AFM). The sum runs over neighbour pairs . Despite its appearance, this is not a fundamental Hamiltonian — it is the effective low-energy description of the same Coulomb-plus-Pauli physics we just derived.
What this earns you
Once you have , you can throw the electrons away. The Curie temperature (mean-field), the spin-wave dispersion , and the magnetic ground-state structure all fall out of the Heisenberg model alone. This is why fitting from DFT total energies and then handing it to a Monte-Carlo or spin-wave code is the standard recipe for magnetic phase diagrams.
Interactive — The Four (and a Half) Magnetic Orderings
Below is a 3×3×3 simple-cubic lattice of magnetic atoms. Each atom carries a spin arrow. Click the buttons across the top to switch between paramagnetic (PM), ferromagnetic (FM), G-type and A-type antiferromagnetic (AFM-G, AFM-A), ferrimagnetic (FiM), and a non-collinear (NC) helical state. The footer reports the live net moment so you can verify by hand that AFM cancels and FM saturates.
Three things to do with the controls, in order:
- Switch to AFM-G. Every atom is anti-aligned with all six of its nearest neighbours along x, y, and z — a 3-D checkerboard. The net moment ΣM is essentially zero. This is the ground state of MnO, NiO, and most of the antiferromagnetic rock-salt oxides. Note that the magnetic unit cell (period 2 along each axis) is twice as large as the chemical unit cell — a fact that will bite us when we set up VASP supercells.
- Switch to AFM-A. Within each (xy)-layer the spins are FM; between layers they alternate. This shows up in the perovskite manganites and in CrI₃-type 2D magnets. The magnetic unit cell is doubled along z but not along x/y.
- Switch to FiM with the amplitude slider. The two sublattices have unequal magnitudes and opposite signs. The net moment is the difference, not zero. This is the structure of magnetite Fe₃O₄ (the original ‘lodestone’): two Fe sites with opposite spins, one with twice the moment of the other, giving a residual macroscopic magnetisation.
Why the netMoment readout is exact
The number ΣM in the footer is the literal sum over the 27 sites, with each set by the ordering. For FM with amplitude 0.7 you should read ; for AFM-G the parity of (i+j+k) gives 14 up and 13 down → net . The single-site asymmetry only disappears in the thermodynamic limit; this is exactly the boundary effect that forces real AFM calculations to use even-sized supercells.
Spin-Polarized DFT — Two Densities, One Crystal
We are now ready to plug magnetism into the Kohn–Sham machinery from Section 4.6. The promotion is mechanical: instead of one density-functional Lagrangian we have two, one for each spin channel, coupled only through the exchange–correlation functional. The total energy becomes a functional of the two densities,
with . Functional differentiation gives one Kohn–Sham equation per spin channel, with potentials
The first two terms (external potential and Hartree term) are spin-blind — they depend only on the total density. The exchange–correlation term is where spin polarisation enters. In the local spin-density approximation (LSDA), we evaluate the spin-polarised homogeneous electron-gas functional at the local densities,
where is the relative spin polarisation. GGA functionals (PBE, PW91) generalise this by also depending on . The point is that is a different function of ; for the energy density is lower than the spin-balanced result. That difference is the driving force for magnetism in DFT.
LSDA vs GGA for magnets — a practical caveat
LSDA underestimates the moment of Fe (≈ 2.15 μ_B vs experimental 2.22) and gets the BCC ground state wrong in some other 3d metals. PBE-GGA is the modern default — it gets BCC-Fe, FCC-Ni, and HCP-Co all in their correct ferromagnetic ground states with near-experimental moments. For strongly correlated oxides (NiO, MnO, CoO) you need on top a Hubbard correction (DFT+U) or a hybrid functional — see Chapter 6.
The price for magnetism is computational: ISPIN = 2 doubles the size of the eigenvalue problem (two channels of bands) and roughly doubles the wall time. The pay-off is that every electronic property now comes with a spin label: bands split into majority and minority, DOS becomes two curves, and the per-atom moment is a number you can read off OUTCAR.
Interactive — Exchange Splitting and the Net Moment
The cleanest way to see magnetism in DFT output is the spin-resolved DOS. We plot on one side of the energy axis and on the other, with a horizontal Fermi-level line. The net moment per cell is the integrated difference of occupied states,
which is reported live in the bottom strip of the figure. Drag Δex to control how strongly the two channels split apart — physically this is set by the exchange–correlation field — and drag EF to control the band filling.
Two limits to feel out:
- Δex = 0. The two curves coincide. Every state below EF is doubly occupied (one ↑, one ↓), so and the moment is zero. This is the non-magnetic, paramagnetic ISPIN = 1 picture.
- Δex ≈ 1.5 eV, EF in the gap between the two ↓ peaks. The majority channel is mostly filled, the minority channel is partly emptied, and the moment opens up to ~1–2 μ_B per atom. This is the band-theory picture of the BCC-Fe ground state — a so-called weak ferromagnet, where both channels still cross EF. Push Δex higher and the minority channel can be pushed entirely above EF: this is a half-metal (NiMnSb, CrO₂) — gapped in one channel and metallic in the other, the ideal source for spintronic devices.
Numerical sanity check
For most 3d ferromagnets, exchange splitting is empirically where is the Stoner parameter (≈ 1 eV/μ_B for Fe, Co, Ni). Read off the splitting in your DOS, divide by the moment from OUTCAR, and check that it lands near 1 eV/μ_B. If it does, your calculation is in the textbook regime; if not, suspect either a non-converged k-grid or correlations beyond DFT.
When Does a Metal Become Ferromagnetic? The Stoner Criterion
Why do Fe, Co, and Ni order ferromagnetically while their neighbours Cu, Pd, and Pt do not? The exchange parameter is similar across the 3d row, so it must be something about the band structure itself. Stoner's 1938 argument is one of the most elegant criteria in solid-state physics.
Imagine starting from a non-magnetic metal and infinitesimally moving a tiny number of electrons from the ↓ channel just below EF to the ↑ channel just above. The cost in kinetic energy is per electron moved (the inverse density of states sets the energy gap you have to span). The reward in exchange energy is (each spin flip lowers the energy by times the imbalance you just created). The total energy change is therefore
and ferromagnetism is the spontaneous winner whenever the bracket is negative — that is, the Stoner criterion:
A high density of states at the Fermi level is the lever; the exchange parameter is the prize.
| Element | g(E_F) (states/eV·atom) | I (eV) | I·g(E_F) | Outcome |
|---|---|---|---|---|
| Fe (BCC) | ≈ 2.4 | 0.93 | ≈ 2.2 | Ferromagnet (μ ≈ 2.2 μ_B) |
| Co (HCP) | ≈ 2.0 | 0.99 | ≈ 2.0 | Ferromagnet (μ ≈ 1.7 μ_B) |
| Ni (FCC) | ≈ 2.7 | 1.01 | ≈ 2.7 | Ferromagnet (μ ≈ 0.6 μ_B) |
| Pd (FCC) | ≈ 1.6 | 0.71 | ≈ 1.1 | Borderline, paramagnetic but enhanced |
| Cu (FCC) | ≈ 0.3 | 0.73 | ≈ 0.2 | Non-magnetic (filled d-shell) |
| Al (FCC) | ≈ 0.2 | 0.66 | ≈ 0.1 | Non-magnetic |
Pd is the famous near-miss: , almost satisfying the criterion but not quite. Its susceptibility is enhanced by an order of magnitude over the Pauli value, and a tiny amount of Co or Ni doping pushes it over the threshold. Cu, with its full d-shell, has tiny from the s-band only and the criterion fails by an order of magnitude.
The Stoner criterion is what your DFT actually solves
When you turn on ISPIN = 2 with a non-zero MAGMOM seed, VASP minimises the spin-polarised energy. If the SCF flows downhill into a ferromagnetic state and the moment grows iteration by iteration. If the moment decays back to zero and the answer is paramagnetic regardless of how large you set the initial moment. This is why setting MAGMOM correctly is necessary but not sufficient — the underlying band structure has the final say.
Picking the Ground State by Total-Energy Comparison
For most magnetic materials there is no analytical shortcut for deciding which ordering is the ground state. The standard recipe is embarrassingly direct: compute the total energy of every candidate ordering and pick the smallest. For a binary transition-metal oxide like NiO this means at minimum:
- A non-magnetic (ISPIN = 1) reference run.
- A ferromagnetic (FM) run on the chemical unit cell.
- A G-type AFM run on a doubled magnetic unit cell (e.g. a 2×2×2 supercell of the rock-salt cell).
- Optionally, A-type, C-type, and other AFM patterns if the lattice allows them.
Order the resulting energies. The lowest is the predicted ground state. The differences also let you fit the Heisenberg : writing
gives you from a single energy difference, with the coordination number and the local spin. Repeat with second- and third-neighbour AFM patterns to extract , , etc. This is the workhorse "four-state mapping" method used in modern magnetic first-principles studies.
A worked baseline: NiO
DFT-PBE+U total energies for NiO (per Ni atom): (reference), , . The 40-meV preference for AFM-II is small but reproducible across codes and pseudopotentials. The corresponding nearest-neighbour (negative, AFM) and second-neighbour via super-exchange through the bridging O — an order of magnitude larger and the actual driver of NiO's 523-K Néel temperature.
VASP — Spin-Polarized Calculations from INCAR to OUTCAR
Setting up a magnetic VASP calculation requires four INCAR tags (ISPIN, MAGMOM, LORBIT, LMAXMIX) and a discipline about reading the output. Here is the canonical template for BCC iron — the simplest interesting ferromagnet — followed by a line-by-line walkthrough.
1bcc Fe (a = 2.866 Å, conventional cell of 2 atoms)
21.0
3 2.866 0.000 0.000
4 0.000 2.866 0.000
5 0.000 0.000 2.866
6Fe
72
8Direct
9 0.0 0.0 0.0
10 0.5 0.5 0.51Automatic mesh
20
3Gamma
415 15 15
5 0 0 0The INCAR — annotated below
Code Walkthrough — Reading the Magnetic Output
After the SCF converges, all the magnetic information you need is scattered through OUTCAR and (for plotting) vasprun.xml. The following pymatgen snippet pulls it out in a few lines.
Pitfalls You Will Hit On Your First Try
Spin-polarised calculations have a small zoo of failure modes. Each of these will eat half a day from a new student; recognising them in advance saves the day.
| Symptom | Probable cause | Fix |
|---|---|---|
| Total moment converges to 0 even with MAGMOM = 2*2.5 | SCF fell into the non-magnetic local minimum | Increase MAGMOM (try 4.0), tighten EDIFF, set NUPDOWN to fix the moment for the first 30 steps |
| AFM run converges to FM | MAGMOM seed has the wrong pattern OR the cell is not large enough to accommodate AFM | Double the cell along the broken-symmetry axis; write MAGMOM as '4*2.5 4*-2.5' explicitly; never use 'N*x' for AFM |
| SCF energy oscillates by 0.1 eV every step | Mixing parameters wrong for transition metals | Set LMAXMIX = 4 (3d) or 6 (4f); reduce AMIX_MAG to 0.4; AMIX = 0.2 |
| Per-atom moments don't sum to total moment in OUTCAR | Magnetisation outside the PAW spheres (interstitial) | Normal — the difference is the interstitial moment. To match exactly, use larger RWIGS or analyse via Bader |
| Moments wander between SCF iterations | MAGMOM seed too small or too far from a stable solution | Use a stronger seed; run a brief NUPDOWN-constrained warm-up; or apply DFT+U for narrow-band oxides |
| DOS plot shows tiny moment but VASP reports large moment | Wrong sign convention on n_↓ | VASP plots n_↓ as negative; pymatgen returns it positive — flip in your script when plotting |
The single most useful diagnostic
Look at — the change in total moment between successive SCF iterations — printed in OSZICAR. A well-converged magnetic SCF has in the last few steps. Oscillations or monotone drift mean you should tighten EDIFF, lower mixing, or seed differently.
Looking Ahead — Spin-Orbit Coupling
Everything in this section assumed the spin axis is global — every moment is along (collinear). That is exact in a non-relativistic Hamiltonian, where the spin-rotation symmetry of the kinetic and Coulomb operators makes the choice of axis arbitrary. The world is not non-relativistic. The spin-orbit interaction
couples the spin to the spatial wavefunction and breaks the rotational invariance of the spin space. The consequences for a magnetic crystal are dramatic: the moments now point in a definite crystallographic direction (magnetocrystalline anisotropy), bands split by a few hundred meV in heavy elements, and entirely new phenomena emerge — Rashba splitting, Dirac semimetals, topological insulators. This is the subject of Section 5.7 — Spin–Orbit Coupling, where we will turn on the LSORBIT tag in VASP and see exactly how the band structure of CdSe (a near-textbook material for our chapter-6 case study) splits at the valence-band maximum.
Summary
- Magnetism is quantum. The Bohr–van Leeuwen theorem rules out classical magnetism; permanent moments come from electron spin combined with the Pauli exclusion principle, which turns Coulomb repulsion into a spin-dependent exchange interaction.
- Spin is a third Bloch label. A complete state is . In a magnetic crystal the two spin channels have different Kohn–Sham potentials, differing by the exchange–correlation field .
- The four canonical orderings — PM, FM, AFM, FiM — together with non-collinear states cover essentially every magnetic crystal you will meet. in a Heisenberg model favours FM, favours AFM.
- Spin-polarised DFT (LSDA / GGA with ) gives every electronic property a spin label. The exchange splitting separates majority and minority bands; the integrated occupied difference is the moment.
- The Stoner criterion says ferromagnetism appears in a metal whenever the band exchange parameter times the Fermi-level density of states exceeds unity. Fe/Co/Ni satisfy it; Cu/Pd do not.
- In VASP: ISPIN = 2 turns on spin polarisation, MAGMOM seeds the SCF (must be set explicitly for AFM cells), LORBIT = 11 writes per-atom orbital-resolved moments, and LMAXMIX = 4 stabilises the SCF for 3d transition metals. The ground-state ordering is decided by total-energy comparison across candidate magnetic configurations.
Coming next: Section 5.7 — Spin–Orbit Coupling — where we let the spin axis tilt off , watch new band splittings open, and turn on the LSORBIT tag in VASP for the CdSe case study.