Boo-AI — Master Artificial Intelligence by Building from Scratch

Learning Objectives

By the end of this section you should be able to:

State, in one sentence, why the band gap of a standard PBE calculation is roughly half of the experimental value — and why this is not a numerical bug but a structural feature of any local exchange-correlation approximation.
Identify self-interaction error as the underlying disease, and recognise its two clinical symptoms: under-binding of localised states and over-delocalisation of charge.
Read a hybrid functional like $E_\text{xc}^\text{PBE0} = \tfrac{1}{4}\,E_\text{x}^\text{HF} + \tfrac{3}{4}\,E_\text{x}^\text{PBE} + E_\text{c}^\text{PBE}$ and translate every term back to the physical principle it enforces.
Place PBE, PBE0, HSE06, and the exact Kohn–Sham potential on Jacob's ladder — and predict, before running a single calculation, which one is appropriate for a given material.
Explain why the choice of algorithm (Davidson, RMM-DIIS, Pulay mixing) is just as load-bearing for a hybrid run as the choice of functional, and why a good algorithm can turn a 30× slowdown into a 3× one.

One-line preview: Pure DFT is wrong about band gaps in a very specific, very fixable way. The fix — mixing in a fraction of Hartree–Fock exchange — is conceptually one line and computationally a full chapter, because the integrals involved are nonlocal and the iterative algorithms that worked for PBE no longer converge on autopilot.

The Failure That Launched a Thousand Functionals

At the end of Chapter 5 we built the band structure of CdSe in PBE and read off a fundamental gap of $\sim 0.7\,\text{eV}$ . The experimental gap of bulk zincblende CdSe is $1.74\,\text{eV}$ at room temperature. PBE is not slightly off — it is off by more than a factor of two. This is not a CdSe problem; it is a generic feature of every standard exchange-correlation functional in the LDA/GGA family. The pattern is so robust that it has its own name: the DFT band-gap problem.

Material	PBE gap (eV)	Experiment (eV)	Error
Si	0.61	1.17	−48 %
Ge	0.00 (metallic!)	0.66	−100 %
GaAs	0.55	1.52	−64 %
CdSe (zb)	0.70	1.74	−60 %
ZnO	0.73	3.44	−79 %
NiO	0.4 (metallic in PBE)	4.3	qualitatively wrong
Diamond	4.18	5.48	−24 %

Three lessons from that table. First, the error is systematic: pure DFT gaps are almost always too small, never too large. Second, the error is material-dependent: covalent semiconductors lose a factor of two; transition-metal oxides like NiO are not just quantitatively but qualitatively wrong, predicted as metals when they are insulators. Third, the error scales with the localisation of the relevant orbitals — Ni 3d in NiO is far more localised than Si 3p in silicon, and the gap collapses accordingly. We will see why in a moment.

Why this is not just an academic complaint

A wrong band gap propagates into every observable that touches it: optical absorption thresholds, photoluminescence wavelengths, defect ionisation energies, charge-transfer barriers, dielectric constants, and the band offsets that decide whether a heterostructure is a good photovoltaic or a bad one. For Mn-doped CdSe — the project of Chapter 7 — the gap controls both the host emission wavelength and the energy of the Mn ${}^4T_1 \to {}^6A_1$ transition. Get the gap wrong by a factor of two and your predicted device colour is in a different part of the visible spectrum.

Where the Error Comes From

The Kohn–Sham equations replace the interacting many-electron problem with a fictitious system of non-interacting electrons moving in an effective potential

v_\text{KS}(\mathbf{r}) \;=\; v_\text{ext}(\mathbf{r}) \;+\; v_\text{H}[\rho](\mathbf{r}) \;+\; v_\text{xc}[\rho](\mathbf{r}).

The first two pieces — the external potential and the Hartree (classical Coulomb) potential — are exact and local. Everything quantum and interacting has been pushed into the third term, the exchange-correlation potential $v_\text{xc}$ . The exact $v_\text{xc}$ is unknown; LDA approximates it as a function of the local density $\rho(\mathbf{r})$ , GGA also as a function of the gradient $\nabla \rho$ . Both are local — they depend only on the density at point $\mathbf{r}$ .

The exact exchange interaction, by contrast, is irreducibly nonlocal:

E_\text{x}^\text{HF} \;=\; -\tfrac{1}{2}\sum_{i,j}\iint \frac{\psi_i^*(\mathbf{r})\,\psi_j^*(\mathbf{r}')\,\psi_j(\mathbf{r})\,\psi_i(\mathbf{r}')}{|\mathbf{r}-\mathbf{r}'|}\,d\mathbf{r}\,d\mathbf{r}'.

That double integral over occupied orbitals $\psi_i, \psi_j$ contains the Pauli exclusion principle in its most physical form: an electron at $\mathbf{r}$ sees a hole at $\mathbf{r}'$ with weight set by the orbital overlap. Replace it with a function of the local density and you have thrown away the long-range tail of that hole. The price you pay is a residual self-interaction: in pure DFT, every electron partly repels itself.

The one-electron crime scene

Consider a single hydrogen atom — one electron, one proton. The exact total energy is $-13.6\,\text{eV}$ . In Hartree–Fock, exchange exactly cancels the Hartree term for a one-electron system: no self-interaction, exact answer. In LDA, the local approximation does not cancel: the electron pays a spurious Coulomb cost for repelling its own density. This residual error is small for hydrogen ( $\sim 0.3\,\text{eV}$ ), but it is the seed of every band-gap underestimate, every over-delocalised polaron, and every wrong-sign charge transfer in the LDA/GGA literature.

Two Symptoms of the Same Disease

Self-interaction error manifests in two ways that look unrelated until you stare at them long enough to see they are the same equation in different clothes.

Symptom 1 — Underestimated band gaps

The fundamental gap of a solid is the difference between the ionisation potential and the electron affinity:

E_\text{g} \;=\; \big[E(N+1) - E(N)\big] - \big[E(N) - E(N-1)\big].

The exact Kohn–Sham gap differs from this fundamental gap by a discontinuity $\Delta_\text{xc}$ in the exchange-correlation potential when an electron is added — a well-known formal result of Perdew & Levy and Sham & Schlüter (1983). LDA and GGA functionals are continuous functionals of the density, so by construction their $\Delta_\text{xc} = 0$ . The DFT gap is missing a contribution that is, in many materials, comparable in size to the gap itself. That is the gap problem in one equation.

Symptom 2 — Over-delocalised charge

Localise an electron on a defect site, and self-interaction pushes its density outward to lower the spurious self-repulsion. The result is a defect orbital that looks too diffuse, polarons that prefer to delocalise across many sites instead of self-trapping, and band edges whose orbital character is smeared across too many atoms. In NiO, the Ni 3d electrons want to be tightly localised on each Ni site — Hund's rules and crystal-field splitting demand it. PBE refuses to let them, and the system collapses to a metallic ground state.

Why these are the same disease

A band-gap is the energy cost of taking one electron from one place and putting it somewhere else. If self-interaction lets electrons spread out for free, then the "take one electron away" step is cheaper than it should be — the gap shrinks. The over-delocalisation of charge and the underestimation of the gap are two faces of the same coin: pure DFT systematically prefers smeared electron densities to localised ones.

The Fix in One Sentence: Mix in Some Exact Exchange

Hartree–Fock has zero self-interaction (it cancels exactly between the exchange and Hartree terms) but no correlation, so HF gaps come out far too large: 7+ eV for silicon, 13 eV for diamond. PBE has decent correlation but residual self-interaction, so its gaps come out too small. The simplest possible thing to do is mix them:

E_\text{xc}^\text{hybrid} \;=\; a\,E_\text{x}^\text{HF} \;+\; (1-a)\,E_\text{x}^\text{DFT} \;+\; E_\text{c}^\text{DFT}.

The fraction $a$ is the exact-exchange mixing parameter. The two most influential choices in the literature both fall out of physical arguments rather than fitting:

Functional	a	Origin	Best for
PBE0	0.25	Adiabatic-connection / perturbation theory argument by Becke (1996)	Molecules, finite systems, gaps in semiconductors
B3LYP	0.20	Empirically fit to atomisation energies (Becke 1993)	Quantum chemistry, organic molecules
HSE06	0.25 (short-range only)	Range-separated PBE0 — keeps short-range HF, drops the long-range tail	Solids, especially semiconductors and oxides

For solids, plain PBE0 is usually too expensive and slightly over-corrects the gap; HSE06 is the default workhorse because the screening of the long-range Coulomb interaction in a metal or small-gap semiconductor justifies dropping the long-range exchange tail. We dedicate Section 6.5 to HSE06 in detail.

Why 0.25 keeps showing up

Becke's adiabatic-connection argument estimates that $a \approx 1/4$ from second-order Møller–Plesset perturbation theory: roughly, that fraction of HF exchange optimally cancels the leading-order self-interaction error while preserving the correlation captured by the DFT piece. PBE0's 25 % and HSE06's 25 % short-range exchange are not coincidences — they reflect a genuine physical estimate, not a fit.

Jacob's Ladder of Functionals

John Perdew organised the zoo of exchange-correlation functionals into what he called Jacob's ladder: each rung adds a new ingredient and, in principle, a new piece of physics.

Rung	Class	Ingredient	Examples	Typical gap error
1	LDA	Local density ρ(r)	PW92, VWN	−50 % to −60 %
2	GGA	+ density gradient ∇ρ	PBE, PW91, BLYP	−40 % to −50 %
3	meta-GGA	+ kinetic-energy density τ	SCAN, TPSS, M06-L	−20 % to −40 %
4	Hybrid	+ fraction of exact HF exchange	PBE0, B3LYP, HSE06	0 % to ±10 %
5	Double hybrid / RPA	+ virtual orbitals (correlation)	B2PLYP, RPA, GW	few %

Two things to take away. First, every rung adds non-local information about the electronic structure — first the gradient, then the kinetic energy density, then the occupied orbitals (for HF exchange), then the virtual orbitals. Second, the cost grows roughly an order of magnitude per rung. Hybrid functionals are the first rung where the gap problem is meaningfully fixed, but they are also the first rung where the algorithmic cost forces you to think hard about the underlying SCF machinery.

Where this chapter lives on the ladder

We spend most of Chapter 6 on rung 4 — hybrid functionals — because for most materials they are the first rung that gets the band gap right, and they are also the rung most VASP users will reach for in practice. Rung 5 (GW, BSE) was previewed in Section 5.10 and lives in a more specialised territory; if you understand hybrids well, GW is a one-day jump.

Why Algorithms Matter as Much as Functionals

Switching from PBE to HSE06 is a one-line change in your INCAR:

📝text

1# INCAR — switch from PBE to HSE06
2LHFCALC = .TRUE.        # turn on Hartree-Fock exchange
3HFSCREEN = 0.2          # range-separation parameter (Å^-1) — defines HSE06
4AEXX    = 0.25          # 25% short-range HF exchange
5ALGO    = ALL           # all-bands algorithm; needed for nonlocal exchange

The functional has changed by three lines. The cost of the calculation has changed by a factor of 30 to 100. Why? Because every iteration of the SCF cycle now has to evaluate the nonlocal Fock operator — that double integral over all pairs of occupied orbitals — at every k-point. The naive scaling is $\mathcal{O}(N_k^2 \, N_\text{bands}^2 \, N_\text{plane-waves}\,\log N_\text{plane-waves})$ per SCF step, and the SCF cycle itself is harder to converge because the Fock operator is non-diagonal in the plane-wave basis. The naive Davidson algorithm that worked beautifully for PBE struggles. Pulay mixing parameters that were robust for GGAs throw the iteration into oscillation. The functional gives you the right answer; the algorithm decides whether you ever reach it.

A real budget: PBE → HSE06 on Mn:CdSe

A 64-atom Mn:CdSe supercell on 16 cores: PBE finishes a structural relaxation in ~25 minutes. HSE06 with default ALGO=ALL takes ~14 hours on the same hardware. With a thoughtful choice of algorithm (ALGO=Damped, NELMDL=−12, KPAR=4, well-chosen $\text{PRECFOCK}=\text{Fast}$ ) the same calculation finishes in ~3 hours. That is what algorithms buy you. Sections 6.6–6.9 spend ~70 reading minutes on the machinery that turns the 14-hour run into the 3-hour one.

What This Looks Like in VASP

Before we dive into the formal pieces, here is the full INCAR you will be running by the end of this chapter on Mn:CdSe — annotated so each tag is recognisable now even if its meaning will be unpacked section by section:

📝text

1# INCAR — HSE06 hybrid run for Mn:CdSe (preview)
2
3# ---- Functional ----
4LHFCALC  = .TRUE.       # turn on HF exchange (§6.4, §6.5)
5HFSCREEN = 0.2          # screening (Å^-1); 0.2 = HSE06 (§6.5)
6AEXX     = 0.25         # fraction of HF exchange (§6.4)
7
8# ---- Iterative diagonalization ----
9ALGO     = ALL          # all-bands minimisation; required for nonlocal X (§6.6)
10NELM     = 100          # max SCF steps
11NELMIN   = 6            # min SCF steps
12EDIFF    = 1E-6         # SCF tolerance
13TIME     = 0.4          # trial step for direct minimisation (§6.8)
14
15# ---- Density mixing ----
16AMIX     = 0.2          # linear mixing weight (§6.7)
17BMIX     = 0.0001       # Kerker damping (§6.7)
18AMIX_MAG = 0.8          # mixing for magnetisation (Mn doping)
19BMIX_MAG = 0.0001
20
21# ---- Cost knobs ----
22PRECFOCK = Fast         # coarse FFT grid for the Fock operator (§6.9)
23NKRED    = 2            # downsample k-grid for HF only (§6.9)
24LMAXFOCK = 4            # max ang. mom. for one-centre HF terms (§6.9)
25KPAR     = 4            # k-point parallelism (§6.10)

Twelve tags — each one is a chapter section in disguise. $\texttt{LHFCALC}$ , $\texttt{AEXX}$ , and $\texttt{HFSCREEN}$ select the functional; $\texttt{ALGO}$ , $\texttt{NELMIN}$ , and $\texttt{TIME}$ control the iterative diagonalisation; $\texttt{AMIX}$ , $\texttt{BMIX}$ set the density mixer; $\texttt{PRECFOCK}$ , $\texttt{NKRED}$ , and $\texttt{KPAR}$ are the cost-acceleration knobs. By the end of Section 6.10 you will be able to write, annotate, and tune that file from memory.

The Map of Chapter 6

Section	Topic	What it adds
6.2	Hartree–Fock Exchange Recap	The variational equations, the exchange operator, and why HF gaps are too big
6.3	Self-Interaction Error in DFT	Why pure DFT must under-bind localised states; one-electron and many-electron SIE
6.4	Hybrid Functionals: PBE0, B3LYP, HSE06	Adiabatic-connection derivation; mixing parameters; when each one wins
6.5	Range-Separated Hybrids — HSE in Detail	Splitting 1/r into short and long range; why screening matters in solids
6.6	Iterative Diagonalization: Davidson and RMM-DIIS	Subspace methods for big sparse Hamiltonians; how ALGO=Normal/Fast/All differ
6.7	Density Mixing: Pulay, Broyden, and Kerker	Why naive SCF oscillates; what AMIX and BMIX really do
6.8	Direct Minimization and Preconditioning	Skipping diagonalisation altogether — ALGO=Damped, ALGO=All; the role of the preconditioner
6.9	Computational Cost and Acceleration of Hybrid DFT	PRECFOCK, NKRED, ACFDT-like tricks; when downsampling is safe
6.10	Running Hybrid DFT in VASP	End-to-end recipe: convergence tests, restart strategy, common failure modes, and a clean Mn:CdSe HSE06 INCAR

Summary

Standard LDA/GGA functionals systematically underestimate band gaps by ~40–60 % for typical semiconductors and qualitatively miss the insulating ground state of strongly correlated oxides like NiO.
The root cause is self-interaction error: local exchange-correlation approximations let an electron partially repel its own density, which (a) shifts band-edge energies and (b) over-delocalises charge.
The cure is to mix a fraction of exact (Hartree–Fock) exchange into the functional. PBE0 mixes 25 % everywhere; HSE06 mixes 25 % only at short range. Both fix the gap of typical semiconductors to within ~10 % of experiment.
Jacob's ladder organises functionals by the ingredients they use; hybrids sit on rung 4 and are the first rung that consistently gets band gaps right.
Hybrids are 10–100× more expensive than GGA per SCF step, and the iteration is harder to converge. Choosing the right algorithm (Davidson / RMM-DIIS / direct minimisation) and density mixer (Pulay, Broyden, Kerker-damped) is just as important as choosing the functional.
In VASP, switching to HSE06 is a three-tag change to the INCAR ( $\texttt{LHFCALC}, \texttt{HFSCREEN}, \texttt{AEXX}$ ), but a robust hybrid run typically tunes a dozen tags in concert.

Section 6.1 Core Insight

"The DFT band-gap problem is not a bug to be fixed by a tighter ENCUT or a denser k-grid. It is a structural feature of any local functional, and the cure — a fraction of exact exchange — is one line of physics and ten sections of algorithms."

Coming next: Section 6.2 — Hartree–Fock Exchange Recap — where we re-derive the Fock operator from the antisymmetrised product wavefunction and look closely at the nonlocal integral that hybrids will eventually mix into the Kohn–Sham potential.