Learning Objectives
By the end of this section you should be able to:
- Explain in plain words why an electron inside a crystal behaves dynamically as if it had a different mass than the bare electron in vacuum — and recognise the effective mass as the single number that captures this dressing of the electron by the periodic potential.
- Derive the master formula from the semiclassical equations of motion, and read it the other way: a band's curvature is a mass, expressed in inverse units.
- Generalise to anisotropic minima — the inverse mass is a tensor — and tell apart the three masses you will meet in practice: the band-curvature mass, the density-of-states mass , and the conductivity mass .
- Explain holes — why a missing electron at the top of a valence band carries positive charge and a positive effective mass, and where the sign flips happen in the bookkeeping.
- Connect to transport via the Drude formulae and , and predict which materials make fast transistors and which make sluggish insulators just from a band structure plot.
- Compute an effective mass from VASP output: locate the band edge in vasprun.xml, sample E(k) along a dense path, fit a parabola, and convert the leading coefficient to .
One-line preview: the curvature of a band edge is a mass, the inverse of that mass is a velocity-per-force, and that single number — together with a scattering time τ — predicts every bulk transport coefficient your device-physics professor will ever ask you to compute.
The Puzzle — A Real Mass That Lies
Take an electron in vacuum. Push it with a force . Newton, undisturbed since 1687, says it accelerates at with . Now drop the same electron into a silicon crystal at the bottom of the conduction band, apply the same force, and measure the acceleration. You will find it accelerates as if it had a mass of along the longitudinal axis of an ellipsoidal valley and in the perpendicular directions. Push an electron at the bottom of the GaAs conduction band and it responds as if it weighed only — barely more than a feather's worth of the bare value. Push it into the heavy-hole band of GaAs and the same particle suddenly weighs .
The electron itself never changed. What changed is its environment. Inside the crystal, the periodic potential of the ions is constantly scattering the electron — emitting and reabsorbing it as Bloch waves — and the net effect is that response to an external force looks exactly like Newton's second law would for a particle with a different mass. We call that number the effective mass . It is not a fudge factor; it is a genuine, calculable property of the band structure, and it determines almost every electronic property the engineer cares about.
A useful analogy: marbles in molasses, beads on a wire
Imagine a marble rolling on a flat table — that is the vacuum electron. Now drop it into a viscous fluid: pushing it requires more force per unit acceleration, so it acts heavier. That is the heavy-electron limit. Conversely, a bead threaded on a frictionless straight wire, pushed along the wire, requires less force than a free marble — most of the bead's "mass" is constrained by the wire and only the parallel component matters. That is the light-electron limit. The crystal lattice is doing something analogous: it dresses the electron with a band-dependent inertia.
Newton's Law for a Bloch Electron
Section 5.1 already introduced the two semiclassical equations of motion that govern a Bloch electron in a smoothly varying external force . We restate them here because they are the only physics input we need:
The first equation says the wavevector evolves under the external force, exactly as the momentum does in classical mechanics — the crystal momentum plays the role of . The second says the velocity of the wavepacket is the gradient of the band in k-space: a wavepacket centred at moves with the group velocity. Put these two together and you have a complete dynamical theory: an external force slides the wavepacket along the band, and the band itself prescribes the velocity at every point.
Why this is "semiclassical"
The Bloch wavefunction is fully quantum, but the centre of mass of a wavepacket built from nearby k-states obeys these classical-looking equations, exactly. The magic is hidden in : every quantum-mechanical subtlety of the periodic potential is already baked into the shape of the band. Solve the band structure once, then run classical equations of motion. This is why a single-particle band calculation is so powerful in device physics.
Curvature → Mass: The Derivation
Differentiate the velocity equation with respect to time and use the chain rule:
Compare to Newton's second law . We are forced to identify the inverse effective-mass tensor as the Hessian of the band:
This is the master formula. Read it slowly. The left-hand side is a familiar mechanical object — a 3×3 inverse mass tensor with units of . The right-hand side is a purely geometric property of the band structure — the curvature of at a chosen k-point. These are the same thing. Curvature in k-space is mass.
The physical meaning of the formula
- Sharp parabola at a band minimum (large curvature) → small → light, mobile carrier. GaAs electron with .
- Flat saucer at a band minimum (small curvature) → large → heavy, sluggish carrier. Heavy-fermion compounds reach .
- Negative curvature (a band maximum, like the valence band top) → negative effective mass for the electron description. We will see in §Holes how this sign flip is absorbed by introducing the hole.
The 1D scalar form
For a one-dimensional band, or for an isotropic minimum where the Hessian is a multiple of the identity, the tensor collapses to a single number:
Around a band minimum, expand the energy in a Taylor series. The zero-th order is the band-edge energy . The first derivative vanishes (it's an extremum). What remains is the parabolic approximation:
— exactly the kinetic energy of a free particle of mass . Near a band edge, every semiconductor pretends to be a free-electron gas with a renormalised mass. This is the single most useful approximation in solid-state physics. Every textbook formula for carrier statistics, mobility, optical absorption near the gap, exciton binding energy, and donor level depth is built on it.
The k·p method — where the formula really comes from
Strictly speaking, the parabolic expansion is the leading order of k·p perturbation theory: write and treat the second term as a perturbation around . The second-order correction couples the band of interest to all other bands through momentum matrix elements , and yields exactly the inverse-mass tensor formula above. The lighter the band edge, the stronger its coupling to a remote band of opposite parity. In III-V semiconductors most of the conduction electron's lightness comes from coupling to the valence band ~1 eV away — the smaller the gap, the lighter the electron. That is why InSb (gap 0.17 eV) has while diamond (gap 5.5 eV) has .
Interactive — Bend the Band, Read the Mass
Below is a one-band sandbox. Slide the effective-mass control to make the parabola open wide (light mass) or narrow (heavy mass). Drag the pink dot along the band — the dashed tangent gives you the slope at that point, which is exactly times the group velocity . Toggle between the electron view (parabola opens upward at the conduction band minimum) and the hole view (parabola opens downward at the valence band maximum) to see the sign flip we will discuss in §Holes below. The faint dashed parabola is the free-electron reference : as you change the slider, watch how dramatically the band departs from it.
Three things to try
- Set and read the mobility. Now set it to . The mobility drops by exactly a factor of 10 — that is the Drude scaling staring you in the face.
- Drag the dot away from . The curvature is the same everywhere on a parabola, but the slope (and therefore the group velocity) grows linearly with . This is the textbook fact that an electron at the bottom of a band has zero group velocity, even though it has finite kinetic energy.
- Switch to the hole view. The parabola flips upside down — and yet the curvature has the same magnitude. The negative sign of the second derivative is what makes us reinvent the hole.
Anisotropy — The Effective Mass Tensor
Real bands are almost never spherical. The conduction band of silicon has six equivalent ellipsoidal valleys near (but not at) the X points; each valley is elongated along the direction. The valence band of GaAs has two heavy degenerate bands at Γ that are warped: the equal-energy surfaces are not even ellipsoids but rumpled shapes resembling potatoes. To handle these, we have to keep the full Hessian.
At a band edge in three dimensions, the parabolic approximation is
The inverse mass tensor is symmetric, so it can be diagonalised: rotate coordinates to align with its principal axes and the energy surface becomes
— three independent effective masses along the three principal directions. The equal-energy surface is now an ellipsoid:
For an axially symmetric valley like silicon, two of the masses are equal — call them the transverse mass — and the third is the longitudinal mass . The whole ellipsoid is then specified by two numbers. For Si: , — the valley is a cigar pointing along ⟨100⟩, much heavier along its long axis than across.
Why anisotropy matters for devices
When you grow an MOSFET on a (100) silicon wafer, the channel current flows in the wafer plane. Two of the six Si valleys point out of the plane (their long, heavy axis is normal to the channel) while four point in-plane. Surface confinement raises the energy of the four in-plane valleys, leaving the carriers in the two out-of-plane "heavy-up, light-along" valleys — and these give the channel its low transport mass . This is one of the rare cases where you actually have to care which face of silicon you cut, and it is band-curvature physics all the way down.
Three Flavours of Effective Mass
Once the valley is anisotropic, "the" effective mass ceases to be a single number. Different physical observables average the principal masses in different ways. You will meet three:
1. The band-curvature mass
Direct readout of from the Hessian. This is the mass tensor itself, with three components for an ellipsoidal valley. It is what you get out of a parabolic fit to a VASP band along a chosen direction. Use it for k·p models and for direction-resolved transport.
2. The density-of-states effective mass
The number of carrier states up to energy in a single ellipsoidal valley equals the same quantity for a sphere with mass and the same volume. That gives the geometric mean of the principal masses, multiplied by the number of equivalent valleys raised to a 2/3 power:
Use this when computing the carrier concentration in a doped semiconductor, the position of the Fermi level versus temperature, or the effective DOS prefactors . For Si: — substantially larger than any single principal mass, because the six valleys all contribute states at the same energy.
3. The conductivity effective mass
For transport, the relevant average is the harmonic mean of the principal masses (the inverse mass tensor adds when you sum over equivalent valleys):
Use this in the Drude formula . For Si: — much lighter than the DOS mass because conductivity is dominated by the easy directions, while DOS counts all directions equally.
Cheat sheet: which mass goes where
| Quantity | Mass to use | Why |
|---|---|---|
| Carrier concentration n(T), Fermi level | DOS effective mass | You are counting states up to an energy — geometric mean × valley count |
| Mobility, conductivity, drift velocity | Conductivity effective mass | Inverse masses average — the easy axis carries the current |
| Cyclotron resonance, k·p Hamiltonian | Principal-axis masses (the tensor) | Direction-resolved physics — keep all three |
| Optical absorption near gap, exciton binding | Reduced mass μ from m_e* and m_h* | Two-particle problem — both band-curvature masses combine |
Holes — Curvature Turned Upside Down
At a band maximum like the valence-band top, the curvature is negative: . Plug that into our master formula and you get a negative effective mass for the electron, which then accelerates opposite to the applied force — apply an electric field pointing right and the electron drifts left. That sounds bizarre until you realise the valence band is almost completely full, and what is moving is the absence of an electron.
Define the hole as the missing electron. Its charge is (because removing a negative contributor leaves a net positive); its momentum is (the missing electron had ); its energy is measured from the band top. With these substitutions the curvature
is positive. The hole has positive charge, positive mass, and obeys Newton's second law in the ordinary sense: an electric field pulls it in the field direction, and it drifts down in the valence-band picture (which is up in the hole-band picture — the hole's energy increases as it moves toward higher E-states from the valence-band perspective).
Two simultaneous bookkeeping conventions
You will see both conventions in the literature, and switching between them is a common source of sign errors. (i) The electron picture: keep the electron charge and the negative curvature, and let the dynamics handle themselves. (ii) The hole picture: flip the sign of charge, momentum, and curvature simultaneously, and pretend the carrier is a positively charged particle with positive mass. Both give identical predictions for any observable. VASP and most band-structure codes report eigenvalues in the electron picture; mobility models and device simulators use the hole picture. When reading a number, ask: was its sign already flipped?
Drude Transport: Mobility and Conductivity
We now have everything we need to derive the central transport formula of solid-state physics. Apply a uniform electric field . The semiclassical equation gives ; the wavevector slides at constant rate. If nothing else happened, the electron would just keep accelerating: it would slide up the band, reach the zone boundary, Bragg-reflect, slide back, and oscillate (these are Bloch oscillations, which have actually been observed in cold-atom systems and superlattices). In a normal crystal, however, scattering intervenes long before the electron reaches the zone boundary.
Model scattering as a Poisson process with mean free time : in a small interval the electron is randomly scattered with probability , after which its crystal momentum is reset to a random value drawn from the equilibrium distribution. Between scatterings it accelerates as a Bloch electron of effective mass . The steady-state ensemble-averaged velocity along the field — the drift velocity — is
defining the mobility , with units of . Multiply both sides by the carrier density and the charge to get the current density and read off the Drude conductivity:
Three numbers determine every bulk transport coefficient: how many carriers there are (), how heavy each one is (), and how often they scatter (). DFT gives us and (with statistics) the carrier density; is harder — it comes from electron-phonon coupling, electron-impurity scattering, and electron-electron scattering, each of which is its own chapter. For our purposes it is enough to remember: a band edge with light curvature gives a small mass, which gives high mobility, which gives a fast device.
Numerical sanity check
For GaAs at 300 K: , . Plugging in: — bang on the experimental value of once you account for finite-temperature phonon scattering. The Drude formula is an excellent first-principles estimate.
Interactive — Drift in an Electric Field
Below is a 2D Drude sandbox. Each cyan dot is a carrier with effective mass (slider). The yellow arrows on the right show an applied electric field; the carriers feel a force in the +x direction. With scattering off, every carrier accelerates indefinitely — there is no steady state, the drift velocity climbs linearly, and a perfect crystal would have infinite conductivity. Toggle scattering on and watch the drift velocity saturate near the theoretical value : each carrier accelerates between scattering events, then has its momentum randomised, and the ensemble settles into a steady drift. This is why insulators of any quality have any finite conductivity at all, and why cooling a metal increases its conductivity (longer ).
Three things to try
- With scattering on, double the field. The drift velocity doubles. With scattering off, doubling the field doubles the rate of acceleration, not the velocity — Ohm's law fails completely without scattering.
- With scattering on, halve . The drift velocity doubles. Same field, half the inertia, twice the response — exactly .
- With scattering on, halve . The drift velocity halves. Mobility scales linearly with mean free time — the cleaner your sample (longer ), the faster your device.
Real Materials — A Reference Table
Here is a sampling of room-temperature effective masses for the semiconductors and metals you will most often meet. Numbers are cited in units of ; for ellipsoidal valleys we list both principal masses; for warped valence bands we list the heavy and light hole branches separately.
| Material | m_e* / m₀ | m_hh* / m₀ | m_lh* / m₀ | Gap (eV) | Notes |
|---|---|---|---|---|---|
| Si (indirect) | 0.92 (∥), 0.19 (⊥) | 0.49 | 0.16 | 1.12 | 6 ellipsoidal valleys near X |
| Ge (indirect) | 1.59 (∥), 0.082 (⊥) | 0.33 | 0.043 | 0.66 | 4 ellipsoidal valleys at L |
| GaAs | 0.067 | 0.51 | 0.082 | 1.42 | Direct gap at Γ — workhorse for HEMTs |
| InAs | 0.023 | 0.41 | 0.026 | 0.36 | Very light electrons — high-frequency electronics |
| InSb | 0.014 | 0.43 | 0.015 | 0.17 | Lightest electrons of any common semiconductor |
| CdSe (zincblende) | 0.13 | 0.45 | 0.27 | 1.74 | Our running example — see §VASP below |
| GaN (wurtzite) | 0.20 | 1.4 (A) | 1.1 (B) | 3.4 | Power electronics, heavy holes |
| Diamond | 0.36 | 1.08 | 0.36 | 5.5 | Ultra-wide gap — heavy carriers |
| Cu (metal) | 1.0–1.4 (anisotropic) | — | — | — | Metals don't have a 'mass' in the same sense |
Where these numbers come from
Most of the values in the table are not raw DFT predictions — plain PBE notoriously underestimates band gaps and correspondingly overestimates curvature, leaving electron masses ~30% too light in many III-V materials. The numbers above are experimental, typically extracted from cyclotron resonance, Shubnikov-de Haas oscillations, or magneto-optical absorption. When you compute an effective mass with VASP for the first time, expect to be within a factor of 2 of the experimental number using PBE, and within ~20% using HSE06. Spin-orbit coupling matters for the valence bands of any III-V or II-VI compound, and for the conduction bands of heavy elements like Pb and Bi — turn it on with LSORBIT = .TRUE. in the INCAR (more in §6.3 and §5.7).
Computing Effective Mass in VASP
With the theory in place, the actual VASP computation is a short recipe. Run a band structure (recipe from §5.1), find the band edge in the eigenvalue file, sample the band along a chosen direction densely enough that a parabola is a good fit (≥ 7 points within ±0.05 of the BZ in each direction), and least-squares fit a polynomial. The leading coefficient is ; divide and you have the mass.
The INCAR — what to set, and why
The two-step SCF + non-SCF recipe from §5.1 carries over almost unchanged. The two extra knobs that matter for an accurate effective mass are k-point density and convergence of the eigenvalues at the band edge.
1# INCAR (step 2: non-SCF along high-symmetry path, dense sampling near edge)
2SYSTEM = CdSe zincblende — effective-mass extraction
3PREC = Accurate
4ENCUT = 400 # converged from §6.3 test
5ICHARG = 11 # read CHGCAR, do not update
6LORBIT = 11
7ISMEAR = 0 # Gaussian — semiconductors
8SIGMA = 0.02 # tight smearing — we fit eigenvalues, not occupations
9EDIFF = 1E-7 # tighter than usual SCF — small Δk × small ΔE
10NBANDS = 32
11LSORBIT = .TRUE. # spin-orbit on — matters for VBM in CdSe1# KPOINTS (step 2: line mode, 60 points/segment near Γ for fine sampling)
2Bands for effective-mass fit
360 ! 60 points per segment for fine Δk
4Line
5Reciprocal
6
70.00 0.00 0.00 Gamma
80.05 0.00 0.00 near-Gamma-X ! short segment, dense — for m_e* along ⟨100⟩
9
100.00 0.00 0.00 Gamma
110.05 0.05 0.00 near-Gamma-K ! along ⟨110⟩ — second principal direction
12
130.00 0.00 0.00 Gamma
140.05 0.05 0.05 near-Gamma-L ! along ⟨111⟩ — thirdWhy short segments instead of the full Γ→X path
The parabolic approximation only holds within ~5% of the BZ around the edge. If you fit a parabola to the entire Γ→X line (which goes all the way to the zone boundary) you will get the curvature averaged over a region where the band has already turned non-parabolic — often by a factor of 2 wrong. Always run a separate KPOINTS file with short, dense segments around the band edge for effective-mass extraction. For an anisotropic valley you need at least three independent directions to reconstruct the full tensor.
Parsing vasprun.xml — the parabolic fit, line by line
Every modern VASP post-processing pipeline starts with pymatgen's Vasprun parser. The snippet below loads the band structure, locates the conduction band minimum, slices a 9-point window around it, fits a parabola, and prints the resulting effective mass. Click any line to see the full execution trace and what each library function does.
Production-grade upgrades
- Use a higher-order fit with cross-validated order.
np.polyfit(dk, dE, deg=4)gives the secondand fourth derivatives — the latter measures non-parabolicity and is essential for narrow-gap semiconductors like InSb where the conduction band is famously non-parabolic. - Fit a 3D paraboloid when you sample multiple directions:
scipy.optimize.curve_fitwith the model extracts the full mass tensor in one shot. - Use the EffectiveMass class from
effmass(a community Python package built specifically for this) — it handles non-parabolicity, finite-temperature occupations, and the Boltzmann transport integrals for you. Cite Whalley, J. Open Source Softw. 3, 797 (2018).
Summary
We started with a puzzle — that an electron in a crystal responds to force as if its mass had been silently rewritten — and we resolved it with a single, beautiful identity:
Curvature in k-space is, literally, an inverse mass. From this one relation, with no further heavy lifting, we deduced:
- The parabolic-band approximation — the universal local description of every band edge.
- The effective-mass tensor for anisotropic valleys, with three principal masses; and the difference between (band curvature), (geometric mean of principal masses, used for state counting), and (harmonic mean, used for conductivity).
- The hole concept — a missing electron at a band maximum, with positive charge, positive mass, and ordinary dynamics, accounting for the negative curvature with a sign flip in every equation simultaneously.
- The Drude formulae and , which turn a single number on a band-structure plot into a quantitative prediction of every bulk transport coefficient — and the interactive sandbox showed why scattering is the physics that makes those formulae finite.
- A VASP recipe with a parabolic fit to
vasprun.xml, fully traceable line by line, that gives in 30 lines of Python.
In the next section (5.5) we use these masses to compute the optical properties of a semiconductor — the absorption edge, the dielectric tensor, and the joint density of states are all written in the language of effective masses and the parabolic-band approximation we built here.