The Delivery Truck, the 787, the Cruise Ship
A FedEx tractor breaks down? Annoying, expensive, but the parcels reach their destination a day late and the country keeps turning. A 787 turbofan fails in flight? An entirely different conversation: passenger lives, an emergency landing, an AOG event whose cost reaches eight figures. A cruise ship six days at sea? Somewhere between the two: the operator wants high availability, but a stranded vessel is closer to the 787 end of the spectrum than the FedEx end.
Each of these operators has identical fleet management software and identical RUL prediction models — but they want different things from those models. The trucking company wants minimum RMSE so it can schedule its maintenance shop tightly. The airline wants minimum NASA score so its model never surprises a flight crew. The cruise line wants somewhere in between. The same neural network, three different deployment regimes.
One Combined Cost, One Knob
Formalise the choice with a single safety weight . Define the combined operational cost of a model as
where and are min-max-normalised versions of the two metrics over a candidate set of models. Three special cases:
| Regime | What it minimises | Production analogue | |
|---|---|---|---|
| Accuracy-first | Pure RMSE | Offline batch analytics, fleet planning | |
| Balanced | Equal RMSE / NASA | Most production fleets | |
| Safety-first | Pure NASA | Real-time monitoring, safety-critical |
The same knob can be expressed inside the training loss too — making it a hyper-parameter you can choose at deployment time without retraining the architecture. We will see exactly that in the PyTorch code below.
Interactive: The Frontier with Real Numbers
The chart below is the actual table from the paper — the multi-condition (FD002 + FD004) average RMSE and NASA scores, across five random seeds, for ten methods. The three green dots are the three proposed models. The red dot is DKAMFormer, the previous published state of the art. Drag the safety-weight slider to switch regimes; the model that minimises is highlighted with a black ring.
Three observations the chart makes plain. First, the three proposed models each own a regime: AMNL wins at , GABA wins around , GRACE wins at . Second, the dashed green line is the Pareto frontier — every model on it is non-dominated, every model off it is irrelevant for any choice of . Third, DKAMFormer (the red dot) lies dramatically outside the frontier, dominated on both axes by every model in the framework. That single observation is Chapter 26's headline result.
Three Models, One Per Regime
The mapping the rest of the book commits to:
| Regime | Model | Loss family | What changes |
|---|---|---|---|
| Accuracy-first () | AMNL (Ch 14-16) | Failure-biased weighted MSE | Sample weights up-weight near-failure cycles |
| Balanced () | GABA (Ch 17-20) | Standard MSE + adaptive task weighting | Inverse-gradient controller equalises task contributions |
| Safety-first () | GRACE (Ch 21-23) | Failure-biased MSE + adaptive task weighting | Combines AMNL's loss shape with GABA's adaptation |
Python: A Regime-Selectable Loss
Concretise the knob in code. Three synthetic models stand in for AMNL, GABA, and GRACE; one combined-cost function ranks them under any regime weight.
Reading the table
Each row is one model; each column is one regime. The lowest number in each column is the model the operator should ship under that regime. AMNL wins column [email protected]; GABA wins column [email protected]; GRACE wins column [email protected]. Three independent winners is the compact statement of why the book trains three separate objectives.
PyTorch: The Same Switch as a Module
The same regime knob lives naturally in the training loss. The module below is a foreshadowing of the GRACE loss in Chapter 21 — one knob, smooth interpolation, GPU-friendly, autograd-friendly.
self.w with the GABA adaptive controller. The training loop never knows the difference — the loss always exposes a single forward(pred, actual) contract.A Decision Tree for Practitioners
Five questions. Five-second decision.
| Question | If yes | Recommended model |
|---|---|---|
| Are you doing offline batch analytics on already-failed fleets? | RMSE matters; a late prediction has no real-world consequence. | AMNL |
| Are you running real-time safety-critical monitoring? | A late prediction is a crash, a fire, or a death. | GRACE |
| Are you somewhere in between - normal production fleet? | Balanced cost; you want competitive RMSE without unsafe surprises. | GABA |
| Are you writing a paper that reports only RMSE? | Reviewers will accept it - the world will be slightly less safe. | AMNL (carefully cited) |
| Do you have unlimited compute and want the absolute Pareto-best? | Train all three, deploy whichever your operator picks. | All three |
The Same Knob in Other Industries
The accuracy-safety tradeoff with a single regime knob is not unique to prognostics. It appears, with different variable names, almost everywhere a prediction has asymmetric consequences.
| Industry | Accuracy regime | Safety regime | Knob |
|---|---|---|---|
| Aviation RUL (this book) | Tight RMSE, occasional surprise | Loose RMSE, never surprise | Sample weights / NASA loss |
| Medical imaging triage | Catch every true positive | Never miss a malignant case | Threshold on classifier score |
| Autonomous-vehicle braking | Smooth ride, brake late | Brake early, hard if unsure | Time-to-collision threshold |
| Loan underwriting | Low default rate | No discriminatory false denials | Fairness regulariser weight |
| Quant trading | Maximise expected return | Cap drawdown | Risk aversion λ |
| Power-grid reserves | Minimise spinning reserve cost | Never blackout | Safety stock factor |
| Climate-risk modelling | Best estimate of warming | 95th-percentile worst case | Quantile τ |
Every row is a different name for the same knob. The book's machinery for building, training, and choosing among models on the Pareto frontier carries over directly.
The book in two sentences. One backbone. Three losses. One knob to choose between them.
Takeaway
- There is no single best RUL model. There is a Pareto frontier and three useful operating points on it.
- One combined cost, one knob. collapses the choice into a single scalar in .
- Three models, three regimes. AMNL owns w near 0; GABA owns w near 0.5; GRACE owns w near 1. Real numbers from the paper bear this out.
- The same backbone underlies all three. Only the training objective differs — meaning all of Part III applies to all three of Parts V, VI, and VII.
- The published SOTA is dominated. Every model in our framework, including the simplest baseline, beats DKAMFormer on both axes on multi-condition data.