Let denote a batch from the live data-generating process at time . A self-healing system is a policy choosing an adaptation action so as to minimize:
Reason-agnostic baselines collapse this to a fixed action (say, "retrain on the last samples"). They never ask why performance dropped, and so they can't choose actions whose mechanism matches the actual shift.
ℋ-LLM instead splits the policy into two LLM-guided stages:
- diagnose · propose a hypothesis about the structure of
- prescribe · pick whose mechanism targets
- verify · accept the action only if held-out risk improves
The catalogue here includes refit_recent, importance_reweight, drop_spurious_feature, recalibrate_threshold, and no_op — each more or less appropriate depending on whether the diagnosis is covariate, label, concept, or spurious.