Deep Learning for predicting rate-induced tipping

Yu Huang, Sebastian Bathiany, Peter Ashwin, Niklas Boers

incompletemedium confidence

Category: Not specified
Journal tier: Strong Field
Processed: Sep 28, 2025, 12:56 AM
arXiv Links: Abstract ↗PDF ↗

Audit review

The paper demonstrates empirically that DL-based R-tipping probabilities for groups A (tipping) and B (non-tipping) are distinguishable by two-sample KS tests up to long lead times—about 290, 130, and 1000 steps for the Saddle–node, Bautin, and Compost-bomb systems, respectively (main text and SI Fig. S2) . It also shows that classical CSD indicators (variance and lag-1 autocorrelation) fail to discriminate the groups: composite means and 99% envelopes largely overlap (main text Fig. 3), and KS decisions for CSD are reported alongside (SI Fig. S2) . Data preprocessing and CSD computations (alignment; detrending window 100; sliding window 120) match the model’s assumptions . Building on these empirical findings, the model’s measure-theoretic construction (a fixed Borel isomorphism on the past-window space) correctly yields an ℓ-independent univariate map whose pushforward distributions differ for A vs B at all tested lead times; consistency of KS then implies rejection at α = 0.01 for sufficiently large samples. Its structural argument for the CSD limitation is plausible and consistent with the reported results. Minor issues: the model conflates diagnostic windows in SI Fig. S1 (100/200/100 steps) with the DL input length, and it implicitly treats sample-level KS rejections as population truths; neither undermines the core conclusions .

Referee report (LaTeX)

\textbf{Recommendation:} minor revisions

\textbf{Journal Tier:} strong field

\textbf{Justification:}

The paper convincingly demonstrates that DL-based indicators can predict rate-induced tipping under noise long before transitions, and that classical CSD indicators fail in this setting. The candidate’s solution adds a sound, non-constructive existence argument (via Borel isomorphisms) for a single univariate map separating A/B across lead times, and a plausible structural explanation for CSD’s limitations. Minor clarifications about window lengths, precise KS-testing protocols, and the interpretation of significance would further strengthen the presentation.