2406.02234
On the Limitations of Fractal Dimension as a Measure of Generalization
Charlie B. Tan, Inés García-Redondo, Qiquan Wang, Michael M. Bronstein, Anthea Monod
correctmedium confidence
- Category
- Not specified
- Journal tier
- Specialist/Solid
- Processed
- Sep 28, 2025, 12:56 AM
- arXiv Links
- Abstract ↗PDF ↗
Audit review
The paper estimates the PH-dimension of the full optimization trajectory WS by regressing log E0^1(Wn) versus log n, using 5,000 post-convergence iterates as a finite sample to build multiple subsamples Wn; it does not set dimPH0(xS) for the finite sample equal to the run’s PH-dimension. This is explicit in their Experimental Design and the summary of Birdal et al.’s estimator linking the slope m of log E0^1(Wn) to dimPH0(WS) ≈ α/(1−m) (with α=1) . The model’s core claim—that because any finite set has dimPH0=0, the run-level PH-dimensions are identically zero—misapplies the definition by replacing WS with a single finite sample xS; the paper, in contrast, follows the theoretical setup where dimPH0 is a property of WS and is estimated via scaling of E0^α(Wn) over n (not taken as dimPH0(xS)) . Empirically, the paper finds: (i) PH dimensions correlate more weakly with generalization gap than simpler measures like the final parameter norm and the LR/BS ratio, per Table 1 and Fisher z-tests ; (ii) much apparent correlation disappears after conditioning on learning rate via partial correlations and conditional mutual information tests, depending on dataset and batch size ; and (iii) PH magnitudes exhibit a model-wise double descent pattern with width, aligned more with test accuracy than with the accuracy gap . These results contradict the model’s asserted “impossibility” of hyperparameter confounding and double descent under the stated measures. In short, the paper’s methodology and conclusions are consistent with the PH-dimension framework, whereas the model’s argument attacks a straw definition (dimPH0 of a finite sample) rather than the trajectory-level quantity actually used.
Referee report (LaTeX)
\textbf{Recommendation:} minor revisions \textbf{Journal Tier:} specialist/solid \textbf{Justification:} A careful empirical audit of PH-dimension as a generalization proxy. It uses appropriate estimators and statistical tools to show PH-dimension correlates weakly with generalization gap and is confounded by hyperparameters, while also documenting a double descent pattern in PH magnitudes. Minor clarifications would improve accessibility and reproducibility.