2204.00912
Escaping High-order Saddles in Policy Optimization for Linear Quadratic Gaussian (LQG) Control
Yang Zheng, Yue Sun, Maryam Fazel, Na Li
correctmedium confidence
- Category
- Not specified
- Journal tier
- Strong Field
- Processed
- Sep 28, 2025, 12:56 AM
- arXiv Links
- Abstract ↗PDF ↗
Audit review
The paper’s Theorem 2 proves that for a non-minimal stationary point K̃ built by augmenting a minimal realization with a symmetric Hurwitz Λ, the Hessian of the full-order LQG cost is indefinite whenever eig(−Λ) avoids the zero set of a certain transfer G(s); it further deduces the almost-sure strict-saddle property for random Λ and that global optimality forces G ≡ 0. The candidate solution derives an explicit bilinear form for the Hessian along the same coupling directions and reaches the same three conclusions via the same block-augmentation/Lyapunov–Sylvester machinery. The two arguments align on definitions, structure, and logic; the model’s derivation is slightly more explicit (resolvent/Sylvester and a sum over eigenvalues) but substantively the same.
Referee report (LaTeX)
\textbf{Recommendation:} minor revisions \textbf{Journal Tier:} strong field \textbf{Justification:} The paper offers a clear structural characterization of non-minimal stationary points in full-order LQG and a crisp Hessian-based certificate (via G(s)) for strict saddles, with convincing examples and a plausible algorithmic application. The appendix computations are sound and align with standard Lyapunov/Sylvester manipulations. Minor clarifications (on the zero set Z and precise randomness assumptions for Λ) would further strengthen readability.