A short introduction to the control theory in finite-dimensional spaces

Hoai-Minh Nguyen

correcthigh confidence

Category: Not specified
Journal tier: Specialist/Solid
Processed: Sep 28, 2025, 12:56 AM
arXiv Links: Abstract ↗PDF ↗

Audit review

The paper proves existence/uniqueness, Pontryagin-type optimality, and the state-feedback form u*(t) = -B^T P_T(t) x*(t) for the finite-horizon LQR, via a convexity argument, an integral optimality identity (Lemma 9.1), and an adjoint/value-function identification y*(t) = P_T(t) x*(t) (Theorem 9.1) . The candidate solution reaches the same conclusions using a standard direct-method existence proof, a variational Lagrange-multiplier derivation of the adjoint, an explicit dynamic programming “splicing” argument, and an envelope-identity to identify y* with the gradient of the value function, hence y*(t)=P_T(t)x*(t), yielding the same feedback law. The paper and model differ mainly in how y*(t)=P_T(t)x*(t) is shown (inner-product identity vs. envelope/gradient), but the results and normalizations coincide; the paper’s Remark 9.6 explicitly situates the proof at the intersection of Pontryagin and dynamic programming, matching the model’s approach in spirit .

Referee report (LaTeX)

\textbf{Recommendation:} minor revisions

\textbf{Journal Tier:} specialist/solid

\textbf{Justification:}

A clear, correct, and pedagogically valuable presentation of finite-horizon LQR. While the results are classical, the integration of a variational (Pontryagin-type) derivation with a value-function perspective is well executed and useful for readers. Minor clarifications would enhance readability and self-containment.