Notes on the KAM theorem
1 Introduction
I hope eventually to expand these notes into a standalone presentation of KAM that presents a precise formulation of the theorem and gives detailed proofs of everything. There are few presentations of KAM in the literature that give a precise formulation of the theorem, and even those that give precise formulations such as [6] and [7] glide over some details. Gallavotti [4] explains the history of quasi-periodic phenomena in celestial mechanics.
Let .
For , let . Let and let . For , we have .
If is a symplectic manifold and , then the Hamiltonian vector field with energy function is the vector field on uniquely determined by the condition for all points and tangent vectors .
We say that are canonical coordinates for if . If are canonical coordinates for and then
for all , where
Let be the flow of on . Then
called Hamilton’s equations.
2 Action-angle coordinates
Let be a -dimensional symplectic manifold. Let . If for all (namely the functions are in involution) and if at each point in the differentials of the functions are linearly independent in the cotangent space at that point, then we say that the set of functions is completely integrable.
We define the momentum map by .
We say that is locally trivial at a value in its range if there is a neighborhood of such that for all there is a smooth map such that is a diffeomorphism from to . The bifurcation set of is the set of at which fails to be locally trivial.
The following theorem is proved in [1, Theorem 5.2.21].
Theorem 1.
Let be open. If is a proper map then each of the vector fields is complete, , and the fibers of the locally trivial fibration are disjoint unions of manifolds each diffeomorphic with .
Let , and define the linear flow on by . Let be the projection map and let be such that ; if then , so such a map exists, and is clearly unique. A flow on induced by a linear flow on is called a quasi-periodic flow.
Say that , and let be the flow induced by and be the flow induced by . Then for some , and for any such that , for any . Hence . Thus a quasi-periodic flow is induced by a unique vector . We call the frequency vector of the flow .
We say that is resonant if there is some such that , and we say that it is nonresonant otherwise.
Let be the quasi-periodic flow on with frequency vector . It can be shown that each orbit of is dense in if and only if is nonresonant. This is proved in [1, pp. 818–820]; that each orbit of is dense in if is nonresonant is proved in [5, Theorem 444].
Let ; we call this distinguished function the Hamiltonian, and we are concerned with the flow of the Hamiltonian vector field .
The following theorem is proved in [1, Theorem 5.2.24].
Theorem 2.
Let be in the range of , let denote a connected component of , and let be the flow of . Then there is a quasiperiodic flow on and a diffeomorphism such that .
Let and let . Let , where is the identity matrix. For we have that .
Let be an open ball in . is a symplectic submanifold of . We define coordinates and , . If does not depend on then we say that it has action-angle coordinates in .
If admits action-angle coordinates then for all we have
i.e. for all , and as depends only on this gives
where . We integrate this equation from to and get
Thus for , given the trajectory of under the Hamiltonian flow of can be explicitly seen if we know . We say that a value of determines an invariant torus for the Hamiltonian flow of .
If is a symplectic manifold and , we say that admits action-angle coordinates on an open set if there exists a symplectic diffeomorphism such that has action-angle coordinates in . If admits action-angle coordinates, then one can check that the push-forward is the Hamiltonian vector field , so that
Let . If the set is completely integrable, with , then for any open set for which for all , Abraham and Marsden [1, pp. 398–400] find action-angle coordinates in . Here , the momentum map. This construction is also explained by Arnold [2, pp. 282–284].
Suppose that has action-angle coordinates , and assume that for all ,
Then by the inverse function theorem, for every there is a neighborhood of and a neighborhood of such that is a diffeomorphism. In we can use and as coordinates.
For , let , and let be the rank of the -module , i.e. the maximal number of elements of that are linearly independent over . The proof of the following theorem follows [8, Proposition 2.1].
Theorem 3.
Let and let . In the torus with frequency , each trajectory is dense in some -dimensional subtorus and the -dimensional torus is foliated by these -dimensional tori.
Proof.
There exists a basis of and vectors such that the matrix with rows has determinant . (I should show why such a basis exists.) Let . is an matrix and is an matrix.
Let . Since , is invertible over . The coordinate is only determined up to , and for then also . Thus are coordinates on . The equation can be written using the coordinates as . Then
Let .
We see that ; since they both have rank , they are equal. It follows that is nonresonant. Hence any trajectory on the -dimensional torus with frequency is dense in the -dimensional torus . ∎
3 Diophantine frequency vectors
For and we define
We further define .
Theorem 4.
For any and for any positive integer , there is some with such that
Proof.
Let . The set has elements. For we have
Let .
Let . In the set , there are two elements that are in same interval , , since has elements and there are such intervals. That is, there are such that for some . Hence .
One can show by induction that for all , . Therefore for we have
Finally, . ∎
Corollary 5.
If then .
Proof.
Let . Suppose that there is some . Let be the least integer such that is greater than ; since such a exists.
∎
Treschev and Zubelevich give a construction for points in for sufficiently large [8, Theorem 9.2]. Thus there is some such that for all , . It is clear that for we have the inclusion . Hence this construction also shows that for all and . However this construction does not show that for . Indeed, one can show that , but also that the set has Hausdorff dimension [7, p. 5].
Our proof of the following theorem expands on [8, Theorem 9.3]. Let , the cube in of edge length . Let be -dimensional Lebesgue measure. We will use the fact that the maximal dimensional area of the intersection of and a hyperplane is [3].
Theorem 6.
Let . For and ,
Proof.
Let . Let . Let . Then there is some such that , and so . Thus
so
Let . is the region bounded by the two hyperplanes and . Let and . For any two points we can check that , and for any two points we can check that . Thus the vector is orthogonal to each of the hyperplanes and . It follows that the distance between the hyperplanes and is the distance between the points and , which is . Since , this is . Therefore
where we use the fact that the maximal dimensional area of the intersection of and a hyperplane is [3].
For each positive integer , the hypercube has faces, on each of which there are points with integer coordinates. Hence for each integer positive integer , we have .
Therefore
Since the terms in the sum are positive and decreasing, we can estimate the sum using an integral:
finishing the proof. ∎
Corollary 7.
If then
Proof.
Fix . Let . Let be an -neighborhood of the boundary of . We will make whatever assumption about we need in order to get .
Suppose that is sufficiently large so that . Then Theorem 6 gives us that .
Let . Since , we have .
4 Statement of KAM
If we have a Hamiltonian system which admits action-angle coordinates in , then the trajectories of points in phase space are constrained to lie on invariant tori. Moreover, on these tori the dynamics of the system are quasi-periodic; a priori we don’t have a reason to expect that the dynamics should be so nice just because the trajectories lie on tori. But a generic Hamiltonian on the same phase space (I would like to make this notion precise) does not admit action-angle coordinates. The KAM theorem is a statement about the dynamics induced by making a sufficiently small change to a Hamiltonian. If we perturb a Hamiltonian which admits action-angle coordinates to one which probably does not, if the perturbation is sufficiently small, then most of the trajectories of points under the flow of the new Hamiltonian will also lie on tori. In some sense which I want to clarify, the invariant tori of the new Hamiltonian are close to the invariant tori of the Hamiltonian that admits action-angle coordinates. It is not clear to me how an invariant torus of the old Hamiltonian transforms into an invariant torus of the new Hamiltonian; in what sense does an invariant torus for the old Hamiltonian become an invariant torus for the new Hamiltonian?
In particular, a consequence of the KAM theorem is that if we make a small perturbation of a Hamiltonian system that admits action-angle coordinates then the trajectories of most points will not be dense on a hypersurface in phase space, since they are constrained to lie on -dimensional tori. In other words, the new Hamiltonian system is not ergodic, since the invariant tori have lower dimension than , and so have -dimensional measure 0.
Let’s explain the KAM theorem in another way. Suppose that we have a symplectic manifold and a Lagrangian foliation whose leaves are tori, and suppose that the leaves of are invariant tori for a Hamiltonian . That is, the Hamiltonian vector field is tangent to all the leaves in . Now let . The leaves of the foliation will not be invariant under the flow of . We would like to obtain a symplectomorphism such that the Hamiltonian vector field is tangent to most leaves in the foliation . Here we mean most in a measure theoretic sense that depends on the magnitude of the perturbation away from the Hamiltonian that admits action-angle coordinates.
How do we construct a diffeomorphism? Often the best way is to demand that it be the time flow of a vector field, so for some , and to see if such a vector field exists. Suppose that is a function such that if is the flow of then .
5 Normal forms
Normal forms of vector fields, homological equation [9].
References
- [1] (2008) Foundations of mechanics. Second edition, AMS Chelsea Publishing, Providence, Rhode Island. Cited by: §2, §2, §2, §2.
- [2] (1989) Mathematical methods of classical mechanics. Second edition, Graduate Texts in Mathematics, Vol. 60, Springer. Cited by: §2.
- [3] (1986) Cube slicing in . Proc. Amer. Math. Soc. 97 (3), pp. 465–473. External Links: ISSN 0002-9939, Document, Link, MathReview (Jeffrey D. Vaaler) Cited by: §3, §3.
- [4] (2001) Quasi periodic motions from Hipparchus to Kolmogorov. Atti della Accademia Nazionale dei Lincei. Classe di Scienze Fisiche, Matematiche e Naturali. Rendiconti Lincei. Matematica e Applicazioni 12 (2), pp. 125–152. Cited by: §1.
- [5] (2008) An introduction to the theory of numbers. Sixth edition, Oxford University Press. External Links: ISBN 978-0-19-921986-5 Cited by: §2.
- [6] (2004) A proof of Kolmogorov’s theorem. Discrete Contin. Dyn. Syst. 10 (1-2), pp. 367–385. External Links: ISSN 1078-0947, Document, Link, MathReview (Dario Bambusi) Cited by: §1.
- [7] (2001) A lecture on the classical KAM theorem. In Smooth Ergodic Theory and Its Applications, A. Katok, R. de la Llave, Y. Pesin, and H. Weiss (Eds.), Proceedings of Symposia in Pure Mathematics, Vol. 69, pp. 707–732. Cited by: §1, §3.
- [8] (2010) Introduction to the perturbation theory of Hamiltonian systems. Springer Monographs in Mathematics, Springer. Cited by: §2, §3, §3.
- [9] (2003) Introduction to applied nonlinear dynamical systems and chaos. second edition, Texts in Applied Mathematics, Vol. 2, Springer. Cited by: §5.