Notes on the KAM theorem

Jordan Bell

April 6, 2015

1 Introduction

I hope eventually to expand these notes into a standalone presentation of KAM that presents a precise formulation of the theorem and gives detailed proofs of everything. There are few presentations of KAM in the literature that give a precise formulation of the theorem, and even those that give precise formulations such as [6] and [7] glide over some details. Gallavotti [4] explains the history of quasi-periodic phenomena in celestial mechanics.

Let $\mathbb{T}^{n}=\mathbb{R}^{n}/\mathbb{Z}^{n}$ .

For $x,y\in\mathbb{R}^{n}$ , let $\langle x,y\rangle=\sum_{j=1}^{n}x_{j}y_{j}$ . Let $\|x\|=\sum_{j=1}^{n}x_{j}^{2}$ and let $\|x\|_{\infty}=\max_{1\leq j\leq n}|x_{j}|$ . For $x,y\in\mathbb{R}^{n}$ , we have $|\langle x,y\rangle|\leq n\|x\|_{\infty}\|y\|_{\infty}$ .

If $(M,\omega)$ is a symplectic manifold and $H\in C^{\infty}(M)$ , then the Hamiltonian vector field with energy function $H$ is the vector field $X_{H}$ on $M$ uniquely determined by the condition $\omega_{x}(X_{H}(x),v)=(dH)(x)(v)$ for all points $x\in M$ and tangent vectors $v\in T_{x}M$ .

We say that $(q^{1},\ldots,q^{n},p_{1},\ldots,p_{n})$ are canonical coordinates for $(M,\omega)$ if $\omega=\sum_{j=1}^{n}dq^{j}\wedge dp_{j}$ . If $(q^{1},\ldots,q^{n},p_{1},\ldots,p_{n})$ are canonical coordinates for $(M,\omega)$ and $H\in C^{\infty}(M)$ then

X_{H}(x)=((\partial_{p}H)(x),(-\partial_{q}H)(x))

for all $x\in M$ , where

\partial_{q}H=\Big{(}\frac{\partial H}{\partial q^{1}},\ldots,\frac{\partial H% }{\partial q^{n}}\Big{)},\quad\partial_{p}H=\Big{(}\frac{\partial H}{\partial p% _{1}},\ldots,\frac{\partial H}{\partial p_{n}}\Big{)}.

Let $\phi$ be the flow of $X_{H}$ on $M$ . Then

\frac{d(q^{j}(\phi_{t}(x)))}{dt}=\frac{\partial H}{\partial p_{j}}(\phi_{t}(x)% ),\quad\frac{d(p_{j}(\phi_{t}(x)))}{dt}=-\frac{\partial H}{\partial q^{j}}(% \phi_{t}(x)),

called Hamilton’s equations.

2 Action-angle coordinates

Let $(M,\omega)$ be a $2n$ -dimensional symplectic manifold. Let $f_{1},\ldots,f_{n}\in C^{\infty}(M)$ . If $\{f_{i},f_{j}\}=0$ for all $1\leq i,j\leq n$ (namely the functions are in involution) and if at each point in $M$ the differentials of the functions are linearly independent in the cotangent space at that point, then we say that the set of functions is completely integrable.

We define the momentum map $F:M\to\mathbb{R}^{n}$ by $F=f_{1}\times\cdots\times f_{n}$ .

We say that $F$ is locally trivial at a value $y_{0}$ in its range if there is a neighborhood $U$ of $y_{0}$ such that for all $y\in U$ there is a smooth map $h_{y}:F^{-1}(U)\to F^{-1}(y_{0})$ such that $F\times h_{y}$ is a diffeomorphism from $F^{-1}(U)$ to $U\times F^{-1}(y_{0})$ . The bifurcation set of $F$ is the set $\Sigma_{F}$ of $y_{0}\in\mathbb{R}^{n}$ at which $F$ fails to be locally trivial.

The following theorem is proved in [1, Theorem 5.2.21].

Theorem 1.

Let $U\subseteq\mathbb{R}^{n}$ be open. If $F|F^{-1}(U):F^{-1}(U)\to U$ is a proper map then each of the vector fields $X_{f_{i}}|F^{-1}(U)$ is complete, $U\subseteq\mathbb{R}^{n}\setminus\Sigma_{F}$ , and the fibers of the locally trivial fibration $F|F^{-1}(U)$ are disjoint unions of manifolds each diffeomorphic with $\mathbb{T}^{n}$ .

Let $\nu\in\mathbb{R}^{n}$ , and define the linear flow $F$ on $\mathbb{R}^{n}$ by $F_{t}(v)=v+t\nu$ . Let $\pi:\mathbb{R}^{n}\to\mathbb{T}^{n}$ be the projection map and let $\phi_{t}:\mathbb{T}^{n}\to\mathbb{T}^{n}$ be such that $\pi\circ F_{t}=\phi_{t}\circ\pi$ ; if $\pi(v_{1})=\pi(v_{2})$ then $\pi\circ F_{t}(v_{1})=\pi\circ F_{t}(v_{2})$ , so such a map exists, and is clearly unique. A flow $\phi$ on $\mathbb{T}^{n}$ induced by a linear flow on $\mathbb{R}^{n}$ is called a quasi-periodic flow.

Say that $\nu\neq\mu$ , and let $\phi$ be the flow induced by $\nu$ and $\psi$ be the flow induced by $\mu$ . Then for some $i$ , $\nu_{i}\neq\mu_{i}$ and for any $t$ such that $t(\nu_{i}-\mu_{i})\not\in\mathbb{Z}$ , $\phi_{t}(\theta)\neq\psi_{t}(\theta)$ for any $\theta\in\mathbb{T}^{n}$ . Hence $\phi\neq\psi$ . Thus a quasi-periodic flow is induced by a unique vector $\nu\in\mathbb{R}^{n}$ . We call $\nu$ the frequency vector of the flow $\phi$ .

We say that $\nu\in\mathbb{R}^{n}$ is resonant if there is some $0\neq k\in\mathbb{Z}^{n}$ such that $\langle k,\nu\rangle=0$ , and we say that it is nonresonant otherwise.

Let $\phi$ be the quasi-periodic flow on $\mathbb{T}^{n}$ with frequency vector $\nu\in\mathbb{R}^{n}$ . It can be shown that each orbit of $\phi$ is dense in $\mathbb{T}^{n}$ if and only if $\nu$ is nonresonant. This is proved in [1, pp. 818–820]; that each orbit of $\phi$ is dense in $\mathbb{T}^{n}$ if $\nu$ is nonresonant is proved in [5, Theorem 444].

Let $H=f_{1}$ ; we call this distinguished function the Hamiltonian, and we are concerned with the flow of the Hamiltonian vector field $X_{H}$ .

The following theorem is proved in [1, Theorem 5.2.24].

Theorem 2.

Let $c$ be in the range of $F$ , let $I_{c}^{0}$ denote a connected component of $F^{-1}(c)$ , and let $\phi$ be the flow of $X_{H}$ . Then there is a quasiperiodic flow $\psi$ on $\mathbb{T}^{n}$ and a diffeomorphism $g:\mathbb{T}^{n}\to I_{c}^{0}$ such that $g\circ\psi_{t}=\phi_{t}|I_{c}^{0}\circ g$ .

Let $\mathbb{R}^{2n}=\{q^{1},\ldots,q^{n},p_{1},\ldots,p_{n}\}$ and let $\omega=\sum_{j=1}^{n}dq^{j}\wedge dp_{j}$ . Let $J=\begin{bmatrix}0&I\\ -I&0\end{bmatrix}$ , where $I$ is the $n\times n$ identity matrix. For $u,v\in\mathbb{R}^{2n}$ we have that $\omega(u,v)=\langle u,Jv\rangle$ .

Let $B^{n}$ be an open ball in $\mathbb{R}^{n}$ . $B^{n}\times\mathbb{T}^{n}$ is a symplectic submanifold of $\mathbb{R}^{2n}$ . We define coordinates $I^{j}=q^{j}$ and $\theta_{j}=p_{j}+\mathbb{Z}$ , $j=1,\ldots,n$ . If $H\in C^{\infty}(B^{n}\times\mathbb{T}^{n})$ does not depend on $\theta_{1},\ldots,\theta_{n}$ then we say that it has action-angle coordinates in $B^{n}\times\mathbb{T}^{n}$ .

If $H\in C^{\infty}(B^{n}\times\mathbb{T}^{n})$ admits action-angle coordinates $(I,\theta)$ then for all $x\in B^{n}\times\mathbb{T}^{n}$ we have

\frac{d(I^{j}(\phi_{t}(x)))}{dt}=\frac{\partial H}{\partial\theta_{j}}(\phi_{t% }(x))=0,

i.e. $I^{j}(\phi_{t}(x))=I^{j}(x)$ for all $t$ , and as $H$ depends only on $I$ this gives

\frac{d(\theta_{j}(\phi_{t}(x)))}{dt}=-\frac{\partial H}{\partial I^{j}}(\phi_% {t}(x))=-\frac{\partial H}{\partial I^{j}}(x)=\nu_{j},

where $\nu=\nu(I(x))$ . We integrate this equation from $0$ to $t$ and get

\theta_{j}(\phi_{t}(x))-\theta_{j}(x)=t\nu_{j}.

Thus for $x\in B^{n}\times\mathbb{T}^{n}$ , given $I(x)$ the trajectory $\phi_{t}(x)$ of $x$ under the Hamiltonian flow of $H$ can be explicitly seen if we know $\nu(I(x))$ . We say that a value of $I$ determines an invariant torus for the Hamiltonian flow of $H$ .

If $(M,\omega)$ is a symplectic manifold and $H\in C^{\infty}(M)$ , we say that $H$ admits action-angle coordinates $(I,\theta)$ on an open set $U\subset M$ if there exists a symplectic diffeomorphism $\psi:U\to B^{n}\times\mathbb{T}^{n}$ such that $H\circ\psi^{-1}$ has action-angle coordinates $(I,\theta)$ in $B^{n}\times\mathbb{T}^{n}$ . If $H$ admits action-angle coordinates, then one can check that the push-forward $\psi_{*}X_{H}$ is the Hamiltonian vector field $X_{H\circ\psi^{-1}}$ , so that

\psi_{*}X_{H}=-\sum_{j=1}^{n}\frac{\partial(H\circ\psi^{-1})}{\partial I_{j}}% \frac{\partial}{\partial\theta_{j}}.

Let $f_{1},\ldots,f_{n}\in C^{\infty}(\mathbb{R}^{2n})$ . If the set $\{f_{1},\ldots,f_{n}\}$ is completely integrable, with $H=f_{1}$ , then for any open set $U\subseteq\mathbb{R}^{2n}\setminus\Sigma_{F}$ for which $F^{-1}(c)=\mathbb{T}^{n}$ for all $c\in U$ , Abraham and Marsden [1, pp. 398–400] find action-angle coordinates in $U$ . Here $F=f_{1}\times\cdots f_{n}$ , the momentum map. This construction is also explained by Arnold [2, pp. 282–284].

Suppose that $H\in C^{\infty}(B^{n}\times\mathbb{T}^{n})$ has action-angle coordinates $(I,\theta)$ , and assume that for all $I\in B^{n}$ ,

\det(\partial^{2}_{I}H(I))\neq 0.

Then by the inverse function theorem, for every $I\in B^{n}$ there is a neighborhood $U$ of $I$ and a neighborhood $V$ of $\nu=\partial_{I}H(I)$ such that $\partial_{I}H:U\to V$ is a diffeomorphism. In $U\times\mathbb{T}^{n}$ we can use $\nu$ and $\theta$ as coordinates.

For $\nu\in\mathbb{R}^{n}$ , let $g_{\nu}=\{k\in\mathbb{Z}^{n}:\langle\nu,k\rangle=0\}$ , and let $\operatorname{rank}(g_{\nu})$ be the rank of the $\mathbb{Z}$ -module $g_{\nu}$ , i.e. the maximal number of elements of $g_{\nu}$ that are linearly independent over $\mathbb{Z}$ . The proof of the following theorem follows [8, Proposition 2.1].

Theorem 3.

Let $\nu\in\Omega$ and let $r=\operatorname{rank}(g_{\nu})$ . In the torus with frequency $\nu$ , each trajectory is dense in some $(n-r)$ -dimensional subtorus and the $n$ -dimensional torus is foliated by these $(n-r)$ -dimensional tori.

Proof.

There exists a basis $k_{1},\ldots,k_{r}$ of $g_{\nu}$ and vectors $k_{1}^{*},\ldots,k_{n-r}^{*}\in\mathbb{Z}^{n}$ such that the $n\times n$ matrix $K_{0}$ with rows $k_{1}^{*},\ldots,k_{n-r}^{*},k_{1},\ldots,k_{r},$ has determinant $1$ . (I should show why such a basis exists.) Let $K_{0}=\begin{bmatrix}K^{*}\\ K\end{bmatrix}$ . $K^{*}$ is an $(n-r)\times n$ matrix and $K$ is an $r\times n$ matrix.

Let $q=K_{0}\theta$ . Since $\det(K_{0})=1$ , $K_{0}$ is invertible over $\mathbb{Z}$ . The coordinate $\theta$ is only determined up to $\mathbb{Z}^{n}$ , and for $q_{1}-q_{2}\in\mathbb{Z}^{n}$ then also $\theta_{1}-\theta_{2}\in\mathbb{Z}^{n}$ . Thus $q=K_{0}\theta$ are coordinates on $\mathbb{T}^{n}$ . The equation $\dot{\theta}=\nu$ can be written using the $q$ coordinates as $\dot{q}=K_{0}\nu$ . Then

K_{0}\nu=\begin{bmatrix}K^{*}\\ K\end{bmatrix}\nu=\begin{bmatrix}K^{*}\nu\\ K\nu\end{bmatrix}=\begin{bmatrix}K^{*}\nu\\ 0\end{bmatrix}.

Let $\nu^{*}=K^{*}\nu$ .

We see that $\{l\in\mathbb{Z}^{n}:l_{1}=\cdots=l_{n-r}=0\}\subseteq g_{K_{0}\nu}$ ; since they both have rank $r$ , they are equal. It follows that $\nu^{*}\in\mathbb{R}^{n-r}$ is nonresonant. Hence any trajectory on the $n$ -dimensional torus with frequency $\nu$ is dense in the $r$ -dimensional torus $\{q\in\mathbb{T}^{n}:q_{n-r+1}=\cdots=q_{n}=\text{constant}\}$ . ∎

3 Diophantine frequency vectors

For $c>0$ and $\gamma\geq 0$ we define

D_{n}(c,\gamma)=\{\nu\in\mathbb{R}^{n}:|\langle k,\nu\rangle|\geq\frac{1}{c\|k% \|_{\infty}^{\gamma}}\ \ \textrm{for all}\ k\in\mathbb{Z}^{n}\}.

We further define $D_{n}(\gamma)=\bigcup_{c>0}D_{n}(c,\gamma)$ .

Theorem 4.

For any $\nu\in\mathbb{R}^{n}$ and for any positive integer $K$ , there is some $0\neq k\in\mathbb{Z}^{n}$ with $\|k\|_{\infty}\leq 2K$ such that

|\langle k,\nu\rangle|\leq\frac{n\|\nu\|_{\infty}}{(2K)^{n-1}}.

Proof.

Let $B_{K}=\{k\in\mathbb{Z}^{n}:0<\|k\|_{\infty}\leq K\}$ . The set $B_{K}$ has $(2K+1)^{n}-1$ elements. For $k\in B_{K}$ we have

|\langle k,\nu\rangle|\leq n\|k\|_{\infty}\|\nu\|_{\infty}\leq nK\|\nu\|_{% \infty}.

Let $A=nK\|\nu\|_{\infty}$ .

Let $M=(2K+1)^{n}-2$ . In the set $\{|\langle k,\nu\rangle|:k\in B_{K}\}$ , there are two elements that are in same interval $[\frac{(j-1)A}{M},\frac{jA}{M}]$ , $j=1,\ldots,M$ , since $B_{K}$ has $M+1$ elements and there are $M$ such intervals. That is, there are $k^{\prime},k^{\prime\prime}\in B_{K}$ such that $|\langle k^{\prime},\nu\rangle|,|\langle k^{\prime\prime},\nu\rangle|\in[\frac% {(j-1)A}{M},\frac{jA}{M}]$ for some $j$ . Hence $|\langle k^{\prime},\nu\rangle-\langle k^{\prime\prime},\nu\rangle|\leq\frac{A% }{M}=\frac{nK\|\nu\|_{\infty}}{(2K+1)^{n}-2}$ .

One can show by induction that for all $n\geq 1$ , $\frac{K}{(2K+1)^{n}-2}\leq\frac{1}{(2K)^{n-1}}$ . Therefore for $k=k^{\prime}-k^{\prime\prime}$ we have

|\langle k,\nu\rangle|\leq\frac{n\|\nu\|_{\infty}}{(2K)^{n-1}},

Finally, $\|k\|_{\infty}\leq\|k^{\prime}\|_{\infty}+\|k^{\prime\prime}\|_{\infty}\leq 2K$ . ∎

Corollary 5.

If $\gamma<n-1$ then $D_{n}(\gamma)=\emptyset$ .

Proof.

Let $c>0$ . Suppose that there is some $\nu\in D_{n}(c,\gamma)$ . Let $K$ be the least integer such that $(2K)^{n-1-\gamma}$ is greater than $2cn\|\nu\|_{\infty}$ ; since $n-1-\gamma>0$ such a $K$ exists.

By Theorem 4, there is some $0\neq k\in\mathbb{Z}^{n}$ with

|\langle k,\nu\rangle|\leq\frac{n\|\nu\|_{\infty}}{(2K)^{n-1}}.

Then

$\displaystyle\|\langle k,\nu\rangle\|$	$\displaystyle\leq$	$\displaystyle\frac{n\\|\nu\\|_{\infty}(2K)^{-\gamma}}{(2K)^{n-1-\gamma}}$
	$\displaystyle\leq$	$\displaystyle\frac{n\\|\nu\\|_{\infty}(2K)^{-\gamma}}{2cn\\|\nu\\|_{\infty}}$
	$\displaystyle=$	$\displaystyle\frac{1}{2c(2K)^{\gamma}}$
	$\displaystyle\leq$	$\displaystyle\frac{1}{2c(4\\|k\\|_{\infty})^{\gamma}}$
	$\displaystyle<$	$\displaystyle\frac{1}{c\\|k\\|_{\infty}^{\gamma}},$

contradicting that $\nu\in D_{n}(c,\gamma)$ . Therefore for all $c>0$ , $D_{n}(c,\gamma)=\emptyset$ .

∎

Treschev and Zubelevich give a construction for points in $D_{n}(c,n-1)$ for sufficiently large $c$ [8, Theorem 9.2]. Thus there is some $C(n)$ such that for all $c\geq C(n)$ , $D_{n}(c,n-1)\neq\emptyset$ . It is clear that for $\gamma^{\prime}\geq\gamma$ we have the inclusion $D_{n}(c,\gamma)\subseteq D_{n}(c,\gamma^{\prime})$ . Hence this construction also shows that $D_{n}(c,\gamma)\neq\emptyset$ for all $\gamma\geq n-1$ and $c\geq C(n)$ . However this construction does not show that $m(D_{n}(c,n-1))>0$ for $c\geq C(n)$ . Indeed, one can show that $m(D_{n}(n-1))=0$ , but also that the set $D_{n}(n-1)$ has Hausdorff dimension $n$ [7, p. 5].

Our proof of the following theorem expands on [8, Theorem 9.3]. Let $Q_{n}(L)=\{\nu\in\mathbb{R}^{n}:\|\nu\|_{\infty}\leq\frac{L}{2}\}$ , the cube in $\mathbb{R}^{n}$ of edge length $L$ . Let $m$ be $n$ -dimensional Lebesgue measure. We will use the fact that the maximal $n-1$ dimensional area of the intersection of $Q_{n}(L)$ and a hyperplane is $\sqrt{2}L^{n-1}$ [3].

Theorem 6.

Let $L>0$ . For $\gamma>n-1$ and $c>0$ ,

m(Q_{n}(L)\setminus D_{n}(c,\gamma))\leq\frac{4\sqrt{2}n(3L)^{n-1}}{c}\Big{(}1% -\frac{1}{\gamma-n+1}\Big{)}.

Proof.

Let $Q_{n}=Q_{n}(L)$ . Let $\Pi_{k}=\{\nu\in\mathbb{R}^{n}:|\langle\nu,k\rangle|<\frac{1}{c\|k\|_{\infty}^% {\gamma}}\}$ . Let $\nu\in Q_{n}\setminus D_{n}(c,\gamma)$ . Then there is some $k\neq 0$ such that $|\langle k,\nu\rangle|<\frac{1}{c\|k\|_{\infty}^{\gamma}}$ , and so $\nu\in\Pi_{k}$ . Thus

Q_{n}\setminus D_{n}(c,\gamma)\subseteq\bigcup_{k\neq 0}(Q_{n}\cap\Pi_{k}),

m(Q_{n}\setminus D_{n}(c,\gamma))\leq\sum_{k\neq 0}m(Q_{n}\cap\Pi_{k}).

Let $k\neq 0$ . $\Pi_{k}$ is the region bounded by the two hyperplanes $\pi_{1}=\{\nu\in\mathbb{R}^{n}:\langle\nu,k\rangle=\frac{1}{c\|k\|_{\infty}^{% \gamma}}\}$ and $\pi_{2}=\{\nu\in\mathbb{R}^{n}:\langle\nu,k\rangle=-\frac{1}{c\|k\|_{\infty}^{% \gamma}}\}$ . Let $p_{1}=\frac{k}{c\|k\|_{\infty}^{\gamma}\|k\|}\in\pi_{1}$ and $p_{2}=-\frac{k}{c\|k\|_{\infty}^{\gamma}\|k\|}\pi_{2}$ . For any two points $\nu_{1},\nu_{2}\in\pi_{1}$ we can check that $\langle p_{1}-p_{2},\nu_{1}-\nu_{2}\rangle=0$ , and for any two points $\nu_{1},\nu_{2}\in\pi_{2}$ we can check that $\langle p_{1}-p_{2},\nu_{1}-\nu_{2}\rangle=0$ . Thus the vector $p_{1}-p_{2}$ is orthogonal to each of the hyperplanes $\pi_{1}$ and $\pi_{2}$ . It follows that the distance between the hyperplanes $\pi_{1}$ and $\pi_{2}$ is the distance between the points $p_{1}$ and $p_{2}$ , which is $2\cdot\frac{\|k\|}{c\|k\|_{\infty}^{\gamma}\|k\|^{2}}$ . Since $\|k\|\geq\|k\|_{\infty}$ , this is $\leq\frac{2}{c\|k\|_{\infty}^{\gamma+1}}$ . Therefore

m(Q_{n}\cap\Pi_{k})\leq\frac{2}{c\|k\|_{\infty}^{\gamma+1}}\cdot\sqrt{2}L^{n-1},

where we use the fact that the maximal $n-1$ dimensional area of the intersection of $Q_{n}=Q_{n}(L)$ and a hyperplane is $\sqrt{2}L^{n-1}$ [3].

For each positive integer $l$ , the hypercube $\{k\in\mathbb{Z}^{n}:\|k\|_{\infty}=l\}$ has $2n$ faces, on each of which there are $(2l+1)^{n-1}$ points with integer coordinates. Hence for each integer positive integer $l$ , we have $\#\{k\in\mathbb{Z}^{n}:\|k\|_{\infty}=l\}\leq 2n(2l+1)^{n-1}$ .

Therefore

$\displaystyle m(Q_{n}\setminus D_{n}(c,\gamma))$	$\displaystyle\leq$	$\displaystyle\sum_{k\neq 0}m(Q_{n}\cap\Pi_{k})$
	$\displaystyle\leq$	$\displaystyle\sum_{k\neq 0}\frac{2\sqrt{2}L^{n-1}}{c\\|k\\|_{\infty}^{\gamma+1}}$
	$\displaystyle=$	$\displaystyle\sum_{l=1}^{\infty}\sum_{\\|k\\|_{\infty}=l}\frac{2\sqrt{2}L^{n-1}}% {cl^{\gamma+1}}$
	$\displaystyle\leq$	$\displaystyle\sum_{l=1}^{\infty}2n(2l+1)^{n-1}\frac{2\sqrt{2}L^{n-1}}{cl^{% \gamma+1}}$
	$\displaystyle\leq$	$\displaystyle\sum_{l=1}^{\infty}2n(3l)^{n-1}\frac{2\sqrt{2}L^{n-1}}{cl^{\gamma% +1}}$
	$\displaystyle=$	$\displaystyle\frac{4\sqrt{2}n(3L)^{n-1}}{c}\sum_{l=1}^{\infty}\frac{1}{l^{% \gamma-n+2}}.$

Since the terms in the sum are positive and decreasing, we can estimate the sum using an integral:

\sum_{l=1}^{\infty}\frac{1}{l^{\gamma-n+2}}\leq 1+\int_{1}^{\infty}\frac{dx}{x% ^{\gamma-n+2}}=1+\frac{1}{\gamma-n+1},

finishing the proof. ∎

Corollary 7.

If $\gamma>n-1$ then $m(\mathbb{R}^{n}\setminus D_{n}(\gamma))=0.$

Proof.

Let $L>0$ . For every $c>0$ , $m(Q_{n}(L)\setminus D_{n}(\gamma))\leq m(Q_{n}(L)\setminus D_{n}(c,\gamma))$ . By Theorem 6, $m(Q_{n}(L)\setminus D_{n}(c,\gamma))\to 0$ as $c\to\infty$ . Hence $m(Q_{n}(L)\setminus D_{n}(\gamma))=0$ . But then

m(\mathbb{R}^{n}\setminus D_{n}(\gamma))=\lim_{L\to\infty}m(Q_{n}(L)\setminus D% _{n}(\gamma))=\lim_{L\to\infty}0=0.

∎

Fix $\gamma>n-1$ . Let $\alpha=\frac{1}{c}$ . Let $A_{\alpha}$ be an $\alpha$ -neighborhood of the boundary of $\Omega$ . We will make whatever assumption about $\partial\Omega$ we need in order to get $m(A_{\alpha})=O(\alpha)$ .

Suppose that $L$ is sufficiently large so that $\Omega\subseteq Q_{n}(L)$ . Then Theorem 6 gives us that $m(\Omega\setminus D_{n}(c,\gamma))=O(\alpha)$ .

Let $\Omega_{\alpha}=D_{n}(c,\gamma)\cap(\Omega\setminus A_{\alpha})$ . Since $\Omega\setminus\Omega_{\alpha}=(\Omega\setminus D_{n}(c,\gamma))\cup(\Omega% \cap A_{\alpha})$ , we have $m(\Omega\setminus\Omega_{\alpha})=O(\alpha)$ .

4 Statement of KAM

If we have a Hamiltonian system which admits action-angle coordinates in $B^{n}\times\mathbb{T}^{n}$ , then the trajectories of points in phase space are constrained to lie on invariant tori. Moreover, on these tori the dynamics of the system are quasi-periodic; a priori we don’t have a reason to expect that the dynamics should be so nice just because the trajectories lie on tori. But a generic Hamiltonian on the same phase space (I would like to make this notion precise) does not admit action-angle coordinates. The KAM theorem is a statement about the dynamics induced by making a sufficiently small change to a Hamiltonian. If we perturb a Hamiltonian which admits action-angle coordinates to one which probably does not, if the perturbation is sufficiently small, then most of the trajectories of points under the flow of the new Hamiltonian will also lie on tori. In some sense which I want to clarify, the invariant tori of the new Hamiltonian are close to the invariant tori of the Hamiltonian that admits action-angle coordinates. It is not clear to me how an invariant torus of the old Hamiltonian transforms into an invariant torus of the new Hamiltonian; in what sense does an invariant torus for the old Hamiltonian become an invariant torus for the new Hamiltonian?

In particular, a consequence of the KAM theorem is that if we make a small perturbation of a Hamiltonian system that admits action-angle coordinates then the trajectories of most points will not be dense on a hypersurface in phase space, since they are constrained to lie on $n$ -dimensional tori. In other words, the new Hamiltonian system is not ergodic, since the invariant tori have lower dimension than $n-1$ , and so have $n-1$ -dimensional measure 0.

Let’s explain the KAM theorem in another way. Suppose that we have a symplectic manifold $M$ and a Lagrangian foliation $\mathscr{F}_{0}$ whose leaves are tori, and suppose that the leaves of $\mathscr{F}_{0}$ are invariant tori for a Hamiltonian $H_{0}$ . That is, the Hamiltonian vector field $X_{H_{0}}$ is tangent to all the leaves in $\mathscr{F}_{0}$ . Now let $H=H_{0}+\epsilon H_{1}$ . The leaves of the foliation $\mathscr{F}_{0}$ will not be invariant under the flow of $H$ . We would like to obtain a symplectomorphism $\Phi:M\to M$ such that the Hamiltonian vector field $X_{H}$ is tangent to most leaves in the foliation $\mathscr{F}=\Phi(\mathscr{F}_{0})$ . Here we mean most in a measure theoretic sense that depends on the magnitude $\epsilon$ of the perturbation away from the Hamiltonian that admits action-angle coordinates.

How do we construct a diffeomorphism? Often the best way is to demand that it be the time $1$ flow of a vector field, so $\Phi=\Phi_{1}$ for some $\Phi_{t}$ , and to see if such a vector field exists. Suppose that $f$ is a function such that if $\Phi_{t}$ is the flow of $X_{f}$ then $\Phi_{1}=\Phi$ .

5 Normal forms

Normal forms of vector fields, homological equation [9].

References

[1] R. Abraham and J. E. Marsden (2008) Foundations of mechanics. Second edition, AMS Chelsea Publishing, Providence, Rhode Island. Cited by: §2, §2, §2, §2.
[2] V. I. Arnold (1989) Mathematical methods of classical mechanics. Second edition, Graduate Texts in Mathematics, Vol. 60, Springer. Cited by: §2.
[3] K. Ball (1986) Cube slicing in ${\bf R}^{n}$ . Proc. Amer. Math. Soc. 97 (3), pp. 465–473. External Links: ISSN 0002-9939, Document, Link, MathReview (Jeffrey D. Vaaler) Cited by: §3, §3.
[4] G. Gallavotti (2001) Quasi periodic motions from Hipparchus to Kolmogorov. Atti della Accademia Nazionale dei Lincei. Classe di Scienze Fisiche, Matematiche e Naturali. Rendiconti Lincei. Matematica e Applicazioni 12 (2), pp. 125–152. Cited by: §1.
[5] G. H. Hardy and E. M. Wright (2008) An introduction to the theory of numbers. Sixth edition, Oxford University Press. External Links: ISBN 978-0-19-921986-5 Cited by: §2.
[6] J. Hubbard and Y. Ilyashenko (2004) A proof of Kolmogorov’s theorem. Discrete Contin. Dyn. Syst. 10 (1-2), pp. 367–385. External Links: ISSN 1078-0947, Document, Link, MathReview (Dario Bambusi) Cited by: §1.
[7] J. Pöschel (2001) A lecture on the classical KAM theorem. In Smooth Ergodic Theory and Its Applications, A. Katok, R. de la Llave, Y. Pesin, and H. Weiss (Eds.), Proceedings of Symposia in Pure Mathematics, Vol. 69, pp. 707–732. Cited by: §1, §3.
[8] D. Treschev and O. Zubelevich (2010) Introduction to the perturbation theory of Hamiltonian systems. Springer Monographs in Mathematics, Springer. Cited by: §2, §3, §3.
[9] S. Wiggins (2003) Introduction to applied nonlinear dynamical systems and chaos. second edition, Texts in Applied Mathematics, Vol. 2, Springer. Cited by: §5.