Polarization Identity

Polarization Identity

The Polarization identity for quadratic forms like inner products is also known as the parallelogram law. It is very important and always a good teaching moment in a multi-variable calculus course. It is more than just an identity. It tells that if know “lengths”, then we can recover “angles”. All the geometry in a Hilbert space is determined by the norm. The identity is the almost trivial algebraic (x+y)^2 - (x-y)^2=4xy. It implies for example that for \mathcal{L}^2 random variables, knowing the variance determines correlations: {\rm Var}[X+Y] - {\rm Var}[X-Y] = 4 {\rm Cov}[X,Y]. In differential geometry, the identity gives that knowing the Ricci tensor evaluated on pairs of the same vectors R(v,v) suffices to get the Ricci tensor for all v. This insight helps to get reasonable notions of Ricci tensor in finite structures like graphs, where it is a quantity attached to edges of the graph. Similarly also the metric tensor g, which also is a bilinear form at every point.

In order to generalize this identity to n-linear forms, introduce S=\{ -1,1\}^n and write the above parallelogram identity as \sum_{s \in S} s_1 s_2 (s_1 x + s_2 y)^2 = 2^2 \cdot 2! =8 which is obviously the same, as we just doubled the sum. But now, we have a sum that is symmetric with respect to permutations of the variables so that the result has to be a multiple of x y as each term is homogeneous of degree 2. Once we understand this, it is easy to generalize this to higher dimensions. The formula holds in any commutative ring with 1. I called it in the video the polarization identity in dimension n. (Better would be Polarization identity for degree n). It is an identity which holds in any ring and x_i are elements in the ring. Note that they can agree. We can for example in the degree 3 case write 3! 2^3 x y^2 = (x+y+y)^3 -(x+y-y)^3 - (x-y+y)^3 + (x-y-y)^3 .......-(-x-y-y)^3. The identity allows to write any monomial of degree n in the ring as a sum or difference of powers of n. While the identity is not deep, it is useful.

Polarization Identity: Let x_1,x_2, \dots, x_n be elements in a commutative ring with 1, then \sum_{s \in S} (\prod_{j=1}^n s_j) (\sum_{i=1}^n  s_i x_i)^n  = 2^n n! \prod_{j=1}^n x_j. Every monomial in the ring of degree n can be written as a difference of sums of powers of n.

The proof is straightforward. As in the degree 2 case, note that the left hand side is symmetric and homogeneous of degree n so that it has to be a multiple C of \prod_{j=1}^n x_j = x_1 x_2 \cdots x_n. To get the constant C, plug in x_j=1 and expand (\sum_{i=1}^n  s_i x_i)^n and simplify the sum. It reduces to show that \sum_{k=0}^n \binom{n}{k} (-1)^k (n-2k)^n = n! 2^n. To simplify this, use that \sum_{k=0}^n (-1)^k \binom{n}{k} k^j=0 is for j \leq n only nonzero for j=n, where it is (-1)^n n!.

We were interested in the polarization identity because we need to write in \mathbb{R}^q any k-form f (with k \leq q-2) as a linear combination of inner derivative i_X g, where $latex $g$ is a (k+1)-form, invariant under the flow of a vector field X, meaning L_X g = 0. For example, in \mathbb{R}^3, the 1-form f=a(y,z) dy is equal to i_X g, where g=a(y,z) dx \wedge dy and X=(1,0,0). So, in order to get this in dimension q=3, we need to write any polynomial p=x^{n_1} y^{n_2} z^{n_3} of degree n=n_1+n_2+n_3 as a sum of polynomials that can be expressed using 2 variables only. Polarization does the job because we can write p as a sum of polynomials of the form (s_1 x + s_2 y + s_3 z)^n which allows in each of the cases to find a constant vector field X, which is parallel to the plane s_1 x + s_2 y + s_3 z. Change coordinates so that is is x=0 and X=(1,0,0) so that g=a(y,z) dx \wedge d_y does the job. Polarization allows to write a polynomial as sum of polynomials, where each monomial in the sum can in some coordinate system be written with one variable less.

This now allows to show that if the strong Huygens principle works for (k+1)-forms, then it works for k-forms. The above polarization trick shows that every k-form can be written as a sum of f=i_X g, where L_X g=0. For such a form, we have, using the Cartan magic formula L_X = d i_X + i_X d and L_X g=0 and the fact that L_X,d,i_X all commute with L=D^2 (if vector field is parallel (constant)) the following computation d_t f = \phi_{q+2}(t D) d f = \phi_{q+2}(tD) d i_X g = -\phi_{q+2}(tD)  i_X d g = -i_X \phi_{q+2} dg = -i_X d_t g. So, also d_t g(p) only depends on points x that are distance t away from p.