Darren L. Slider The Spinor Norm and Zassenhaus’s Theorem Darren’s Writings Home |
The Spinor Norm and Zassenhaus’s Theorem A Research Paper Submitted in Partial Fulfillment of the Requirements for the Master of Science Degree Department of Mathematics ACKNOWLEDGEMENTS I would like to express my deepest gratitude to my advisor, Professor Andrew G. Earnest, for his abundant help and guidance with this paper, and for making my two years of graduate school a stimulating and enjoyable experience. I would also like to thank the other members of my committee, Professors Robert Fitzgerald and Mary Wright, for their time and effort with respect to this paper and its presentation. Finally, I express my wholehearted indebtedness and praise to God and to my wife, for reasons which transcend verbal explanation. THE SPINOR NORM AND ZASSENHAUS’S THEOREM The proof of Theorem 1 here given is due to Geoffrey Mason, which is sketched in his article “Groups, Discriminants, and the Spinor Norm” (see Bibliography). The examples following Zassenhaus’s Theorem were obtained from L. E. Dickson, Studies in the Theory of Numbers (see Bibliography). We assume throughout that V is a vector space over a field F of characteristic not 2 (i.e., there exists a Î F such that 2a ¹ 0) and that dim(V) = n (where n Î N is fixed). Definition. A mapping B : V ´ V ® F is a symmetric bilinear form on V if for all v, w, x Î V, a Î F, the following properties hold: (1) B(v,w+x) = B(v,w) + B(v,x) Definition. Let B be a symmetric bilinear form on V. Let Q : V ® F be given by Q(v) = B(v,v). Q is a quadratic form on V. Definition. Let V have basis {v1,...,vn} and symmetric bilinear form B. The determinant det(B(vi,vj)) of the n ´ n matrix (B(vi,vj)) is called the discriminant of V and is written d(V). d(V) is invariant under change of basis in the group F*/F*2 and is thus represented as an element of that field (see O’Meara, pp. 85-87). Let n, t Î N satisfy 1 £ n £ t. We introduce variables aij (1 £ i, j £ t) and the t ´ t matrices D = (aij)
En = (I + A1)(I + A2)××××××(I + An) Lemma 1. Jn-1D + An = JnD and (Jn-1D - N)An = 0. Proof. Note that
It follows that Jn-1D + An =
Furthermore, (Jn-1D - N)An =
Q.E.D. Proposition 1. (I - N)(En - I) = JnD. Proof. By induction on n: Note that E1 - I = I + A1 - I = A1. So (I - N)(E1 - I) = (I - N)A1 =
by the first result of the proof of Lemma 1. Assume that (I - N)(Ek -1 - I) = Jk -1D for some k with 2 ≤ k ≤ t. Then
by Lemma 1. Q.E.D. From now on, let dim V = n = t, replace aij by -2aij/aii, and set T(vi) =
Lemma 2. det(T(v1)T(v2)×××T(vn) - I) = (-2)ndet(aij)/a11×××ann. Proof. Since n = t, we have JnD = JtD = ID = D. We may obtain det(D) =
Q.E.D. Definitions. Let B be a symmetric bilinear form on V, and let W be a subspace of V. We define the orthogonal complement W* of W to be We assume henceforth that V is nondegenerate and has a symmetric bilinear form B with associated quadratic form Q. Let M : V ® V be an injective linear transformation such that for all v, w Î V, B(M(v),M(w)) = B(v,w). M is called an isometry of V onto itself. For w Î V, define T(w) : V ® V by T(w)(v) = v - (2B(v,w)w/Q(w)) for v Î V. T(w) is the symmetry on V with respect to w. Let {v1, . . . ,vn} be a basis for V. Then if v Î V, v = a1v1 + . . . anvn for some a1, . . . ,an Î F. Note that
under the interpretation aij = B(vi,vj). Indeed, in the ith row we have
and from this we can see that the choice of terminology for the symmetry on V with respect to vi was no accident: the latter, as we have just shown, has the matrix representation of I + Ai shown above. Let M be an isometry of V onto itself. By the theorem of Cartan and Dieudonné (O’Meara, p. 102) we know that M has a representation as a product of symmetries,
with k ≤ n and Q(wi) ≠ 0 for i = 1,...,n. Proposition 2. Let M be as in (1) with k minimal. Let W = áw1,...,wkñ. Then M has no eigenvalue equal to 1 if and only if V = W. Proof. (Þ) For i = 1,...,k, let wi* = {v Î V: B(v,wi) = 0}. Since W is spanned by {w1,...,wk}, for w Î W we may find a1,...,ak Î F such that w = a1w1 + ××× + akwk . Assume that w′ Î Çwi*; then for w Î W we have
so that w’ Î W*. Therefore, Çwi* Í W*. On the other hand, if w Î W*, then Let w Î W* = Çwi*; then for i = 1,...,k, B(w,wi) = 0 and
and thus M(w) = T(w1)×××T(wk)(w) = w. Therefore (M - I)(w) = 0 and if w ¹ 0, w is an eigenvector corresponding to the eigenvalue 1 of M. But by hypothesis, M has no eigenvalues equal to 1, so we must have w = 0. It follows that W* = {0}. In particular, rad(W) = W Ç W* = {0} so that W is nondegenerate. We conclude that V = W + W* = W + {0} = W. (Ü) Assume now that V = W. Then V = áw1,...,wkñ; since k is minimal and dim(V) = n, k = n and áw1,...,wnñ is a basis for V. Under the interpretation vi = wi,
Since V is nondegenerate, d(V) = det(B(wi,wj)) ¹ 0, so det(M - I) ¹ 0. We conclude that M has no eigenvalue equal to 1. Q.E.D. Definition. Let M be as above, and let U be a subspace of V. We say that M acts as a unipotent operator on U if for each u Î U, there exists a t Î N such that We may assume that the largest subspace on which M acts as a unipotent operator contains all other such subspaces. Indeed, let U be a subspace and let u’ be a vector not in U such that (M - I)m(u’) = 0 for some m Î N. Let u Î U be arbitrary; then for some n Î N, (M - I)n(u) = 0. Let t = max{m,n}; then
so that M acts as a unipotent operator on U + áu’ñ. Continuing in this way, we can (since V is finite-dimensional) obtain a subspace upon which M acts as a unipotent operator which contains all other such subspaces. Lemma 3. Let U be the largest subspace of V on which M acts as a unipotent operator, and let W be as in Proposition 2. Then V = U + W. Proof. Let v Î V. Then
so that (M - I)(v) = v - w1’ - v = -w1’ Î W. It follows that im(M - I) Í W. Now consider the operators M - I, (M - I)2, (M - I)3, . . . on V. Note that Let x Î V0 Ç V1. Since x Î V1, there must exist some y Î V such that Since W Ê im(M - I) Ê im(M - I)2 Ê . . . , we have V1 Í W and V0 Í U. So
and of course dim(V) ≥ dim(U + W); thus dim(V) = dim(U + W). It follows that V = U + W, as desired. Q.E.D. Lemma 4. Let W be as in Proposition 2, and let M’ be defined on W/rad(W) by M’(w+rad(W)) = M(w)+rad(W). Then M’ is a well-defined isometry on W/rad(W). Furthermore, let X/rad(W) be the largest subspace of W/rad(W) on which M’ acts as a unipotent operator, and let U be as in Lemma 3. If X/rad(W) is nondegenerate, then X Í U and W Í X + X*. Proof. Given the symmetric bilinear form B on V, define B’ on W/rad(W) by B’(w+rad(W),w’+rad(W)) = B(w,w’). It is trivial to verify that B’ is a symmetric bilinear form on W/rad(W). To see that B’ is well-defined, note that if w1 + rad(W) = w + rad(W) and w1’ + rad(W) = w’ + rad(W), it follows that w - w1, Given this symmetric bilinear form B’ on W/rad(W), it is easy to prove that M’ is an isometry, for it is clear from the proof of Lemma 3 that M maps W into W. If, say, w and w’ belong to the same coset of W/rad(W), then w-w’ Î rad(W), and so
so that M(w) + rad(W) = M(w’) + rad(W). Hence M’ is well-defined. Now suppose that X/rad(W) as defined above is nondegenerate; let x Î X. Certainly x + rad(W) Î X/rad(W); since M’ acts as a unipotent operator on X/rad(W), there is t Î N such that (M-I)t(x)+rad(W)=(M’-I)t(x+rad(W)) = 0 + rad(W); in particular, (M - I)t(x) Î rad(W). Now rad(W) Í W* so that (M - I)t(x) Î W*; by the proof of Proposition 2, we have M((M - I)t(x)) = (M - I)t(x). Therefore, (M - I)t+1(x) = 0. It follows that x Î U. So X Í U. We shall now prove that (X/rad(W))* = X*/rad(W). Let w+rad(W) Î (X/rad(W))*. Then if x+rad(W)ÎX/rad(W), we have B(w,x) = B’(w + rad(W),x + rad(W)) = 0. Then w Î X* so that w + rad(W) Î X*/rad(W). Therefore (X/rad(W))* Í X*/rad(W). On the other hand, let x + rad(W) Î X*/rad(W); then x Î X* and if Since X/rad(W) is assumed to be nondegenerate, we have
Consider w Î W. For some x Î X, x’ Î X*, we have
so that w - (x + x’) Î rad(W); say w’ = w - (x + x’). Then w = x + x’ + w’. To prove that W Í X + X*, it suffices to show that w’ Î X*. Since w’ Î rad(W), then in particular w’ Î W*. Since X Í W, for any y Î X, we have y Î W, so that B(w’,y) = 0. It follows that w’ Î X*, which concludes the proof. Q.E.D. Proposition 3. The largest subspace U of V on which M acts as a unipotent operator is nondegenerate. Proof. By induction on dim(V) = n. Let W be as in Proposition 2. Then by Lemma 3 we have V = U + W. We set R = rad(U) = U Ç U* and try to show that R = {0}. If n = 1, then either V = U or V = W. If V = U, then U is nondegenerate because V is assumed to be so. If V = W, then by Proposition 2, M has no eigenvalue equal to 1. It follows that if (M - I)(v) = 0 for some v Î V, then v = 0. Likewise, if Now assume that the largest subspace of any nondegenerate vector space of dimension less than n upon which any given isometry acts as a unipotent operator is nondegenerate. Now consider dim(V) = n. By the above argument, we are done if V = W, so assume that V ¹ W. Note that W* Í U (since M acts as a unipotent operator on W* by the proof of Proposition 2); therefore (see O’Meara, p. 92)
Let X/rad(W) be as in Lemma 4. Since W ¹ V, we have dim(W) < dim(V). Clearly dim(W/rad(W)) ≤ dim(W), so that dim(W/rad(W)) ≤ dim(W) < dim(V) = n. It now follows from the induction assumption that X/rad(W) is nondegenerate. Since U* Í W, we get R = U Ç U* Í U Ç W. We shall show that U Ç W Í X so that R Í X. Indeed, let w Î U Ç W; then w Î U so that (M - I)t(w) = 0 for some
whence by the maximality of X/rad(W), w + rad(W) Î X/rad(W) so that w Î X. Since R = rad(U), R is orthogonal to U; since R Í X = (X*)*, R is orthogonal to X*. Hence R is orthogonal to U + X*. By Lemma 4 we have X Í U and W Í X + X*, since X/rad(W) is nondegenerate. Therefore
and certainly U + X* Í V, so V = U + X*. Now V is nondegenerate, so we have V* = (U + X*)* = {0}, whence R Í (U + X*)* = {0} gives us R = {0}, as desired. Q.E.D. Definition. Let M be an isometry on the quadratic space V with associated quadratic form Q. If M = T(w1)T(w2)×××T(wk), we define the spinor norm of M to be Θ(M) = Q(w1)Q(w2)×××Q(wk) Î F*/F*2. This is a well-defined invariant (i.e., independent of the choice of product of reflections) in F*/F*2 (see O’Meara, pp. 131-137), Furthermore, it is easily seen that Θ is a homomorphism on the group (under the operation of composition) of isometries of V onto itself. Proposition 4. If M acts as a unipotent operator on V, then Θ(M) = 1. Proof. By induction on dim (V) = n. Let dim(V) = á v ñ. Since M acts as a unipotent operator on V, there must exist some minimal t Î N such that (M - I)((M - I)t-1(v)) = (M - I)t(v) = 0. If t = 1, let Since M may be represented as a product of reflections, say, M = T(w1)×××T(wk) with Q(wi) ¹ 0, then in particular, Q(w1) ¹ 0 and w1 ¹ 0 so that V = áw1ñ. Hence for
so that M = T(w1)T(w1). Therefore Θ(M) = Q(w1)Q(w1) = (Q(w1))2 = 1 in F*/F*2. Now assume that the spinor norm of any isometry acting as a unipotent operator on any vector space of dimension < n is 1. Let W be as in Proposition 2. Let 0 ¹ v Î V; then we have (M - I)t(v) = 0 for some minimal t Î N. If t = 1, then 1 is an eigenvalue of M corresponding to v. If t > 1, then 1 is an eigenvalue of M corresponding to (M - I)t-1(v) (which is in this case an eigenvector of M as (M - I)((M - I)t-1(v)) = (M - I)t(v) = 0). In either case, M has an eigenvalue equal to 1 and by Proposition 2, V ¹ W. Then there is 0 ¹ w Î W* (for otherwise W* = {0} implies that V = W by the proof of Proposition 2). Consider the space Y / áwñ, where Y = áwñ + Z and Z = {v Î V : B(v, w) = 0}. Given the symmetric bilinear form B on V, define B" on the space Y / áwñ by
since z’ Î Z. Hence B(z,z’) = B(z1,z’). By a similar argument, B(z1,z’) = B(z1,z1’) so that B(z,z’) = B(z1,z’) = B(z1,z1’). Define M" on the space Y/áwñ by M"(y + áwñ) = M(y) + áwñ. Given the symmetric bilinear form B", it is fairly easy to prove that M" is an isometry; to see that M" is well-defined, suppose that y + áwñ = y’ + áwñ. Then y - y’ Î áwñ so that y - y’ = αw for some α Î F. Therefore M(y) - M(y’) = M(y - y’) = M(αw) = αM(w) = αw since M acts trivially on W* by the proof of Proposition 2. Thus M(y) - M(y’) Î áwñ so that M(y) + áwñ = M(y’) + áwñ. Furthermore, M" acts as a unipotent operator on Y/áwñ. To see this, let y + áwñ be any element of Y + áwñ. Since in particular y Î V, and since M acts as a unipotent operator on V, there exists n Î N with (M - I)n(y) = 0. Thus we have Note that for i = 1 to k, Q"(wi + áwñ) = B"(wi + áwñ,wi + áwñ) = B(wi,wi) = Q(wi) ¹ 0. Now let T"(wi + áwñ) be the symmetry on Y/áwñ with respect to wi + áwñ. For arbitrary y + áwñ Î Y/áwñ, we have
so that
It follows that M" = T"(w1 + áwñ)×××T"(wk + áwñ) is a representation of M" as a product of reflections such that Q"(wi + áwñ) ¹ 0 for i = 1 to k. Note that dim(Y/áwñ) < n since w ¹ 0. Therefore
by the induction hypothesis. Q.E.D. Lemma 5. M|U and M|U* are well-defined isometries. Furthermore, let the characteristic polynomial of M be given by χ(z). Set χ(z) = (z - 1)fχ0(z) where (z - 1) does not divide χ0(z). Then the characteristic polynomials of M|U, M|U* are given by (z - 1)f, χ0(z) respectively. Proof. Let u Î U; then there is t Î N such that (M - I)t(u) = 0. Then (M - I)t(M(u)) = M((M - I)t(u)) = M(0) = 0. It follows that M(u) Î U for all u Î U. Therefore M|U is well-defined and is an isometry because M is an isometry. Since M|U is an isometry on U, M|U is injective; since U is finite-dimensional, M must also be surjective. In particular, if we let u Î U, there exists x Î U such that M(x) = M|U(x) = u. Now let u’ Î U* be arbitrary. Since M is an isometry, we have B(u,M(u’)) = B(M(x),M(u’)) = B(x,u’) = 0. It follows (since u was also arbitrary) that M(u’) Î U*. Therefore M|U* is well-defined and is an isometry because M is an isometry. We claim that charpoly(M) = charpoly(M|U)charpoly(M|U*). Indeed, since V = U + U*, there exists a basis {v1, . . . ,vm,vm+1, . . . ,vn} for V with v1, . . . ,vm Î U, vm+1, . . . ,vn Î U*, and the corresponding matrix for M is in block-diagonal form. Let α be an eigenvalue of M|U, and let u be the corresponding eigenvector. Since u Î U, there exists n Î N such that (M - I)n(u) = 0. Since α is an eigenvalue, M(u) = M|U(u) = αu and (M - I)(u) = (α - 1)u so that
Proceeding inductively in this way, we get 0 = (M - I)n(u) = (α - 1)nu whence it follows that α = 1. Therefore 1 is the only eigenvalue of M|U. Furthermore, if 1 is an eigenvalue of M, M(x) = x for some 0 ¹ x Î V whence we get (M - I)(x) = 0 and x Î U. Thus 1 is not an eigenvalue of M|U*. We may conclude that (z - 1) f | charpoly(M|U) in F[z]. Let {v1, . . . ,vn} be the basis for V mentioned above, and let E be a splitting field for charpoly(M|U). Define V # = {α1v1 + ××× + αnvn : αi Î E} and U # = {α1v1 + ××× + αmvm : αi Î E}; it is not hard to see that these are vector spaces over E. Furthermore, define
on V #; the fact that B # is a symmetric bilinear form on V # then follows from the fact that B is a symmetric bilinear form on V. Similarly, define M #(α1v1 + ××× + αnvn) = α1M(v1) + ××× + αnM(vn) on V #; it follows routinely that M # is an isometry on V # with respect to B #. Furthermore, it is routinely verifiable that M # acts as a unipotent operator on U#. Since M # and M have the same matrix representations for the given basis by definition of M #, it makes sense to write “M #|U #”. Indeed, M #|U # and M|U will have the same matrix representations and hence the same characteristic polynomials. But by an analogue to the above argument, since M # acts as a unipotent operator on U #, 1 is the only eigenvalue of M #|U #. Since E is a splitting field for charpoly(M|U) = charpoly(M #|U #), charpoly(M|U) factors completely in E[z] as an integer power of z - 1. Since 1 Î F, charpoly(M|U) must also factor completely in F[z] as an integer power of z - 1. Therefore charpoly(M|U) = (z - 1) f, as desired. Finally, since (z - 1) fχ0(z) = χ(z) = charpoly(M) = charpoly(M|U)charpoly(M|U*) = (z - 1) fcharpoly(M|U*), we must have charpoly(M|U*) = χ0(z), which completes the proof. Q.E.D. The spinor norm is usually defined (as above) in terms of the symmetries of which the given isometry can be written as a product. Theorem 1, which we are now ready to prove, gives an intrinsic characterization of the spinor norm independent of such a factorization. Theorem 1. Let M, U, χ(z) be as in Lemma 5, and let V, F be as above. In F*/F*2, Θ(M)χ0(1) = 2nd(U*). Proof. Since U is nondegenerate by Proposition 3, V = U + U* and thus Θ(M) = Θ(M|U)Θ(M|U*) (see O’Meara, p. 139). Indeed, Θ(M) = Θ(M|U*) since Θ(M|U) = 1 by Proposition 4. By Lemma 5, χ0(z) is the characteristic polynomial of M|U*. Finally, since the largest subspace of U* upon which M|U* acts as a unipotent operator is {0}, the orthogonal complement of that space is just U*. Hence it suffices to prove the theorem for U*. Let V’ = U*, M’ = M|U* = M|V’. Let W and {w1, . . . wk} be as in Proposition 2. By Lemma 5,
which is what we wanted. Q.E.D. As a special case of Theorem 1, we obtain a formula for the computation of the spinor norm in terms of a simple determinant. This formula was originally obtained by H. Zassenhaus as a by-product of his proof that the spinor norm is well-defined (i.e., independent of the choice of symmetries of which the given isometry is expressed as a product). Theorem 2. (Zassenhaus) Let M be an isometry on V such that det(M + I) ¹ 0. Then Θ(M) = det((M + I)/2) in F*/F*2. Proof. det(M + I) ¹ 0, so M has no eigenvalue equal to -1. It follows that -M has no eigenvalue equal to 1. As above, take charpoly(-M) = χ(z) = (z - 1)fχ0(z). We assert that U = {0}. Indeed, assume (-M - I)t(v) = 0 for some v Î V. Let t be the least positive integer for which this is true for v in particular. If t = 1, then either v is an eigenvector corresponding to an eigenvalue of 1 or v = 0. Since there is no such eigenvalue, in this case v = 0. If, however, t > 1, then (-M - I)((-M - I)t-1(v)) = 0 so that (-M - I)t-1(v) is either an eigenvector corresponding to the eigenvalue 1 or is 0. Both are impossible, because there is no such eigenvalue and because t was chosen to be minimal; hence this case yields a contradiction. It follows that v = 0, and we have proved our assertion. Hence V = U + U* = U*, so that by Lemma 5, χ(z) = χ0(z). It follows that χ0(1) = det(M + I). Let {x1, . . . ,xn} be an orthogonal basis for V. It is almost trivial to verify that -I = T(x1)×××T(xn). It will now follow that Θ(-I) = Q(x1)×××Q(xn) = d(V). Furthermore, Θ(-M) = Θ(-I)Θ(-M) = d(V)Θ(M). Hence by Theorem 1, d(V)Θ(M)det(M + I) = Θ(-M)χ0(1) = 2nd(U*) = 2nd(V) so that Θ(M)det(M + I) = 2n. Hence in F*/F*2, multiplying both sides by 2-2ndet(M + I) yields Θ(M) = 2-ndet(M + I). By elementary linear algebra, 2-ndet(M + I) = det((M + I)/2), and we are done. Q.E.D. Examples. (1) Let F = Q, V any vector space over Q with dim(V) = 3, {v1,v2,v3} a basis for V over Q. Define
Then B is a symmetric bilinear form on V and M is an isometry from V onto itself. Since det(M + I) = 4 ¹ 0, Zassenhaus’s Theorem applies and Θ(M) = det((M + I)/2) = 1/8 = (1/4)(2) = (1/2)2(2) = 2 in Q*/Q*2. (2) Let F = Q, V any vector space over Q with dim(V) = 3, {v1,v2,v3} a basis for V over Q. Define
Then B is a symmetric bilinear form on V and M is an isometry of V onto itself. Unfortunately, det(M + I) = 0, so Zassenhaus’s Theorem does not apply directly. We shall use Theorem 1. We find that χ(z) = det(zI - M) = (z + 1)2(z - 1) so that χ0(1) = 4 = 1 in Q*/Q*2. Since
it follows by induction that (M - I)n = (-2)n(M - I) for all n Î N. Therefore u Î U if and only if Applying Theorem 1, we now have Θ(M) = Θ(M)χ0(1) = 2nd(U*) = (8)(2) = 16 = 1 in Q*/Q*2. BIBLIOGRAPHY Anton, Howard. Elementary Linear Algebra. John Wiley & Sons, New York, 1973. Dickson, L. E. Studies in the Theory of Numbers. University of Chicago Press, 1930. Mason, Geoffrey. “Groups, Discriminants and the Spinor Norm.” Bulletin of the London Mathematical Society 21 (1989) 51-56. O'Meara, O. T. Introduction to Quadratic Forms. Springer-Verlag, Berlin, 1963. Zassenhaus, H. “On the Spinor Norm.” Archiv der Mathematik, Basel, 13 (1962) 434-451. Author’s Note: I wrote this paper in the spring of 1991 as the research paper for a Master of Science degree in Mathematics, with which degree I graduated that May from Southern Illinois University at Carbondale. It is in the subspecialty of abstract linear algebra. |