The Spinor Norm and Zassenhaus’s Theorem

Darren L. Slider

The Spinor Norm and Zassenhaus’s Theorem

Darren’s Writings

Home

DARREN L. SLIDER (B. 1967)

The Spinor Norm and Zassenhaus’s Theorem

A Research Paper Submitted in Partial Fulfillment of the Requirements for the Master of Science Degree

Department of Mathematics
Graduate School
Southern Illinois University at Carbondale
© 1991

ACKNOWLEDGEMENTS

I would like to express my deepest gratitude to my advisor, Professor Andrew G. Earnest, for his abundant help and guidance with this paper, and for making my two years of graduate school a stimulating and enjoyable experience. I would also like to thank the other members of my committee, Professors Robert Fitzgerald and Mary Wright, for their time and effort with respect to this paper and its presentation. Finally, I express my wholehearted indebtedness and praise to God and to my wife, for reasons which transcend verbal explanation.

THE SPINOR NORM AND ZASSENHAUS’S THEOREM

The proof of Theorem 1 here given is due to Geoffrey Mason, which is sketched in his article “Groups, Discriminants, and the Spinor Norm” (see Bibliography). The examples following Zassenhaus’s Theorem were obtained from L. E. Dickson, Studies in the Theory of Numbers (see Bibliography).

We assume throughout that V is a vector space over a field F of characteristic not 2 (i.e., there exists a Î F such that 2a ¹ 0) and that dim(V) = n (where n Î N is fixed).

Definition. A mapping B : V ´ V ® F is a symmetric bilinear form on V if for all v, w, x Î V, a Î F, the following properties hold:

(1) B(v,w+x) = B(v,w) + B(v,x)
(2) B(av,w) = aB(v,w)
(3) B(v,w) = B(w,v).

Definition. Let B be a symmetric bilinear form on V. Let Q : V ® F be given by Q(v) = B(v,v). Q is a quadratic form on V.

Definition. Let V have basis {v₁,...,v_n} and symmetric bilinear form B. The determinant det(B(v_i,v_j)) of the n ´ n matrix (B(v_i,v_j)) is called the discriminant of V and is written d(V). d(V) is invariant under change of basis in the group F^*/F^*2 and is thus represented as an element of that field (see O’Meara, pp. 85-87).

Let n, t Î N satisfy 1 £ n £ t. We introduce variables a_ij (1 £ i, j £ t) and the t ´ t matrices

D = (a_ij)

J_n =
I_n 0
0 0

N =
0 a₁₂ - - a_1t
| \ \ |
| \ \ |
| \ a_t-1,t
0 - - - 0

A_i =
0
|
0
a_i1×××a_it
0
|
0

E_n = (I + A₁)(I + A₂)××××××(I + A_n)

Lemma 1. J_n-1D + A_n = J_nD and (J_n-1D - N)A_n = 0.

Proof. Note that

It follows that J_n-1D + A_n =

a₁₁ - - - - a_1t
| |
a_n-1,1 - - - - a_n-1,t
0 - - - - 0
| |
| |
0 - - - - 0
+
0 - - - - 0
| |
| |
a_n1 - - - - a_nt
0 - - - - 0
| |
0 - - - - 0
=
a₁₁ - - - - a_1t
| |
| |
a_n1 - - - - a_nt
0 - - - - 0
| |
0 - - - - 0
= J_nD.

Furthermore, (J_n-1D - N)A_n =

a₁₁ - - - - - - a_1t
| |
| |
a_n-1,1 - - - - - - a_n-1,t
0 - - - - - - 0
| |
| |
0 - - - - - - 0
-
0 a₁₂ - - - - - a_1t
| \ \ |
| \ \ |
| \ \ |
| \ \ |
| \ \ |
| \ a_t-1,t
0 - - - - - - 0

A_n =

a₁₁ 0 - - - - - 0
| \ \ |
| \ \ |
a_1,n-1 - - a_n-1,n-1 0 - - 0
0 - - - 0 a_n,n+1 - a_nt
| \ \ |
| \ a_t-1,t
0 - - - - - - 0
0 - 0
| |
| |
0 - 0
a_n1 - a_nt
0 - 0
| |
0 - 0
= 0.

Q.E.D.

Proposition 1. (I - N)(E_n - I) = J_nD.

Proof. By induction on n:

Note that E₁ - I = I + A₁ - I = A₁. So (I - N)(E₁ - I) = (I - N)A₁ =

1 -a₁₂ - -a_1t
0 \ \ |
| \ \ -a_t-1,t
0 - 0 1
a₁₁ - - a_1t
0 - - 0
| |
0 - - 0
=
a₁₁ - - a_1t
0 - - 0
| |
0 - - 0
= J_nD

by the first result of the proof of Lemma 1.

Assume that (I - N)(E_{k -1} - I) = J_{k -1}D for some k with 2 ≤ k ≤ t. Then

(I - N)(E_k - I) = (I - N)(E_{k - 1}(I + A_k) - I)
= (I - N)(E_{k - 1} + E_{k - 1}A_k - I)
= (I - N)(E_{k - 1} - I) + (I - N)(E_{k - 1}A_k)
= J_{k - 1}D + (I - N)(E_{k - 1}A_k - A_k + A_k)
= J_{k - 1}D + (I - N)(E_{k - 1} - I)A_k + (I - N)A_k
= J_{k - 1}D + J_{k - 1}DA_k + A_k - NA_k
= (J_{k - 1}D + A_k) + (J_{k - 1}D - N)A_k
= J_kD + 0
= J_kD

by Lemma 1.

Q.E.D.

From now on, let dim V = n = t, replace a_ij by -2a_ij/a_ii, and set T(v_i) = I + A_i =

1 0 - - - - - - 0
0 \ \ |
| \ \ \ |
0 - 0 1 0 - - - 0
-2a_i1/a_ii - - -2a_i,i-1/a_ii -1 -2a_i,i+1/a_ii - - -2a_in/a_ii
0 - - - 0 1 0 - 0
| \ \ \ |
| \ \ 0
0 - - - - - - 0 1

Lemma 2. det(T(v₁)T(v₂)×××T(v_n) - I) = (-2)ⁿdet(a_ij)/a₁₁×××a_nn.

Proof. Since n = t, we have J_nD = J_tD = ID = D. We may obtain det(D) = det(-2a_ij/a_ii) by multiplying each row of (a_ij) by -2 and (for 1 ≤ i ≤ n) the ith row by 1/a_ii; elementary linear algebra (see Anton, p. 65) tells us that this yields det(D) = (-2)ⁿdet(a_ij)/a₁₁×××a_nn. Also from elementary linear algebra, we get det(I - N) = 1 since I - N is upper triangular (see the proof of Proposition 1 and Anton, p. 64). Taking determinants on both sides of the result of Proposition 1 now yields

det(T(v₁)×××T(v_n) - I) = det(((I + A₁)×××(I + A_n)) - I)
= det(E_n - I)
= det(I - N)det(E_n - I)
= det((I - N)(E_n - I))
= det(J_nD)
= det(D)
= (-2)ⁿdet(a_ij)/a₁₁×××a_nn.

Q.E.D.

Definitions. Let B be a symmetric bilinear form on V, and let W be a subspace of V. We define the orthogonal complement W^* of W to be {v Î V : B(v,w) = 0 " w Î W}. We call rad(W) = W Ç W^* the radical of W. It is easy to prove that W^* and rad(W) are subspaces of W. We say that W is nondegenerate if rad(W) = {0}. In this case, V = W + W^* (O’Meara, p. 102).

We assume henceforth that V is nondegenerate and has a symmetric bilinear form B with associated quadratic form Q.

Let M : V ® V be an injective linear transformation such that for all v, w Î V, B(M(v),M(w)) = B(v,w). M is called an isometry of V onto itself. For w Î V, define T(w) : V ® V by T(w)(v) = v - (2B(v,w)w/Q(w)) for v Î V. T(w) is the symmetry on V with respect to w.

Let {v₁, . . . ,v_n} be a basis for V. Then if v Î V, v = a₁v₁ + . . . a_nv_n for some a₁, . . . ,a_n Î F. Note that

1 0 - - - - - - 0
0 \ \ |
| \ \ \ |
0 - 0 1 0 - - - 0
-2a_i1/a_ii - - -2a_i-1,i/a_ii -1 2a_i+1,i/a_ii - - 2a_in/a_ii
0 - - - 0 1 0 - 0
| \ \ \ |
| \ \ 0
0 - - - - - - 0 1
a₁
|
|
a_i-1
a_i
a_i+1
|
|
a_n

=
a₁
|
|
a_i-1
a_i - (2B(v,v_i)/Q(v_i))
a_i+1
|
|
a_n

under the interpretation a_ij = B(v_i,v_j). Indeed, in the ith row we have

-2a₁a_i1

a_ii
+ ××× +
-2a_i-1a_i,i-1

a_ii
+ (-a_i)
+
- 2a_i+1a_i,i+1

a_ii
+ ××× +
- 2a_na_in

a_ii

=
a_i +
-2(a_ia_i1 + ××× + a_na_in)

a_ii
=
a_i +
-2(a_iB(v_i,v₁) + ××× + a_nB(v_i,v_n))

Q(v_i)
=
a_i +
-2(B(v_i,a₁v₁) + ××× +B(v_i,a_nv_n))

Q(v_i)
=
a_i -
2B(v_i,a₁v₁ + ××× + a_nv_n)

Q(v_i)
=
a_i -
2B(v_i,v)

Q(v_i)
=
a_i -
2B(v,v_i)

Q(v_i)

and from this we can see that the choice of terminology for the symmetry on V with respect to v_i was no accident: the latter, as we have just shown, has the matrix representation of I + A_i shown above.

Let M be an isometry of V onto itself. By the theorem of Cartan and Dieudonné (O’Meara, p. 102) we know that M has a representation as a product of symmetries,

M = T(w₁)×××T(w_k)

(1)

with k ≤ n and Q(w_i) ≠ 0 for i = 1,...,n.

Proposition 2. Let M be as in (1) with k minimal. Let W = áw₁,...,w_kñ. Then M has no eigenvalue equal to 1 if and only if V = W.

Proof. (Þ) For i = 1,...,k, let w_i^* = {v Î V: B(v,w_i) = 0}. Since W is spanned by {w₁,...,w_k}, for w Î W we may find a₁,...,a_k Î F such that w = a₁w₁ + ××× + a_kw_k . Assume that w′ Î Çw_i^*; then for w Î W we have

B(w’,w) = B(w’,a₁w₁ + ××× + a_kw_k)
= B(w’,a₁w₁) + ××× + B(w’,a_kw_k)
= a₁B(w’,w₁) + ××× + a_kB(w’,w_k)
= a₁(0) + ××× + a_k(0)
= 0

so that w’ Î W^*. Therefore, Çw_i^* Í W^*. On the other hand, if w Î W^*, then B(w,w_i) = 0 for i = 1,...,k since w_i Î W; thus W^* Í Çw_i^*. So we have W^* = Çw_i^*.

Let w Î W^* = Çw_i^*; then for i = 1,...,k, B(w,w_i) = 0 and

T(w_i)(w)
=
w -
2B(w,w_i)

Q(w_i)
w_i = w

and thus M(w) = T(w₁)×××T(w_k)(w) = w. Therefore (M - I)(w) = 0 and if w ¹ 0, w is an eigenvector corresponding to the eigenvalue 1 of M. But by hypothesis, M has no eigenvalues equal to 1, so we must have w = 0. It follows that W^* = {0}. In particular, rad(W) = W Ç W^* = {0} so that W is nondegenerate. We conclude that V = W + W^* = W + {0} = W.

(Ü) Assume now that V = W. Then V = áw₁,...,w_kñ; since k is minimal and dim(V) = n, k = n and áw₁,...,w_nñ is a basis for V. Under the interpretation v_i = w_i, a_ij = B(w_i,w_j), Lemma 2 yields

det(M-I) = (-2)ⁿdet(B(w_i,w_j))/Q(w₁)×××Q(w_n).

(2)

Since V is nondegenerate, d(V) = det(B(w_i,w_j)) ¹ 0, so det(M - I) ¹ 0. We conclude that M has no eigenvalue equal to 1.

Q.E.D.

Definition. Let M be as above, and let U be a subspace of V. We say that M acts as a unipotent operator on U if for each u Î U, there exists a t Î N such that (M - I)^t(u) = 0.

We may assume that the largest subspace on which M acts as a unipotent operator contains all other such subspaces. Indeed, let U be a subspace and let u’ be a vector not in U such that (M - I)^m(u’) = 0 for some m Î N. Let u Î U be arbitrary; then for some n Î N, (M - I)ⁿ(u) = 0. Let t = max{m,n}; then

(M - I)^t(u + au’) = (M - I)^t(u) + (M - I)^t(au’)
= (M - I)^t-n((M - I)ⁿ(u)) + a(M - I)^t-m((M - I)^m(u’))
= (M - I)^t-n(0) + a(M - I)^t-m(0)
= 0

so that M acts as a unipotent operator on U + áu’ñ. Continuing in this way, we can (since V is finite-dimensional) obtain a subspace upon which M acts as a unipotent operator which contains all other such subspaces.

Lemma 3. Let U be the largest subspace of V on which M acts as a unipotent operator, and let W be as in Proposition 2. Then V = U + W.

Proof. Let v Î V. Then

M(v)

T(w₁)×××T(w_k)(v)

T(w₁)×××T(w_k-1)(v -

2B(v,w_k)

Q(w_k)

w_k)

T(w₁)×××T(w_k-1)(v -

w_k’) for w_k’ÎW

T(w₁)×××T(w_k-2)(v -

w_k’ -

2B(v-w_k’,w_k-1)

Q(w_k-1)

w_k-1)

T(w₁)×××T(w_k-2)(v -

w_k-1’) for w_k-1’ÎW

×××××××××××××××

v - w₁’ for w₁’ÎW

so that (M - I)(v) = v - w₁’ - v = -w₁’ Î W. It follows that im(M - I) Í W.

Now consider the operators M - I, (M - I)², (M - I)³, . . . on V. Note that ker(M - I) Í ker(M - I)² Í . . . are subspaces of V. Since V is finite-dimensional, there must exist k Î N such that ker(M - I)^k = ker(M - I)^k+j for all j Î N. Let V₀ = ker(M - I)^k and let V₁ = im(M - I)^k.

Let x Î V₀ Ç V₁. Since x Î V₁, there must exist some y Î V such that x = (M - I)^k(y). Since x Î V₀, we must also have (M - I)^k(x) = 0. Thus 0 = (M - I)^k(x) = (M - I)^k((M - I)^k(y)) = (M - I)^2k(y). But ker(M - I)^2k = ker(M-I)^k by the choice of k. It follows that y Î ker(M - I)^k, so that we have x = (M - I)^k(y) = 0. Therefore V₀ Ç V₁ = {0}.

Since W Ê im(M - I) Ê im(M - I)² Ê . . . , we have V₁ Í W and V₀ Í U. So

dim(U + W)	≥	dim(V₀ + V₁)
	=	dim(V₀) + dim(V₁) - dim(V₀ Ç V₁)
	=	nullity(M - I)^k + rank(M - I)^k - 0
	=	dim(V)

and of course dim(V) ≥ dim(U + W); thus dim(V) = dim(U + W). It follows that V = U + W, as desired.

Q.E.D.

Lemma 4. Let W be as in Proposition 2, and let M’ be defined on W/rad(W) by M’(w+rad(W)) = M(w)+rad(W). Then M’ is a well-defined isometry on W/rad(W). Furthermore, let X/rad(W) be the largest subspace of W/rad(W) on which M’ acts as a unipotent operator, and let U be as in Lemma 3. If X/rad(W) is nondegenerate, then X Í U and W Í X + X^*.

Proof. Given the symmetric bilinear form B on V, define B’ on W/rad(W) by B’(w+rad(W),w’+rad(W)) = B(w,w’). It is trivial to verify that B’ is a symmetric bilinear form on W/rad(W). To see that B’ is well-defined, note that if w₁ + rad(W) = w + rad(W) and w₁’ + rad(W) = w’ + rad(W), it follows that w - w₁,w’ - w₁’ Î rad(W) so that B(w - w₁,w’) = 0, whence B(w,w’) = B(w₁,w’). By a similar argument, B(w₁,w’) = B(w₁,w₁’). Therefore B(w,w’) = B(w₁,w’) = B(w₁,w₁’).

Given this symmetric bilinear form B’ on W/rad(W), it is easy to prove that M’ is an isometry, for it is clear from the proof of Lemma 3 that M maps W into W. If, say, w and w’ belong to the same coset of W/rad(W), then w-w’ Î rad(W), and so M(w - w’) = w - w’ Î rad(W) by the proof of Proposition 2, since rad(W) Í W^*. Therefore

(M(w) + rad(W)) - (M(w’) + rad(W))	=	(M(w) - M(w’)) + rad(W)
	=	M(w - w’) + rad(W)
	=	(w - w’) + rad(W)
	=	0 + rad(W)

so that M(w) + rad(W) = M(w’) + rad(W). Hence M’ is well-defined.

Now suppose that X/rad(W) as defined above is nondegenerate; let x Î X. Certainly x + rad(W) Î X/rad(W); since M’ acts as a unipotent operator on X/rad(W), there is t Î N such that (M-I)^t(x)+rad(W)=(M’-I)^t(x+rad(W)) = 0 + rad(W); in particular, (M - I)^t(x) Î rad(W). Now rad(W) Í W^* so that (M - I)^t(x) Î W^*; by the proof of Proposition 2, we have M((M - I)^t(x)) = (M - I)^t(x). Therefore, (M - I)^t+1(x) = 0. It follows that x Î U. So X Í U.

We shall now prove that (X/rad(W))^* = X^*/rad(W). Let w+rad(W) Î (X/rad(W))^*. Then if x+rad(W)ÎX/rad(W), we have B(w,x) = B’(w + rad(W),x + rad(W)) = 0. Then w Î X^* so that w + rad(W) Î X^*/rad(W). Therefore (X/rad(W))^* Í X^*/rad(W). On the other hand, let x + rad(W) Î X^*/rad(W); then x Î X^* and if x’ Î X/rad(W), then B’(x+rad(W),x’+rad(W)) = B(x,x’) = 0. Therefore x+rad(W) Î (X/rad(W))^* and X^*/rad(W) Í (X/rad(W))^*. It now follows that (X/rad(W))^* = X^*/rad(W).

Since X/rad(W) is assumed to be nondegenerate, we have

W/rad(W)	=	X/rad(W) + (X/rad(W))^*
	=	X/rad(W) + X^*/rad(W).

Consider w Î W. For some x Î X, x’ Î X^*, we have

w + rad(W)	=	(x + rad(W)) + (x’ + rad(W))
	=	(x + x’) + rad(W)

so that w - (x + x’) Î rad(W); say w’ = w - (x + x’). Then w = x + x’ + w’. To prove that W Í X + X^*, it suffices to show that w’ Î X^*.

Since w’ Î rad(W), then in particular w’ Î W^*. Since X Í W, for any y Î X, we have y Î W, so that B(w’,y) = 0. It follows that w’ Î X^*, which concludes the proof.

Q.E.D.

Proposition 3. The largest subspace U of V on which M acts as a unipotent operator is nondegenerate.

Proof. By induction on dim(V) = n. Let W be as in Proposition 2. Then by Lemma 3 we have V = U + W. We set R = rad(U) = U Ç U^* and try to show that R = {0}.

If n = 1, then either V = U or V = W. If V = U, then U is nondegenerate because V is assumed to be so. If V = W, then by Proposition 2, M has no eigenvalue equal to 1. It follows that if (M - I)(v) = 0 for some v Î V, then v = 0. Likewise, if (M - I)²(v) = 0, then we have 0 = (M - I)²(v) = (M - I)((M - I)(v)) so that (M - I)(v) = 0, whence v = 0. Proceeding inductively in this way, we may conclude that (M - I)^t(v) = 0 Þ v = 0 for any t Î N. It follows that U = {0}; in this case R = U Ç U^* = {0} Ç {0}^* = {0} and we are done.

Now assume that the largest subspace of any nondegenerate vector space of dimension less than n upon which any given isometry acts as a unipotent operator is nondegenerate.

Now consider dim(V) = n. By the above argument, we are done if V = W, so assume that V ¹ W. Note that W^* Í U (since M acts as a unipotent operator on W^* by the proof of Proposition 2); therefore (see O’Meara, p. 92)

R = U Ç U^* Í U^* Í (W^*)^* = W.

Let X/rad(W) be as in Lemma 4. Since W ¹ V, we have dim(W) < dim(V). Clearly dim(W/rad(W)) ≤ dim(W), so that dim(W/rad(W)) ≤ dim(W) < dim(V) = n. It now follows from the induction assumption that X/rad(W) is nondegenerate.

Since U^* Í W, we get R = U Ç U^* Í U Ç W. We shall show that U Ç W Í X so that R Í X. Indeed, let w Î U Ç W; then w Î U so that (M - I)^t(w) = 0 for some t Î N. Letting M’ be as in Lemma 4, we have

(M’ - I)^t(w + rad(W))	=	(M’ - I)^t(w) + rad(W)
	=	0 + rad(W)

whence by the maximality of X/rad(W), w + rad(W) Î X/rad(W) so that w Î X.

Since R = rad(U), R is orthogonal to U; since R Í X = (X^*)^*, R is orthogonal to X^*. Hence R is orthogonal to U + X^*. By Lemma 4 we have X Í U and W Í X + X^*, since X/rad(W) is nondegenerate. Therefore

V = U + W Í U + X + X^* Í U + X^*

and certainly U + X^* Í V, so V = U + X^*. Now V is nondegenerate, so we have V^* = (U + X^*)^* = {0}, whence R Í (U + X^*)^* = {0} gives us R = {0}, as desired.

Q.E.D.

Definition. Let M be an isometry on the quadratic space V with associated quadratic form Q. If M = T(w₁)T(w₂)×××T(w_k), we define the spinor norm of M to be Θ(M) = Q(w₁)Q(w₂)×××Q(w_k) Î F^*/F^*2. This is a well-defined invariant (i.e., independent of the choice of product of reflections) in F^*/F^*2 (see O’Meara, pp. 131-137), Furthermore, it is easily seen that Θ is a homomorphism on the group (under the operation of composition) of isometries of V onto itself.

Proposition 4. If M acts as a unipotent operator on V, then Θ(M) = 1.

Proof. By induction on dim (V) = n.

Let dim(V) = á v ñ. Since M acts as a unipotent operator on V, there must exist some minimal t Î N such that (M - I)((M - I)^t-1(v)) = (M - I)^t(v) = 0. If t = 1, let v’ = v; if t > 1, let v’ = (M - I)^t-1(v). Since v’ ¹ 0 (otherwise v = 0 or t is not minimal, respectively), v’ = αv for some 0 ¹ α Î F and we have 0 = (M - I)(v’) = (M - I)(αv) = α(M - I)(v). It follows that (M - I)(v) = 0, whence M(v) = v. So for any w Î V, say, w = βv, M(w) = M(βv) = βM(v) = βv = w. Therefore M = I.

Since M may be represented as a product of reflections, say, M = T(w₁)×××T(w_k) with Q(w_i) ¹ 0, then in particular, Q(w₁) ¹ 0 and w₁ ¹ 0 so that V = áw₁ñ. Hence for w Î V, say w = βw₁,

T(w₁)T(w₁)(w)	=	T(w₁)T(w₁)(βw₁)
	=	βT(w₁)T(w₁)(w₁)
	=	βT(w₁)(-w₁)
	=	βw₁
	=	w
	=	I(w)
	=	M(w)

so that M = T(w₁)T(w₁). Therefore Θ(M) = Q(w₁)Q(w₁) = (Q(w₁))² = 1 in F^*/F^*2.

Now assume that the spinor norm of any isometry acting as a unipotent operator on any vector space of dimension < n is 1.

Let W be as in Proposition 2. Let 0 ¹ v Î V; then we have (M - I)^t(v) = 0 for some minimal t Î N. If t = 1, then 1 is an eigenvalue of M corresponding to v. If t > 1, then 1 is an eigenvalue of M corresponding to (M - I)^t-1(v) (which is in this case an eigenvector of M as (M - I)((M - I)^t-1(v)) = (M - I)^t(v) = 0). In either case, M has an eigenvalue equal to 1 and by Proposition 2, V ¹ W. Then there is 0 ¹ w Î W^* (for otherwise W^* = {0} implies that V = W by the proof of Proposition 2). Consider the space Y / áwñ, where Y = áwñ + Z and Z = {v Î V : B(v, w) = 0}.

Given the symmetric bilinear form B on V, define B" on the space Y / áwñ by B"(aw + z + áwñ, a’ w + z’ + áwñ) = B(z,z’) where z, z’ Î Z. It is trivial to verify that B" is a symmetrical bilinear form on V/áwñ. To see that B" is well-defined, note that if a₁w + z₁ + áwñ = aw + z + áwñ, a’₁w + z’₁ + áwñ = a’w + z’ + áwñ, then z - z₁, z’ - z₁’ Î áwñ so that (in particular) z - z₁ = βw for some β Î F and

B(z,z’) - B(z₁,z’)	=	B(z - z₁,z’)
	=	B(βw,z’)
	=	βB(w,z’)
	=	0

since z’ Î Z. Hence B(z,z’) = B(z₁,z’). By a similar argument, B(z₁,z’) = B(z₁,z₁’) so that B(z,z’) = B(z₁,z’) = B(z₁,z₁’).

Define M" on the space Y/áwñ by M"(y + áwñ) = M(y) + áwñ. Given the symmetric bilinear form B", it is fairly easy to prove that M" is an isometry; to see that M" is well-defined, suppose that y + áwñ = y’ + áwñ. Then y - y’ Î áwñ so that y - y’ = αw for some α Î F. Therefore M(y) - M(y’) = M(y - y’) = M(αw) = αM(w) = αw since M acts trivially on W^* by the proof of Proposition 2. Thus M(y) - M(y’) Î áwñ so that M(y) + áwñ = M(y’) + áwñ.

Furthermore, M" acts as a unipotent operator on Y/áwñ. To see this, let y + áwñ be any element of Y + áwñ. Since in particular y Î V, and since M acts as a unipotent operator on V, there exists n Î N with (M - I)ⁿ(y) = 0. Thus we have (M” - I)(y + áwñ) = M"(y + áwñ) - I(y + áwñ) = M(y) + áwñ - (y + áwñ) = (M(y) - y) + áwñ = (M - I)(y) + áwñ so that(M” - I)ⁿ(y + áwñ) = (M - I)ⁿ(y) + áwñ = áwñ. It follows that M" acts as a unipotent operator on Y/áwñ.

Note that for i = 1 to k, Q"(w_i + áwñ) = B"(w_i + áwñ,w_i + áwñ) = B(w_i,w_i) = Q(w_i) ¹ 0. Now let T"(w_i + áwñ) be the symmetry on Y/áwñ with respect to w_i + áwñ. For arbitrary y + áwñ Î Y/áwñ, we have

T"(w_i + áwñ)(y + áwñ)

y + áwñ -

2B"(w_i + áwñ,y + áwñ)

Q"(w_i + áwñ)

(w_i + áwñ)

y + áwñ -

2B(w_i,y)

Q(w_i)

(w_i + áwñ)

(y -

2B(w_i,y)

Q(w_i)

w_i) + áwñ

T(w_i)(y) + áwñ

so that

[T"(w₁ + áwñ)×××T"(w_k + áwñ)](y + áwñ)	=	[T"(w₁ + áwñ)×××T"(w_{k -1} + áwñ)](T"(w_k + áwñ)(y + áwñ))
	=	[T"(w₁ + áwñ)×××T"(w_{k -1} + áwñ)](T(w_k)(y) + áwñ)
	=	[T"(w₁ + áwñ)×××T"(w_{k -2} + áwñ)](T(w_{k -1})T(w_k)(y) + áwñ)
	=	×××××××××××××××
	=	×××××××××××××××
	=	[T"(w₁ + áwñ)](T(w₂)×××T(w_k)(y) + áwñ)
	=	T(w₁)×××T(w_k)(y) + áwñ
	=	M(y) + áwñ
	=	M"(y + áwñ).

It follows that M" = T"(w₁ + áwñ)×××T"(w_k + áwñ) is a representation of M" as a product of reflections such that Q"(w_i + áwñ) ¹ 0 for i = 1 to k.

Note that dim(Y/áwñ) < n since w ¹ 0. Therefore

1	=	Θ(M")
	=	Q"(w₁ + áwñ)×××Q"(w_k + áwñ)
	=	Θ(M)

by the induction hypothesis.

Q.E.D.

Lemma 5. M|_U and M|_U^* are well-defined isometries. Furthermore, let the characteristic polynomial of M be given by χ(z). Set χ(z) = (z - 1)^fχ₀(z) where (z - 1) does not divide χ₀(z). Then the characteristic polynomials of M|_U, M|_U^* are given by (z - 1)^f, χ₀(z) respectively.

Proof. Let u Î U; then there is t Î N such that (M - I)^t(u) = 0. Then (M - I)^t(M(u)) = M((M - I)^t(u)) = M(0) = 0. It follows that M(u) Î U for all u Î U. Therefore M|_U is well-defined and is an isometry because M is an isometry.

Since M|_U is an isometry on U, M|_U is injective; since U is finite-dimensional, M must also be surjective. In particular, if we let u Î U, there exists x Î U such that M(x) = M|_U(x) = u. Now let u’ Î U^* be arbitrary. Since M is an isometry, we have B(u,M(u’)) = B(M(x),M(u’)) = B(x,u’) = 0. It follows (since u was also arbitrary) that M(u’) Î U^*. Therefore M|_U^* is well-defined and is an isometry because M is an isometry.

We claim that charpoly(M) = charpoly(M|_U)charpoly(M|_U^*). Indeed, since V = U + U^*, there exists a basis {v₁, . . . ,v_m,v_m+1, . . . ,v_n} for V with v₁, . . . ,v_m Î U, v_m+1, . . . ,v_n Î U^*, and the corresponding matrix for M is in block-diagonal form.

Let α be an eigenvalue of M|_U, and let u be the corresponding eigenvector. Since u Î U, there exists n Î N such that (M - I)ⁿ(u) = 0. Since α is an eigenvalue, M(u) = M|_U(u) = αu and (M - I)(u) = (α - 1)u so that

(M - I)²(u)	=	(M - I)((M - I)(u))
	=	(M - I)((α - 1)u)
	=	(α - 1)(M - I)(u)
	=	(α - 1)²u.

Proceeding inductively in this way, we get 0 = (M - I)ⁿ(u) = (α - 1)ⁿu whence it follows that α = 1. Therefore 1 is the only eigenvalue of M|_U. Furthermore, if 1 is an eigenvalue of M, M(x) = x for some 0 ¹ x Î V whence we get (M - I)(x) = 0 and x Î U. Thus 1 is not an eigenvalue of M|_U^*. We may conclude that (z - 1)^f | charpoly(M|_U) in F[z].

Let {v₁, . . . ,v_n} be the basis for V mentioned above, and let E be a splitting field for charpoly(M|_U). Define V^# = {α₁v₁ + ××× + α_nv_n : α_i Î E} and U^# = {α₁v₁ + ××× + α_mv_m : α_i Î E}; it is not hard to see that these are vector spaces over E. Furthermore, define

B^#(

n
å
i=1

α_iv_i,

n
å
j=1

β_jv_j)

n
å
i=1

n
å
j=1

α_iβ_jB(v_i,v_j)

on V^#; the fact that B^# is a symmetric bilinear form on V^# then follows from the fact that B is a symmetric bilinear form on V. Similarly, define M^#(α₁v₁ + ××× + α_nv_n) = α₁M(v₁) + ××× + α_nM(v_n) on V^#; it follows routinely that M^# is an isometry on V^# with respect to B^#. Furthermore, it is routinely verifiable that M^# acts as a unipotent operator on U^#.

Since M^# and M have the same matrix representations for the given basis by definition of M^#, it makes sense to write “M^#|_U^#”. Indeed, M^#|_U^# and M|_U will have the same matrix representations and hence the same characteristic polynomials. But by an analogue to the above argument, since M^# acts as a unipotent operator on U^#, 1 is the only eigenvalue of M^#|_U^#. Since E is a splitting field for charpoly(M|_U) = charpoly(M^#|_U^#), charpoly(M|_U) factors completely in E[z] as an integer power of z - 1. Since 1 Î F, charpoly(M|_U) must also factor completely in F[z] as an integer power of z - 1. Therefore charpoly(M|_U) = (z - 1)^f, as desired.

Finally, since (z - 1)^fχ₀(z) = χ(z) = charpoly(M) = charpoly(M|_U)charpoly(M|_U^*) = (z - 1)^fcharpoly(M|_U^*), we must have charpoly(M|_U^*) = χ₀(z), which completes the proof.

Q.E.D.

The spinor norm is usually defined (as above) in terms of the symmetries of which the given isometry can be written as a product. Theorem 1, which we are now ready to prove, gives an intrinsic characterization of the spinor norm independent of such a factorization.

Theorem 1. Let M, U, χ(z) be as in Lemma 5, and let V, F be as above. In F^*/F^*2, Θ(M)χ₀(1) = 2ⁿd(U^*).

Proof. Since U is nondegenerate by Proposition 3, V = U + U^* and thus Θ(M) = Θ(M|_U)Θ(M|_U^*) (see O’Meara, p. 139). Indeed, Θ(M) = Θ(M|_U^*) since Θ(M|_U) = 1 by Proposition 4. By Lemma 5, χ₀(z) is the characteristic polynomial of M|_U^*. Finally, since the largest subspace of U^* upon which M|_U^* acts as a unipotent operator is {0}, the orthogonal complement of that space is just U^*. Hence it suffices to prove the theorem for U^*.

Let V’ = U^*, M’ = M|_U^* = M|_V’. Let W and {w₁, . . . w_k} be as in Proposition 2. By Lemma 5, (z - 1) does not divide charpoly(M’) so that M’ has no eigenvalue equal to 1; it follows from Proposition 2 that V’ = W. Furthermore, by (2) in the proof of that result, we have

χ₀(1)

(-1)ⁿdet(M’ - I)

2ⁿdet(B(w_i,w_j))

Q(w₁)×××Q(w_n)

2ⁿd(U^*)

Θ(M’)

which is what we wanted.

Q.E.D.

As a special case of Theorem 1, we obtain a formula for the computation of the spinor norm in terms of a simple determinant. This formula was originally obtained by H. Zassenhaus as a by-product of his proof that the spinor norm is well-defined (i.e., independent of the choice of symmetries of which the given isometry is expressed as a product).

Theorem 2. (Zassenhaus) Let M be an isometry on V such that det(M + I) ¹ 0. Then Θ(M) = det((M + I)/2) in F^*/F^*2.

Proof. det(M + I) ¹ 0, so M has no eigenvalue equal to -1. It follows that -M has no eigenvalue equal to 1. As above, take charpoly(-M) = χ(z) = (z - 1)^fχ₀(z).

We assert that U = {0}. Indeed, assume (-M - I)^t(v) = 0 for some v Î V. Let t be the least positive integer for which this is true for v in particular. If t = 1, then either v is an eigenvector corresponding to an eigenvalue of 1 or v = 0. Since there is no such eigenvalue, in this case v = 0. If, however, t > 1, then (-M - I)((-M - I)^t-1(v)) = 0 so that (-M - I)^t-1(v) is either an eigenvector corresponding to the eigenvalue 1 or is 0. Both are impossible, because there is no such eigenvalue and because t was chosen to be minimal; hence this case yields a contradiction. It follows that v = 0, and we have proved our assertion.

Hence V = U + U^* = U^*, so that by Lemma 5, χ(z) = χ₀(z). It follows that χ₀(1) = det(M + I).

Let {x₁, . . . ,x_n} be an orthogonal basis for V. It is almost trivial to verify that -I = T(x₁)×××T(x_n). It will now follow that Θ(-I) = Q(x₁)×××Q(x_n) = d(V). Furthermore, Θ(-M) = Θ(-I)Θ(-M) = d(V)Θ(M).

Hence by Theorem 1, d(V)Θ(M)det(M + I) = Θ(-M)χ₀(1) = 2ⁿd(U^*) = 2ⁿd(V) so that Θ(M)det(M + I) = 2ⁿ. Hence in F^*/F^*2, multiplying both sides by 2^-2ndet(M + I) yields Θ(M) = 2^-ndet(M + I). By elementary linear algebra, 2^-ndet(M + I) = det((M + I)/2), and we are done.

Q.E.D.

Examples. (1) Let F = Q, V any vector space over Q with dim(V) = 3, {v₁,v₂,v₃} a basis for V over Q. Define

(B(v_i,v_j)) =

4 2 2
2 1 1
2 1 1

M(v₁) = v₁
M(v₂) = v₃
M(v₃) = v₁ - v₂.

Then B is a symmetric bilinear form on V and M is an isometry from V onto itself. Since det(M + I) = 4 ¹ 0, Zassenhaus’s Theorem applies and Θ(M) = det((M + I)/2) = 1/8 = (1/4)(2) = (1/2)²(2) = 2 in Q^*/Q^*2.

(2) Let F = Q, V any vector space over Q with dim(V) = 3, {v₁,v₂,v₃} a basis for V over Q. Define

(B(v_i,v_j)) =

2 1 1
1 3 1
1 1 3

M(v₁) = -v₁
M(v₂) = -v₁ + v₃
M(v₃) = -v₁ + v₂.

Then B is a symmetric bilinear form on V and M is an isometry of V onto itself. Unfortunately, det(M + I) = 0, so Zassenhaus’s Theorem does not apply directly. We shall use Theorem 1.

We find that χ(z) = det(zI - M) = (z + 1)²(z - 1) so that χ₀(1) = 4 = 1 in Q^*/Q^*2. Since

M - I =

-2	-1	-1
0	-1	-1
0	1	-1

it follows by induction that (M - I)ⁿ = (-2)ⁿ(M - I) for all n Î N. Therefore u Î U if and only if (M - I)(u) = 0, that is, U = ker(M - I). By setting the matrix M - I = 0 we find that U is generated by the vector v₁ - v₂ - v₃. Therefore U^* = {v Î V : B(v,v₁ - v₂ - v₃) = 0}. If v = α₁v₁ + α₂v₂ + α₃v₃ Î U^*, we have 0 = B(v,v₁ - v₂ - v₃) = -3(α₂ + α₃). In this case α₂ = -α₃ and it follows that {v₁, v₂ - v₃} is a basis for U^*. By direct computation, d(U^*) = 8 = 2 in Q^*/Q^*2.

Applying Theorem 1, we now have Θ(M) = Θ(M)χ₀(1) = 2ⁿd(U^*) = (8)(2) = 16 = 1 in Q^*/Q^*2.

BIBLIOGRAPHY

Anton, Howard. Elementary Linear Algebra. John Wiley & Sons, New York, 1973.

Dickson, L. E. Studies in the Theory of Numbers. University of Chicago Press, 1930.

Mason, Geoffrey. “Groups, Discriminants and the Spinor Norm.” Bulletin of the London Mathematical Society 21 (1989) 51-56.

O'Meara, O. T. Introduction to Quadratic Forms. Springer-Verlag, Berlin, 1963.

Zassenhaus, H. “On the Spinor Norm.” Archiv der Mathematik, Basel, 13 (1962) 434-451.

Author’s Note: I wrote this paper in the spring of 1991 as the research paper for a Master of Science degree in Mathematics, with which degree I graduated that May from Southern Illinois University at Carbondale. It is in the subspecialty of abstract linear algebra.