|
|
Issue
1, June 1999
Biological & Biomedical Sciences
The
Gramian and K-Volume in N-Space: Some Classical Results in Linear
Algebra1
Nils Barth
Harvard University
Abstract
We give a formula
for determining when a set of k vectors in n-space is linearly independent,
and if so, what is the volume of the parallelepiped with these vectors
as its sides. This function, the gramian, allows one to partially
apply the determinant when the number of vectors you have is less
than the dimension of the ambient space. These results were classically
known, but are not part of the standard linear algebra curriculum.
Prerequisites: familiarity with matrices and determinants.
Introduction
Given
k vectors in n-dimensional space, when are they linearly independent,
and what is the k-volume of the parallelepiped that they define?
When k = n, the answer is familiar: arrange the vectors in a matrix
and take the determinant. The vectors are then linearly dependent
iff the determinant is zero, and otherwise the volume of the parallelepiped,
Vol(v1,...,vk), is the absolute value of the
determinant. For k < n, the widely known answers are piecemeal:
for k = 2, n = 3, one can use the cross product; to determine linear
dependence, one can use gaussian elimination, though this is an
algorithm, rather than a formula. Ideally, one would have some analog
for the determinant which applies when the number of vectors is
not equal to the dimension of the ambient space. This answer is
provided in the gramian, which we introduce below; we loosely follow
[1].
Notation and Review
Let
us fix a vector space Kn, where K is a subfield of the
complex numbers. For example, let K = R, the real numbers,
which is euclidean space; the other main example is the complex
numbers, and there is little loss of generality in only considering
these. We write vectors vertically and covectors (elements of the
dual space (Kn)*) horizontally. We write W
< V to indicate that W is a subspace of V. Given a collection
of vectors { v1,...,vk}, denote the parallelepiped
with sides v1,...,vk by P(v1,...,vk).
If
{ v1,...,vk} is linearly dependent, this definition
doesn't make sense; more formally,
| P(v1,...,vk)
= |
{
|
a1v1+...+anvn
|0 =< ai =< 1, a1+...+an=<1}
|
|
|
(1) |
When
{ v1,...,vk} are linearly independent, P(v1,...,vk)
has a well-defined k-volume (volume as a subset of k-dimensional
space); when they are linearly dependent, this volume is zero. Note
that if you have, for instance, 3 vectors in 2-space, then P(v1,...,v3)
has non-zero volume as a subset of 2-space, but it is "flat" and
thus has zero volume as a subset of 3-space.
Recall
that an inner product is a choice of distance (and also
volume) in a vector space; the main example is the dot product,
defined as
which
is a scalar, and where v* = conj(v)T is the
complex conjugate of the transpose. We take complex conjugates so
that complex vectors have real length; if K = R, this reduces
to v·w = vTw. We define the norm (length) of
v as
Matrices
that preserve distances and angles, i.e., Ov·Ow = v·w, are
of particular interest, and are called orthogonal;2 these correspond to rigid motions. For
instance, orthogonal transforms of R2 are rotations
and reflections.
The Gramian and Its Basic Properties
Definition
4 [gramian] The gramian of v1,...,vk
is G(v1,...,vk) = detM*M, where
and
M* = [conj(M)]T is the conjugate transpose
of M.
Note
the similarity to the norm (3);
in fact, G(v) = ||v||, so the gramian of a single vector is the
length, i.e., the 1-dimensional volume, of that vector.
If
we calculate the entries of M*M, we obtain the following
characterization of the gramian:
Definition
6 [gramian-alternate] G(v1,...,vk)
= detA, where aij = vi·vj.
This
presentation shows that the matrix M*M captures all the
geometric information about the vectors { v1,...,vk}:
the length of the vectors and the angles between them. Further,
this is the only information that it contains; we've lost the particular
orientations of the vectors and their embedding in n-space. We claim
that this is enough information to easily determine the volume.
Consider
the case where k = n; then det M*M = [conj(det M)](det
M) = |det M|2. In particular, G(v1,...,vk)
>= 0. Since Vol(v1,...,vk) = |det M|, we
obtain:
| Vol(v1,...,vk)
= |
(det
M*M)^1/2 |
|
=
|
G(v1,...,vk)^1/2 |
(7) |
The (i,j)-th entry of A = M*M is vi·vj;
the above shows that the volume depends only on these inner products.
For
any collection of k vectors, the above holds by simply restricting
to a k-dimensional subspace containing them; that is, Vol(v1,...,vk)
= detA. By using (6),
we obtain the following geometric characterization of the gramian:
Proposition
8 Vol(v1,...,vk) = G(v1,...,vk)^1/2.
Note
that in case k =< n, we still have G(v1,...,vk)
>= 0, so this is well-defined.
Proof.
The proof was sketched above; we fill in the details here.
Given
v1,...,vk, where k =< n, assume that they
are linearly independent3 and pick an orthonormal basis w1,...,wk
for their span, W. Extend to an orthonormal basis w1,...,wn
for Kn; the matrix O sending wi --> ei
is an orthogonal change of coordinates, so it does not change inner
products: O(v)·O(w) = v·w. Let v'i = O(vi);
then
| (v'i)
= |
[
[
[
[
[
[
[
[
[
[
[
[ |
|
|
]
]
]
]
]
]
]
]
]
]
]
] |
, |
since
vi is in the span of w1,...,wk.
The set { v'1,...,v'k} visibly lies in the
k-dimensional subspace of vectors with last n-k coordinates zero,
so the k-volume of P(v'1,...,v'k) is |det
M|, where mij is defined as above. Effectively, we are
restricting to the subspace W. By the discussion before this proposition,
Vol(v'1,...,v'k) = {det M*M}^1/2
(we're just dropping the trailing zeros). Since O is orthogonal,
| Vol(v'1,...,v'k)
= Vol(v1,...,vk) |
|
(9) |
and v'i·v'j = vi·vj, so
|
G(v1,...,vk)^1/2
|
=
Vol(v'1,...,v'k) = Vol(v1,...,vk), |
|
(10) |
as
desired.
When
k > n, the vectors are linearly dependent, so the k-volume of
P(v1,...,vk) is zero. In this case the gramian
is zero, by the argument in the proof of (11),
below. [X]
In
particular, the k-volume of P(v1,...,vk) is
zero iff { v1,...,vk} don't form the sides
of a k-parallelepiped; that is, if P(v1,...,vk)
has dimension less than k. This yields:
Corollary
11 [Gram's criterion for linear dependence of vectors]
The set of vectors { v1,...,vk } is linearly
dependent iff G(v1,...,vk) = 0.
Proof.
We just dealt with the case k =< n; it remains to show that if
k > n, the gramian is zero. Consider A = M*M; each
row of the k×k matrix A is a linear combination of rows of M, of
which there are n. Thus, the row-rank of A is at most n < k,
so its determinant is zero. [X]
Note
that the gramian is always nonnegative,4 and equals zero iff the vectors are linearly
dependent. As a consequence, we obtain the familiar:
Corollary
12 [Bunyakovskii-Cauchy-Schwarz inequality] (v·w)2
=< ||v|| ||w||, with equality iff v,w are linearly dependent
(one is a multiple of the other).
Proof.
G(v,w) => 0, with equality iff v,w are linearly dependent. Now
| |
)
)
) |
=
|
det
|
|
(
(
( |
|
)
)
) |
=
||v|| ||w||-(v·w)2;(13) |
the
result follows.
Extensions
The above discussion with matrices didn't
depend on choice of coordinates, so it holds for any linear transform
T:V --> W, so long as you have fixed inner products on V, W,
which is equivalent to isomorphisms V--~-->V*, W--~-->W*.
This yields
| V --T--> W --~--> W*
--T*--> V* --~--> V, |
|
(14) |
the composition of which is an endomorphism of V (linear operator
on V), and thus has a determinant. The gramian of T (with respect
to the inner products on V, W) is then the determinant of this map.
Recall that the inner product of v, w with respect to an inner
product other than the dot product is given by [conj(v)]TPw,
where P is some invertible matrix. Similarly, M*PM is
the gramian with respect to this inner product, which follows from
expanding (6).
The key part of the restriction on K is that the characteristic
be zero; the above doesn't work in positive characteristic. For
instance, suppose charK = p > 0, and consider
| v1T
= ( |
p
1,...,1
|
, |
p
0,...,0
|
),v2T
= ( |
p
0,...,0
|
, |
p
1,...,1
|
). |
|
Then
but { v1,v2 } is clearly linearly independent.
The problem is that nonzero vectors can have zero norms; worse,
there are non-trivial subspaces (such as the span of { v1,v2
}) in which the inner product of any two vectors is zero. This prevents
us from naively following our geometric intuition from euclidean
space, and is one of the reasons that the study of quadratic forms
(generalized inner products) over positive characteristic is interesting.
Footnotes
1
Date: July 16, 1999.
1991 Mathematics Subject Classification. 15A15
The author wishes to thank Antonia Bluher for suggesting this topic.
2
If the vector space is complex, they are instead called unitary.
3
If they are not linearly independent, instead take W to be some
k-dimensional subspace containing them.
4
In particular, it is real; this eliminates the need to take the
absolute value to obtain the volume. This is more significant in
the complex case, where it shows that there is a well-defined notion
of real-valued volume.
References
- [1]
- F. R.
Gantmacher, Matrix theory, vol. 1, Chelsea Publishing
Company, New York, 1959, Translation of TEORIYA MATRITS by G
AHTMAXEP.
Journal of Young Investigators. 1999. Volume Two.
Copyright © 1999 by Nils Barth and JYI. All rights reserved.
|
|