A vector in mathematics is just a tuple of numbers. When you first come in contact with vectors maybe in school you usually deal with tuples of two numbers like or with three numbers like
.
So far so good. You have vectors and you can represent them geometrically as arrows in a nice two-dimensional coordinate system. You can add them, multiply them and perform the dot product. At this point the term of a vector basis does not concern you at all.
In order to go a step further I want to focus here on the way vectors are defined in respect to a specific vector basis and how to perform a coordinate transformation to change the vector basis. We want to think of vectors here as tuples of coordinates that are always defined in respect to a certain vector basis.
So what is a vector basis? A basis for a vector space is a set of vectors that are linearly independent of each other and span this vector space. By span I mean that all other vectors in the vector space can be derived by linear combination of these basis vectors. The most basic vector basis is the standard basis . Let’s work out all examples in
here:
So let’s assume the vector is defined in respect to the standard basis. That means the tuple
represents the coordinates of
in respect to the standard basis. We notate it like this:
This is the easy way of defining a vector by taking as the basis. You can draw a 2d coordinate system and draw an arrow from the origin to the point (2, -1) to geometrically represent vector v.
Now to show the difference we further take another vector defined in respect to the basis
.
,
So this is an important step. We have a vector basis B being different from E. But hey, we have defined B in respect to the standard basis ! Why? Vectors forming a basis are just vectors and they are defined in respect to a specific vector basis like every vector is. Let’s analyse this further. Defining a vector basis always means that you create a relation towards another basis. So a basis is not defined somewhere on the green lawn but in reference to another existing basis. In this way we create a relation between vectors throughout different coordinate systems.
A straight forward way to realize this is to define a vector basis like different from the standard basis and try to draw the basis vectors of
on a paper without taking the standard basis in mind as a reference for the coordinates. You can draw them in an arbitrary way. It further means that vectors themselves are not aware of the basis they are defined in respect to.
In the above example I derived B from the standard basis being rotated by 90° in counter-clock-wise order. To finally get the coordinates of the two (rotated) basis vectors I thought of what coordinates they have in respect to the standard basis.
Linear combination and basis change
Let’s go further and see how we can represent a vector by its basis. The vector basis definition says every vector can be represented by a linear combination of the basis vectors and its coordinates. For the vector v it looks like this:
In contrast we have the linear combination for :
We notice that the linear combination for yields a resulting vector equal to
. In contrast the linear combination representing
yields coordinates different to the original coordinates of
:
. So what happened here? We actually changed the vector basis by calculating the result of the linear combinations. The coefficients in the linear combination are the coordinates of the vector in respect to the basis the vector is defined in. For
this is
. But the result of the linear combination are the coordinates of the vector in respect to the basis where the vectors basis itself is defined in. As the two base vectors of the basis
are defined in respect to the standard basis
we get the coordinates of
in respect to
. That is
.
Note that we didn’t see a change of coordinates in the linear combination of vector above because
is already defined in respect to the standard basis
.
Well let us summarize what we observed so far. We have a vector basis formed from a set of base vectors which coordinates are defined in respect to another vector basis
. We further have a vector
and the coordinates of
in respect to
. If we represent
by linearly combining the base vectors of
we obtain the coordinates of
in respect to
. We call this vector basis change or coordinate transformation.
Let me try to explain this insight more intuitively.
We formed a basis by defining base vectors in respect to an already existing basis. So the coordinates of the base vectors tell us how we have to linearly combine the existing base vectors to derive the new base vectors. Thus the base vectors of the new basis represent the way to get from an existing basis to the new basis. For me this sounded confusing first, because considering this I would expect doing the linear combination would get my coordinates from the existing basis to the new basis. But the opposite is the case as we saw above. The reason is that every base vector of the new basis comprises the way to derive itself from the existing base vectors. And every vector defined in respect to this new basis also comprises the way how to be derived from the the new base vectors. In result we have a chain of linear combinations going from the existing basis throughout every new basis being defined in respect to just the previous basis. We just link linear combinations together. If you substitute the respective basis vectors within the linear combinations by their own linear combination from their basis vectors you will see this relation.
Invert the basis change
So far we only observed the one way ticket of vector basis change. With the above example of our vector we transformed its coordinates from the basis
to the standard basis
. Now the question arises how to transform the coordinates in the opposite direction?
We put two vector basis in relation to each other by defining one basis in respect to the other basis. Remember: In our example above we defined the base vectors of basis in respect to the standard basis
. To transform coordinates in the opposite direction we have to create the opposite relation between the two basis. For our example we have to define the standard basis
in respect to the basis
. This yields two vectors again. Their coordinates are the coefficients in the linear combination with the basis vectors of
.
Taking our above example we have to define the standard basis in respect to the basis
. As we derived
by rotating the standard basis 90° counter-clock-wise we can easily give the coordinates for both base vectors of
in respect to
.
These two vectors represent the coordinate-transformed standard basis and we denote it by .
Note that the coordinates of in respect to the standard basis
given by
are still valid for the transformed basis
as we only described this basis in respect to another basis. If we now take the coordinates of
in respect to
and linearly combine them with the basis
we should obtain the coordinates of
in respect to
that we defined at the very beginning above.
Switch to matrix notation
The linear combination we use to represent a vector through its basis can be notated more compactly by the use of a matrix. Therefore you take all base vectors of the vector basis and write them as columns into a matrix. This matrix is often denoted as the change of basis matrix. An example: We have the vector basis defined in respect to the standard basis
and a vector
defined in respect to
.
,
The change of basis matrix would be:
The linear combination is done as matrix multiplication by the use of .
If you put in the coordinates of in respect to
from above you get:
Note that every linear combination can be expressed by the use of such a matrix where the combined vectors are the columns of the matrix and the multiplied column vector contains the coefficients.
The matrix notation gives us a more intuitive way to invert the change of basis. Just take the inverse matrix of and multiply it with coordinates defined in vector basis
.
Geometric interpretation
My domain is 3d computer graphics. That’s why I am always dealing with vectors as arrows in my mind. In fact you could also deal with vectors in a pure mathematical sense and just treat them as tuples defined in a linear space. But often the visual illustration of vector concerns is quite helpful to clarify things.
The image above shows a vector being defined in respect to a basis
on the left side. The green lines illustrate the coordinates of
in respect to the base vectors
. On the right side we have
defined in respect to a rotated basis
. We observe how the coordinates of
in respect to
on the right side are different from its coordinates in respect to
on the left side.
Note that we deal with vectors here rather than points. If we consider orthogonal vector basis only, then a vector basis change means geometrically that the standard vector basis gets rotated or scaled to derive a different basis. We cannot move the vector basis away from the origin. You can consider any vector basis to be located at the same origin.
Learn more: Video tutorial series on vector basis change.
Author: Michael Keutel | 06.05.2015
Incredibly helpful article! Thanks for writing this! 🙂