Matrices and transforms

Introduction

Before reading this tutorial, it is advised to read the previous one about Vector math as this one is a direct continuation.

This tutorial will be about transformations and will cover a little about matrices (but not in-depth).

Transformations are most of the time applied as translation, rotation and scale so they will be considered as priority here.

Oriented coordinate system (OCS)

Imagine we have a spaceship somewhere in space. In Godot this is easy, just move the ship somewhere and rotate it:

../../_images/tutomat1.png

Ok, so in 2D this looks simple, a position and an angle for a rotation. But remember, we are grown ups here and don’t use angles (plus, angles are not even that useful when working in 3D).

We should realize that at some point, someone designed this spaceship. Be it for 2D in a drawing such as Paint.net, Gimp, Photoshop, etc. or in 3D through a 3D DCC tool such as Blender, Max, Maya, etc.

When it was designed, it was not rotated. It was designed in its own coordinate system.

../../_images/tutomat2.png

This means that the tip of the ship has a coordinate, the fin has another, etc. Be it in pixels (2D) or vertices (3D).

So, let’s recall again that the ship was somewhere in space:

../../_images/tutomat3.png

How did it get there? What moved it and rotated it from the place it was designed to its current position? The answer is… a transform, the ship was transformed from their original position to the new one. This allows the ship to be displayed where it is.

But transform is too generic of a term to describe this process. To solve this puzzle, we will superimpose the ship’s original design position at their current position:

../../_images/tutomat4.png

So, we can see that the “design space” has been transformed too. How can we best represent this transformation? Let’s use 3 vectors for this (in 2D), a unit vector pointing towards X positive, a unit vector pointing towards Y positive and a translation.

../../_images/tutomat5.png

Let’s call the 3 vectors “X”, “Y” and “Origin”, and let’s also superimpose them over the ship so it makes more sense:

../../_images/tutomat6.png

Ok, this is nicer, but it still does not make sense. What do X,Y and Origin have to do with how the ship got there?

Well, let’s take the point from top tip of the ship as reference:

../../_images/tutomat7.png

And let’s apply the following operation to it (and to all the points in the ship too, but we’ll track the top tip as our reference point):

GDScript

C#

  1. var new_pos = pos - origin
  1. var newPosition = pos - origin;

Doing this to the selected point will move it back to the center:

../../_images/tutomat8.png

This was expected, but then let’s do something more interesting. Use the dot product of X and the point, and add it to the dot product of Y and the point:

GDScript

C#

  1. var final_pos = Vector2(x.dot(new_pos), y.dot(new_pos))
  1. var finalPosition = new Vector2(x.Dot(newPosition), y.Dot(newPosition));

Then what we have is.. wait a minute, it’s the ship in its design position!

../../_images/tutomat9.png

How did this black magic happen? The ship was lost in space, and now it’s back home!

It might seem strange, but it does have plenty of logic. Remember, as we have seen in the Vector math, what happened is that the distance to X axis, and the distance to Y axis were computed. Calculating distance in a direction or plane was one of the uses for the dot product. This was enough to obtain back the design coordinates for every point in the ship.

So, what we have been working with so far (with X, Y and Origin) is an Oriented Coordinate System. X an Y are the Basis, and Origin is the offset.

Basis

We know what the Origin is. It’s where the 0,0 (origin) of the design coordinate system ended up after being transformed to a new position. This is why it’s called Origin, But in practice, it’s just an offset to the new position.

The Basis is more interesting. The basis is the direction of X and Y in the OCS from the new, transformed location. It tells what has changed, in either 2D or 3D. The Origin (offset) and Basis (direction) communicate “Hey, the original X and Y axes of your design are right here, pointing towards these directions.”

So, let’s change the representation of the basis. Instead of 2 vectors, let’s use a matrix.

../../_images/tutomat10.png

The vectors are up there in the matrix, horizontally. The next problem now is that.. what is this matrix thing? Well, we’ll assume you’ve never heard of a matrix.

Transforms in Godot

This tutorial will not explain matrix math (and their operations) in depth, only its practical use. There is plenty of material for that, which should be a lot simpler to understand after completing this tutorial. We’ll just explain how to use transforms.

Transform2D

Transform2D is a 3x2 matrix. It has 3 Vector2 elements and it’s used for 2D. The “X” axis is the element 0, “Y” axis is the element 1 and “Origin” is element 2. It’s not divided in basis/origin for convenience, due to its simplicity.

GDScript

C#

  1. var m = Transform2D()
  2. var x = m[0] # 'X'
  3. var y = m[1] # 'Y'
  4. var o = m[2] # 'Origin'
  1. var m = new Transform2D();
  2. Vector2 x = m[0]; // 'X'
  3. Vector2 y = m[1]; // 'Y'
  4. Vector2 o = m[2]; // 'Origin'

Most operations will be explained with this datatype (Transform2D), but the same logic applies to 3D.

Identity

An important transform is the “identity” matrix. This means:

  • ‘X’ Points right: Vector2(1,0)
  • ‘Y’ Points up (or down in pixels): Vector2(0,1)
  • ‘Origin’ is the origin Vector2(0,0)

../../_images/tutomat11.png

It’s easy to guess that an identity matrix is just a matrix that aligns the transform to its parent coordinate system. It’s an OCS that hasn’t been translated, rotated or scaled.

GDScript

C#

  1. # The Transform2D constructor will default to Identity
  2. var m = Transform2D()
  3. print(m)
  4. # prints: ((1, 0), (0, 1), (0, 0))
  1. // Due to technical limitations on structs in C# the default
  2. // constructor will contain zero values for all fields.
  3. var defaultTransform = new Transform2D();
  4. GD.Print(defaultTransform);
  5. // prints: ((0, 0), (0, 0), (0, 0))
  6. // Instead we can use the Identity property.
  7. var identityTransform = Transform2D.Identity;
  8. GD.Print(identityTransform);
  9. // prints: ((1, 0), (0, 1), (0, 0))

Operations

Rotation

Rotating Transform2D is done by using the “rotated” function:

GDScript

C#

  1. var m = Transform2D()
  2. m = m.rotated(PI/2) # rotate 90°
  1. var m = Transform2D.Identity;
  2. m = m.Rotated(Mathf.Pi / 2); // rotate 90°

../../_images/tutomat12.png

Translation

There are two ways to translate a Transform2D, the first one is moving the origin:

GDScript

C#

  1. # Move 2 units to the right
  2. var m = Transform2D()
  3. m = m.rotated(PI/2) # rotate 90°
  4. m[2] += Vector2(2,0)
  1. // Move 2 units to the right
  2. var m = Transform2D.Identity;
  3. m = m.Rotated(Mathf.Pi / 2); // rotate 90°
  4. m[2] += new Vector2(2, 0);

../../_images/tutomat13.png

This will always work in global coordinates.

If instead, translation is desired in local coordinates of the matrix (towards where the basis is oriented), there is the Transform2D.translated() method:

GDScript

C#

  1. # Move 2 units towards where the basis is oriented
  2. var m = Transform2D()
  3. m = m.rotated(PI/2) # rotate 90°
  4. m = m.translated( Vector2(2,0) )
  1. // Move 2 units towards where the basis is oriented
  2. var m = Transform2D.Identity;
  3. m = m.Rotated(Mathf.Pi / 2); // rotate 90°
  4. m = m.Translated(new Vector2(2, 0));

../../_images/tutomat14.png

You could also transform the global coordinates to local coordinates manually:

GDScript

C#

  1. var local_pos = m.xform_inv(point)
  1. var localPosition = m.XformInv(point);

But even better, there are helper functions for this as you can read in the next sections.

Local to global coordinates and vice versa

There are helper methods for converting between local and global coordinates.

There are Node2D.to_local() and Node2D.to_global() for 2D as well as Spatial.to_local() and Spatial.to_global() for 3D.

Scale

A matrix can be scaled too. Scaling will multiply the basis vectors by a vector (X vector by x component of the scale, Y vector by y component of the scale). It will leave the origin alone:

GDScript

C#

  1. # Make the basis twice its size.
  2. var m = Transform2D()
  3. m = m.scaled( Vector2(2,2) )
  1. // Make the basis twice its size.
  2. var m = Transform2D.Identity;
  3. m = m.Scaled(new Vector2(2, 2));

../../_images/tutomat15.png

These kind of operations in matrices are accumulative. It means every one starts relative to the previous one. For those who have been living on this planet long enough, a good reference of how transform works is this:

../../_images/tutomat16.png

A matrix is used similarly to a turtle. The turtle most likely had a matrix inside (and you are likely learning this many years after discovering Santa is not real).

Transform

Transform is the act of switching between coordinate systems. To convert a position (either 2D or 3D) from “designer” coordinate system to the OCS, the “xform” method is used.

GDScript

C#

  1. var new_pos = m.xform(pos)
  1. var newPosition = m.Xform(position);

And only for basis (no translation):

GDScript

C#

  1. var new_pos = m.basis_xform(pos)
  1. var newPosition = m.BasisXform(position);

Inverse transform

To do the opposite operation (what we did up there with the rocket), the “xform_inv” method is used:

GDScript

C#

  1. var new_pos = m.xform_inv(pos)
  1. var newPosition = m.XformInv(position);

Only for Basis:

GDScript

C#

  1. var new_pos = m.basis_xform_inv(pos)
  1. var newPosition = m.BasisXformInv(position);

Orthonormal matrices

However, if the matrix has been scaled (vectors are not unit length), or the basis vectors are not orthogonal (90°), the inverse transform will not work.

In other words, inverse transform is only valid in orthonormal matrices. For this, these cases an affine inverse must be computed.

The transform, or inverse transform of an identity matrix will return the position unchanged:

GDScript

C#

  1. # Does nothing, pos is unchanged
  2. pos = Transform2D().xform(pos)
  1. // Does nothing, position is unchanged
  2. position = Transform2D.Identity.Xform(position);

Affine inverse

The affine inverse is a matrix that does the inverse operation of another matrix, no matter if the matrix has scale or the axis vectors are not orthogonal. The affine inverse is calculated with the affine_inverse() method:

GDScript

C#

  1. var mi = m.affine_inverse()
  2. pos = m.xform(pos)
  3. pos = mi.xform(pos)
  4. # pos is unchanged
  1. var mi = m.AffineInverse();
  2. position = m.Xform(position);
  3. position = mi.Xform(position);
  4. // position is unchanged

If the matrix is orthonormal, then:

GDScript

C#

  1. # if m is orthonormal, then
  2. pos = mi.xform(pos)
  3. # is the same is
  4. pos = m.xform_inv(pos)
  1. // if m is orthonormal, then
  2. position = mi.Xform(position);
  3. // is the same is
  4. position = m.XformInv(position);

Matrix multiplication

Matrices can be multiplied. Multiplication of two matrices “chains” (concatenates) their transforms.

However, as per convention, multiplication takes place in reverse order.

Example:

GDScript

C#

  1. var m = more_transforms * some_transforms
  1. var m = moreTransforms * someTransforms;

To make it a little clearer, this:

GDScript

C#

  1. pos = transform1.xform(pos)
  2. pos = transform2.xform(pos)
  1. position = transform1.Xform(position);
  2. position = transform2.Xform(position);

Is the same as:

GDScript

C#

  1. # note the inverse order
  2. pos = (transform2 * transform1).xform(pos)
  1. // note the inverse order
  2. position = (transform2 * transform1).Xform(position);

However, this is not the same:

GDScript

C#

  1. # yields a different results
  2. pos = (transform1 * transform2).xform(pos)
  1. // yields a different results
  2. position = (transform1 * transform2).Xform(position);

Because in matrix math, A * B is not the same as B * A.

Multiplication by inverse

Multiplying a matrix by its inverse, results in identity:

GDScript

C#

  1. # No matter what A is, B will be identity
  2. var B = A.affine_inverse() * A
  1. // No matter what A is, B will be identity
  2. var B = A.AffineInverse() * A;

Multiplication by identity

Multiplying a matrix by identity, will result in the unchanged matrix:

GDScript

C#

  1. # B will be equal to A
  2. B = A * Transform2D()
  1. // B will be equal to A
  2. var B = A * Transform2D.Identity;

Matrix tips

When using a transform hierarchy, remember that matrix multiplication is reversed! To obtain the global transform for a hierarchy, do:

GDScript

C#

  1. var global_xform = parent_matrix * child_matrix
  1. var globalTransform = parentMatrix * childMatrix;

For 3 levels:

GDScript

C#

  1. var global_xform = gradparent_matrix * parent_matrix * child_matrix
  1. var globalTransform = grandparentMatrix * parentMatrix * childMatrix;

To make a matrix relative to the parent, use the affine inverse (or regular inverse for orthonormal matrices).

GDScript

C#

  1. # transform B from a global matrix to one local to A
  2. var B_local_to_A = A.affine_inverse() * B
  1. // transform B from a global matrix to one local to A
  2. var bLocalToA = A.AffineInverse() * B;

Revert it just like the example above:

GDScript

C#

  1. # transform back local B to global B
  2. B = A * B_local_to_A
  1. // transform back local B to global B
  2. B = A * bLocalToA;

OK, hopefully this should be enough! Let’s complete the tutorial by moving to 3D matrices.

Matrices & transforms in 3D

As mentioned before, for 3D, we deal with 3 Vector3 vectors for the rotation matrix, and an extra one for the origin.

Basis

Godot has a special type for a 3x3 matrix, named Basis. It can be used to represent a 3D rotation and scale. Sub vectors can be accessed as:

GDScript

C#

  1. var m = Basis()
  2. var x = m[0] # Vector3
  3. var y = m[1] # Vector3
  4. var z = m[2] # Vector3
  1. var m = new Basis();
  2. Vector3 x = m[0];
  3. Vector3 y = m[1];
  4. Vector3 z = m[2];

Or, alternatively as:

GDScript

C#

  1. var m = Basis()
  2. var x = m.x # Vector3
  3. var y = m.y # Vector3
  4. var z = m.z # Vector3
  1. var m = new Basis();
  2. Vector3 x = m.x;
  3. Vector3 y = m.y;
  4. Vector3 z = m.z;

The Identity Basis has the following values:

../../_images/tutomat17.png

And can be accessed like this:

GDScript

C#

  1. # The Basis constructor will default to Identity
  2. var m = Basis()
  3. print(m)
  4. # prints: ((1, 0, 0), (0, 1, 0), (0, 0, 1))
  1. // Due to technical limitations on structs in C# the default
  2. // constructor will contain zero values for all fields.
  3. var defaultBasis = new Basis();
  4. GD.Print(defaultBasis);
  5. // prints: ((0, 0, 0), (0, 0, 0), (0, 0, 0))
  6. // Instead we can use the Identity property.
  7. var identityBasis = Basis.Identity;
  8. GD.Print(identityBasis);;
  9. // prints: ((1, 0, 0), (0, 1, 0), (0, 0, 1))

Rotation in 3D

Rotation in 3D is more complex than in 2D (translation and scale are the same), because rotation is an implicit 2D operation. To rotate in 3D, an axis, must be picked. Rotation, then, happens around this axis.

The axis for the rotation must be a normal vector. As in, a vector that can point to any direction, but length must be one (1.0).

GDScript

C#

  1. #rotate in Y axis
  2. var m3 = Basis()
  3. m3 = m3.rotated( Vector3(0,1,0), PI/2 )
  1. // rotate in Y axis
  2. var m3 = Basis.Identity;
  3. m3 = m3.Rotated(new Vector3(0, 1, 0), Mathf.Pi / 2);

Transform

To add the final component to the mix, Godot provides the Transform type. Transform has two members:

Any 3D transform can be represented with Transform, and the separation of basis and origin makes it easier to work translation and rotation separately.

An example:

GDScript

C#

  1. var t = Transform()
  2. pos = t.xform(pos) # transform 3D position
  3. pos = t.basis.xform(pos) # (only rotate)
  4. pos = t.origin + pos # (only translate)
  1. var t = new Transform(Basis.Identity, Vector3.Zero);
  2. position = t.Xform(position); // transform 3D position
  3. position = t.basis.Xform(position); // (only rotate)
  4. position = t.origin + position; // (only translate)