You might be aware that the beautiful worlds that you see in your favourite video games, or the character in your favourite animated movies, are all made by drawing lots of little triangles to the screen.
Triangles are used as the primitive for the Rasterization algorithm because a triangle is the most basic polygon, any shape you wish to create, can be created with a combination of a finite number of triangles.
So then, the it follows that the algorithm to draw a triangle to the screen given it’s vertices is the most important part of the algorithm. In this blog post, I will be exploring 3 different ways to draw a triangle to the raster, i.e the screen which is made of discrete pixels.
And a tutorial about any algorithm is incomplete without some code, so here the GitHub repository for this blog.
The first thing to understand is what the screen really is. When I say screen, I refer to an array of pixel values.
An image of 256×256 resolution means that the image data is contained in an array of size 256x256x3. The size is multiplied by 3 because each pixel has a Red, Green and Blue color value.
So then, drawing a triangle to the screen means coloring in the pixels that are going to fall inside of the triangle.
This method loops over all the pixel on the screen and checks whether they fall inside of the triangle. The pixels that pass the test are then colored.
The way this test is conducted then, is using Barycentric Coordinates . I have a video up on my YouTube channels for the uninitiated, so go check that out to learn about barycentric coordinates.
Here is the utility function that calulcates the barycentric coordinates of a point for us
bool get_b_coords(Vec3D v1 , Vec3D v2 , Vec3D v3 , Vec3D point , Vec3D* output)
{
double area_total = get_tri_area(v1 , v2 , v3);
double area1 = get_tri_area(v2 , point , v3);
double area2 = get_tri_area(v3 , point , v1);
double area3 = get_tri_area(v1 , point , v2);
output->x = area1 / area_total;
output->y = area2 / area_total;
output->z = area3 / area_total;
// a little bit of tolerance needs to be allowed because floating point operations are not perfectly accurate.
if (output->x + output->y + output->z < 1.01 && output->x + output->y + output->z > 0.99)
{
return true;
} else
{
return false;
}
}
// This one calculates the area of a given triangle.
double get_tri_area(Vec3D v1 , Vec3D v2 , Vec3D v3)
{
Vec3D side12 = v2 - v1;
Vec3D side23 = v3 - v2;
Vec3D side31 = v1 - v3;
double height = ((side12*-1) - (side12 * -1).ProjectOnto(side23)).Magnitude();
return height * side23.Magnitude();
}
Now we call this function for every pixel on the screen. Any pixels that evaluate to true are coloured in
for (int i=0 ; i < height ; i++)
{
for (int j=0 ;j < width ; j++)
{
point.x = (j / (double)width) + pixel_width/2;
point.y = (i / (double)height) + pixel_height/2;
if (get_b_coords(v1 , v2 , v3 , point , &b_coords)==true)
{
// color the pixel red
}
else
{
// color the pixel blue
}
}
}
And that’s all ! It’s really that simple. I timed this method and on average for I got 500 miliseconds, or half a second. That number doesnt mean much by itself, but as you will see, this method is quite slow compared to the others.
But we can speed it up by a lot using a Bounding Box. Instead of looping over the entire image, the loop only goes over a rectangular area that we determine using the algorithm I will now be discussing. The rectangle is such created that the entire triangle will lie inside of the rectangle.
void get_bounding_box(Vec3D v1 , Vec3D v2 , Vec3D v3 , Vec3D* top_left , Vec3D bottom_right)
{
// Suppose min will return the minimum value among whatever is passed to it
top_left->x = min(v1.x , v2.x , v3.x);
top_left->y = min(v1.y , v2.y , v3.y);
bottom_right->x = max(v1.x , v2.x , v3.x);
bottom_right->y = max(v1.y , v2.y , v3.y);
}
Just get the lowest x and y coordinates and the highest x and y coordinates among the vertices of the triangle. This is what the bounding box looks like around the triangle.
Only the pixels coloured in purple or red need to be looped over. You can see that the purple rectangle contains the entire triangle. The runtime of this fucntion goes from 500 milliseconds to 125 milliseconds.
This method also works a bit like the barycentric coordinate method in the sense that we loop over every pixel in the entire image to test whether it falls inside the triangle.
However, we use a different test to determine whether a point is to be colored.
A triangle is constructed of 3 sides. These 3 side can be mathematically expressed as an equation. Using 2 points that the lie on the straight line, the following formula gives us the equation of the line.
\[ {(y - y_1) \over (y_2 - y_1)} = {(x - x_1) \over (x_2 - x_1)} \]
Once we have the equation, we turn it into an inequality using the > or the < sign. This shades a half of the image. Any pixels that lie within the shaded area will satisfy the equation.
We obtain the equations for the three sides of the triangle and then we use equation, or more formally, the in-eqations to check whether the point lies within the triangle. Only a point saisfying all three inequations will like inside the triangle.
Here is a line inequation class that contains data about the equation of the line and which side of the line is shaded.
class Line
{
public:
float a;
float b;
float c;
bool shaded_side; // if this is true, means the upper shaded_side is shaded
Line(Vec3D p1 , Vec3D p2 , Vec3D check_point)
: a(p1.y - p2.y),
b(p2.x - p1.x),
c(p2.y*p1.x - p1.y*p2.x)
{
if (a*check_point.x + b*check_point.y + c > 0) {
shaded_side = true;
} else {
shaded_side = false;
}
}
// check if a given point lines within the shaded region of the line
bool point_check(Vec3D point)
{
if (shaded_side == true)
{
if (a*point.x + b*point.y + c > 0) {
return true;
} else {
return false;
}
}
else
{
if (a*point.x + b*point.y + c < 0) {
return true;
} else {
return false;
}
}
}
};
Similar to the previous method, we loop over every pixel in the image and color in the pixels that are satsify all three of the line equations.
Vec3D v1(0.5f , 0.2f , 0.0f);
Vec3D v2(0.7f , 0.5f , 0.0f);
Vec3D v3(0.3f , 0.8f , 0.0f);
// These are the three line equations that we will check each pixel against
Line line1(v1 , v2 , v3);
Line line2(v2 , v3 , v1);
Line line3(v3 , v1 , v2);
The scanline method is inarguably the fastest and also the most confusing method of all three that we have discussed here.
This method is a little bit different from the previous methods because it does not have to loop over all the pixel on the screen. The scanline method can directly color in the correct pixel without having to check every pixel on the screen. I will cover it quickly here but you to read about it in more detail check out this tutorial.
In the scanline method we traverse down the y-axis, from the topmost to the bottom-most vertext of the triangle. For each y-coordinates, a bunch of continuous pixels are filled in the x-direction, like drawing a line for each y-coordinate in the triangle. Thus the name scanline.
Each of these lines starts at some x-coordinate and ends at some other x-coordinate. The most difficult part of implementing the algorithm is figuring out which x-coordinates to start and end at, such that only pixels inside of the triangle are filled.
The page I linked to before gives a very nice explanation, better than I could give here, so please do read that. And finally, here is the result.
Make sure to check out the GitHub repository for the code because life is incomplete without the implemenation, and stay tuned on my page for the many more tutorials to come. Thank you for reading !