Due Wednesday 2/7 at 11:59 pm. You must work individually.
Create a program that reads in and renders a triangle mesh (of type .obj) to an image via software rasterization. You may use existing resources to load in the mesh and to write out an image. You must write your own rasterizer. In general, the required steps for the program are:
Image
class is used.Download the Assignment 1 base code, which has a mesh loader and an image writer. Compile and run the code by following the same set of commands as in Lab -1.
Note that we are using the following external libraries:
When the mesh is loaded, the vertex positions are stored in the posBuf
array as follows:
-----------------------------------------------------------------------------------------------
| x0 | y0 | z0 | x1 | y1 | z1 | x2 | y2 | z2 | x3 | y3 | z3 | x4 | y4 | z4 | x5 | y5 | z5 | ... <- posBuf array
-----------------------------------------------------------------------------------------------
| vertex 0 | vertex 1 | vertex 2 | vertex 3 | vertex 4 | vertex 5 |
| triangle 1 | triangle 2 |
Every triplet of three consecutive vertices in this array forms a triangle. In other words, every nine elements form the (x,y,z) coordinates of the three vertices of a triangle. (For now, you can ignore the normal and the texture coords.) Example mesh files are included in the base code. In addition, there are numerous OBJ meshes on the web. For grading purposes, your program will be run using the provided Stanford Bunny and Utah Teapot.
Ultimately you will want each triangle to be represented in a C/C++ structure/class, with 3 vertices and a color per vertex. In addition, your triangle data should include a 2D bounding box, which will represent the triangle’s extent in image coordinates.
Add a command line argument to accept the following command line arguments.
For example, your program should be able to be run as follows:
> ./A1 ../resources/bunny.obj output.png 512 512 1
(In Xcode, the \(1\)st argument should be ../../resources/bunny.obj
.) Add error checking to specify the required command line arguments if an incorrect number is given. Your program should not dump core if no input file is specified, or fail without an error message! Follow the golden rule; treat your user/grader/instructor the way you’d like to be treated as a user/grader/instructor.
Write code to convert each 3D coordinates into 2D image coordinates. Assume the camera is at the origin looking down the negative z axis. Make sure the object completely fills the image without any distortion. To do so, you need to compute the scale and translation factors as we discussed in class. Some tips for starting out:
RANDOM_COLORS
. For the i
th triangle, the color should be RANDOM_COLORS[i%7]
.tri.obj
, which contains a single triangle.First, write out the bounding box, rather than the triangles, to the image. If you do this with the provided tri.obj
, sphere.obj
, teapot.obj
, and bunny.obj
, you should see blocky images like below. The 0
th triangle should use RANDOM_COLORS[0]
, the 1
st triangle should use RANDOM_COLORS[1]
, etc.
The object must take up the whole image, be centered, and be undistorted (not stretched). Make sure you test nonuniform window sizes. As shown below, the aspect ratio of the object should be preserved no matter what the image size is, and the object should fill out the image.
Note: The output image depends on the order in which the triangles are processed, so your resulting output may not exactly match the reference images. (For example, the red triangles in the teapot may form a line from bottom-left to top-right.) Once Z-Buffering is implemented in Task 5, the results should match exactly. For now, just make sure that the rectangles form a similar pattern as in the reference images.
Once the bounding boxes are displayed correctly, add the barycentric test to write out the triangles as in (optional) Lab 2. This should be the output when the “task” command line argument is set to 2
. You should not see any gaps between the triangles. Hint: You may need a small “epsilon” when checking the barycentric coordinates to deal with floating point errors. In other words, rather than checking to see if a barycentric coord is greater than exactly zero, check to see if it is greater than a small negative value. You may “borrow” some code for this from the web, but if you do, make sure to cite the source in your code and README.
Depending on the order in which triangles are processed, the diagonals in the teapot render may not match the image above. (The image above shows the back surface of the teapot.) You can ignore this difference for now, since Z-buffering will fix this issue later.
Here is another image of a bunny and a teapot, from Alice in Wonderland [Wikimedia].
Instead of using random per-triangle colors, use random per-vertex colors. For each pixel inside each triangle, you need to interpolate the pixel’s color from the three vertices using the pixel’s barycentric coordinates.
With a single triangle (left image), the three vertices should be assigned RANDOM_COLORS[0]
, RANDOM_COLORS[1]
, and RANDOM_COLORS[2]
, defined at the top of main.cpp
. In the left image, these correspond to the blue, red, and yellow vertices, which are the left, right, and top vertices, respectively. In the general case with multiple triangles, the color of the vertex should correspond to the index of the vertex in the posBuf
array. See the illustration in the figure in the Setting Up Your Code section: vertex 0
should get RANDOM_COLORS[0]
, vertex 1
should get RANDOM_COLORS[1]
etc., with vertex 7
wrapping around to get RANDOM_COLORS[0]
again.
Be careful about going beyond the valid range of an unsigned char
: \([0, 255]\). Due to floating point calculations, the final value might be slightly under or over this range. Capping to this range should help in this case.
Optional information: Because of the way we are loading the mesh, the triangles do not share any vertices. For example, if we were to load a square consisting of four vertices and two triangles, we end up with six vertices – three for each of the triangles. In other words, we end up duplicating any shared vertices. Therefore, when we assign a color to each vertex, triangles having a vertex at a common position can have different colors assigned at this vertex position. For example, in the sphere image above, the center vertex is incident to eight triangles, and so it has been duplicated eight times, each time with a different random color. For further information check out indexed drawing.
Next, use the y-value to linearly interpolate two colors: blue (0 0 255)
and red (255 0 0)
. The color should vary smoothly from the top of the object to the bottom of the object, rather than from the top of the whole image to the bottom of the whole image. In other words, the top tip of the triangle should be fully red. This means that the bounding box of the object must be used to compute these colors.
Now that you have interpolated colors, implement z-buffer tests. First, create a data structure to support z-buffer tests. Your z-buffer should be a separate buffer from your image pixel buffer, and it should be the same size as your pixel buffer. The z-buffer contains the z-coordinate of each pixel, which is interpolated from the z-coordinates of the three vertices using the pixel’s barycentric coordinates.
The left image shows tri2.obj
rotated slightly to show how the two triangles intersect. Without z-buffering, the 2nd triangle writes over the 1st triangle, giving us the middle image. Once you have z-buffer implemented, both tips should show up properly, giving us the right image.
Now, use the z value of the pixel as the red color (use \(0\) for both blue and green). To do this, you have to map the z-value to the range 0 to 255. Just like in the previous task, the bounding box of the object must be used. The closest point on the object should be red, and the farthest point on the object should be black. If your z-buffer test is not working, you’ll see some strange results, since some pixels that are farther from the camera may be drawn on top of closer pixels. If your z-buffer is working, and if you’re interpolating the colors correctly, you should get the following results for the four obj files.
Make sure to pass your std::vector
by reference rather than by value. (E.g., void foo(std::vector<float> &bar)
) Otherwise, your program may become too slow. Since the Image
class has an std::vector
inside it, it should also be passed by reference.
When we load the obj file, we are also loading the vertex “normals” of the mesh, in addition to the vertex positions. These are stored in the norBuf
variable alongside the posBuf
variable that is already being used. Store this “normal” information (a 3D vector) in each vertex. Then, when coloring the pixels, interpolate the normals of the three vertices of the triangle to compute the normal of the pixel.
A normal is a 3D direction, so its (x,y,z) values can be anything in the range \([-1, 1]\). To display the normal as a color, we need to map these values to the range \([0, 255]\). The expression for this mapping is
255 * (0.5 * x + 0.5);
r = 255 * (0.5 * y + 0.5);
g = 255 * (0.5 * z + 0.5); b =
This coloring scheme is often used for debugging normals. For example, if the normal is [1,0,0]
then the color would be [255,127,127]
, which is fluorescent pink, and if the normal is [0,1,0]
then the color would be [127,255,127]
, which is fluorescent green. Note, for example, that the top of the bunny’s tail is greenish because the normals there are pointing up.
Apply a simple “lighting” calculation to compute the color of each pixel. After the normal \(\hat{n}\) has been computed at a pixel, compute the scalar \(c\) as follows: \[ c = \text{max}(\hat{l} \cdot \hat{n}, 0), \quad \hat{l} = \frac{1}{\sqrt{3}}\begin{pmatrix}1\\1\\1\end{pmatrix}. \] The “light” vector \(\hat{l}\) is the 3D direction of the light. Take the dot product between the normal and the light vector, and take the maximum of the result and zero. Without this zero, pixels that are facing away from the light will also be lit. The resulting scalar, \(c\), should be applied to RGB equally: \[ r = c, \quad g = c, \quad b = c. \]
Note that with the single triangle scene, we get an empty image because the triangle is facing away from the light.
Apply a rotation of \(\pi/4\) radians (\(45\) degrees) about the Y axis. The rotation should be applied immediately after loading the vertices, before fitting to the window. This ensures that the rotated object fits the window exactly as before. Notice that the teapot appears bigger than before, since the rotated teapot takes less horizontal space in window coordinates.
Transform both the vertex position and the normal using the equation below: \[ \begin{pmatrix} x\\ y\\ z \end{pmatrix} = \begin{pmatrix} \cos(\theta) & 0 & \sin(\theta)\\ 0 & 1 & 0\\ -\sin(\theta) & 0 & \cos(\theta)\\ \end{pmatrix} \begin{pmatrix} x_0\\ y_0\\ z_0 \end{pmatrix}. \] Here, \(\theta = \pi/4\) radians, and \((x_0, y_0, z_0)\) are the vertex coordinates loaded from the obj file. For each vertex, set \((x_0, y_0, z_0)\) to the vertex position, multiply by the rotation matrix, and store the resulting vector back into the position buffer. Then repeat with the normal: set \((x_0, y_0, z_0)\) to the vertex normal, multiply by the rotation matrix, and store the resulting vector back into the normal buffer.
main()
, and remember to pass big data by reference.Total: 100 points
Failing to follow these points may decrease your “general execution” score.
On Linux/Mac, make sure that your code compiles and runs by typing:
> mkdir build
> cd build
> cmake ..
> make
> ./A1 <ARGUMENTS>
If you’re on Windows, make sure that you can build your code using the same procedure as in Lab -1.
README.txt
or README.md
) that includes:
src/
, resources/
, CMakeLists.txt
, and your README file.(*.~)
(*.o)
(.vs)
(.git)
UIN.zip
(e.g., 12345678.zip
).UIN/
(e.g. 12345678/
).src/
, CMakeLists.txt
, etc..zip
format (not .gz
, .7z
, .rar
, etc.).