Course Project

Your Course project can take one of two forms.

  1. Practice (preferred): An implementation of RL in some domain of your choice - ideally one that you are using for research or in some other class. In this case, please describe the domain and your initial plans on how you intend to implement learning. What will the states and actions be? What algorithm(s) do you expect will be most effective?
  2. Theory: A proposal, implementation and testing of an algorithmic modification to an RL algorithm presented in class. In this case, please describe the modification you propose to investigate and on what type of domain (possibly a toy domain) it is likely to show an improvement over things considered in the book.

You may try to build on some of the chapters or research papers you have read (or will read) in the class; you may try to reimplement something you've found interesting that others have done; you can try to do something that has never been done before; you may write new code from scratch; you may modify existing code. It's up to you!

Our lab (Pi-Star) also has a few high-impact project ideas that you may consider working on for your course project. If you are interested, please review the projects description and attend the TA's office hours.

You are required to work in teams of 2 and are strongly encouraged to work on all aspects together (i.e. pair programming rather than divide and conquer). Teams should only turn in one submission (only one team member shoudl upload each file). However, each person must turn in an independently written summary of each person's contribution to the final product.

You may build on existing work for this project and utilize existing code (your own or code found on the web), but you must give proper attribution to all existing work that you build on and make clear what your new contribution is. Any unattributed or uncited work that you use will be considered a breach of academic honesty and dealt with according to the course policy in the syllabus. Furthermore, you may not claim your own existing work as a new contribution. You may build on your own work, but it must be clearly cited as existing work and you must do new work for the class project.


Submission

Submit your reports through Canvas, on or before the specified deadline. Include the full name and UIN for each team member. Submit only single copy of your report (submitted by one of the two team members).









The project instructions are based of text by Dr. Peter Stone from UT-Austin (with his permission).