Skip to content

Commit

Permalink
Sam Morris completion news, and new student page for supervised stude…
Browse files Browse the repository at this point in the history
…nts, starting with Sam Morris
  • Loading branch information
timothy-wiley committed Nov 5, 2024
1 parent ad94cdc commit 9b8bef2
Show file tree
Hide file tree
Showing 7 changed files with 122 additions and 9 deletions.
3 changes: 2 additions & 1 deletion essai.html
Original file line number Diff line number Diff line change
Expand Up @@ -28,6 +28,7 @@ <h5 id="quals">PhD, BSc (Computer Science)</h5>
<li><a class="" href="robocup.html">RoboCup</a></li>
<li><a class="" href="publications.html">Publications</a></li>
<li><a class="" href="software.html">Software</a></li>
<li><a class="" href="students.html">Students</a></li>
<li><a class="selected" href="essai.html">ESSAI'23-24</a></li>
<li><a class="" href="media.html">Media</a></li>
<!--li><a class="" href="quicklinks.html">Quick Links</a></li-->
Expand Down Expand Up @@ -140,7 +141,7 @@ <h2>PDF Notes</h2>

<footer>
<div class="container">
<time datestamp="2024-10">October 2024</time>
<time datestamp="2024-11">November 2024</time>

</div>
</footer>
Expand Down
10 changes: 8 additions & 2 deletions index.html
Original file line number Diff line number Diff line change
Expand Up @@ -30,6 +30,7 @@ <h5 id="quals">PhD, BSc (Computer Science)</h5>
<li><a class="" href="robocup.html">RoboCup</a></li>
<li><a class="" href="publications.html">Publications</a></li>
<li><a class="" href="software.html">Software</a></li>
<li><a class="" href="students.html">Students</a></li>
<li><a class="" href="essai.html">ESSAI'23-24</a></li>
<li><a class="" href="media.html">Media</a></li>
<!--li><a class="" href="quicklinks.html">Quick Links</a></li-->
Expand Down Expand Up @@ -125,6 +126,11 @@ <h2>News</h2>
<div class="news-box">
<table>

<tr>
<td class="date">06 Nov 2024</td>
<td class="info">Sam Morris, my Master's by research Student with Michael Dann, has formally completed his thesis: "Policy Transfer for Deep Reinforcement Agents Using Game Entity Substitution - Applied to Infinite Mario", which is available on RMIT's research repository.</td>
</tr>

<tr>
<td class="date">28 Oct 2024</td>
<td class="info">My <a href="https://academics.rmit.edu.au/timothy-wiley">RMIT Staff Profile</a> is updated to RMIT's new public facing staff pages.</td>
Expand Down Expand Up @@ -157,7 +163,7 @@ <h2>News</h2>

<tr>
<td class="date">03 Oct 2024</td>
<td class="info">Samuel Ord, my PhD student, has submitted his thesis "Fixed-Wing UAV System for Aerial Tethered Delivery of Small to Medium Packages" for examination!</td>
<td class="info">Samuel Ord, my PhD student with Matthew Marino, has submitted his thesis "Fixed-Wing UAV System for Aerial Tethered Delivery of Small to Medium Packages" for examination!</td>
</tr>

<tr>
Expand Down Expand Up @@ -469,7 +475,7 @@ <h2>Former Roles and Activities</h2>

<footer>
<div class="container">
<time datestamp="2024-10">October 2024</time>
<time datestamp="2024-11">November 2024</time>
<br/>Flag Icons by GoSquared (http://www.gosquared.com/)
</div>
</footer>
Expand Down
3 changes: 2 additions & 1 deletion media.html
Original file line number Diff line number Diff line change
Expand Up @@ -28,6 +28,7 @@ <h5 id="quals">PhD, BSc (Computer Science)</h5>
<li><a class="" href="robocup.html">RoboCup</a></li>
<li><a class="" href="publications.html">Publications</a></li>
<li><a class="" href="software.html">Software</a></li>
<li><a class="" href="students.html">Students</a></li>
<li><a class="" href="essai.html">ESSAI'23-24</a></li>
<li><a class="selected" href="media.html">Media</a></li>
<!--li><a class="" href="quicklinks.html">Quick Links</a></li-->
Expand Down Expand Up @@ -132,7 +133,7 @@ <h2>Radio</h2>

<footer>
<div class="container">
<time datestamp="2024-10">October 2024</time>
<time datestamp="2024-11">November 2024</time>

</div>
</footer>
Expand Down
3 changes: 2 additions & 1 deletion publications.html
Original file line number Diff line number Diff line change
Expand Up @@ -28,6 +28,7 @@ <h5 id="quals">PhD, BSc (Computer Science)</h5>
<li><a class="" href="robocup.html">RoboCup</a></li>
<li><a class="selected" href="publications.html">Publications</a></li>
<li><a class="" href="software.html">Software</a></li>
<li><a class="" href="students.html">Students</a></li>
<li><a class="" href="essai.html">ESSAI'23-24</a></li>
<li><a class="" href="media.html">Media</a></li>
<!--li><a class="" href="quicklinks.html">Quick Links</a></li-->
Expand Down Expand Up @@ -347,7 +348,7 @@ <h2>Honours Thesis</h2>

<footer>
<div class="container">
<time datestamp="2024-10">October 2024</time>
<time datestamp="2024-11">November 2024</time>

</div>
</footer>
Expand Down
3 changes: 2 additions & 1 deletion robocup.html
Original file line number Diff line number Diff line change
Expand Up @@ -26,6 +26,7 @@ <h5 id="quals">PhD, BSc (Computer Science)</h5>
<li><a class="selected" href="robocup.html">RoboCup</a></li>
<li><a class="" href="publications.html">Publications</a></li>
<li><a class="" href="software.html">Software</a></li>
<li><a class="" href="students.html">Students</a></li>
<li><a class="" href="essai.html">ESSAI'23-24</a></li>
<li><a class="" href="media.html">Media</a></li>
<!--li><a class="" href="quicklinks.html">Quick Links</a></li-->
Expand Down Expand Up @@ -266,7 +267,7 @@ <h2>Former Teams and Results</h2>

<footer>
<div class="container">
<time datestamp="2024-10">October 2024</time>
<time datestamp="2024-11">November 2024</time>

</div>
</footer>
Expand Down
7 changes: 4 additions & 3 deletions software.html
Original file line number Diff line number Diff line change
Expand Up @@ -26,6 +26,7 @@ <h5 id="quals">PhD, BSc (Computer Science)</h5>
<li><a class="" href="robocup.html">RoboCup</a></li>
<li><a class="" href="publications.html">Publications</a></li>
<li><a class="selected" href="software.html">Software</a></li>
<li><a class="" href="students.html">Students</a></li>
<li><a class="" href="essai.html">ESSAI'23-24</a></li>
<li><a class="" href="media.html">Media</a></li>
<!--li><a class="" href="quicklinks.html">Quick Links</a></li-->
Expand All @@ -39,13 +40,13 @@ <h5 id="quals">PhD, BSc (Computer Science)</h5>
<div class="container">

<p>
This page list the publically available software and code releases related to my research and work.
This page list the publicly available software and code releases related to my research and work.
</p>

<h3>ASP-QSIM</h3>
<p>
<a href="https://github.com/timothy-wiley/aspqsim">ASP-QSIM</a> is an ASP implementation of the QSIM algorithm extended with qualitative rules. Released as part of the work of: Wiley, T. (2017). A Planning and Learning Hierarchy for the Online Acquisition of Robot Behaviours, School of Computer Science and Engineering, The University of New South Wales, Sydney, Australia. <br />
Publically available on GitHub at: <a href="https://github.com/timothy-wiley/aspqsim">https://github.com/timothy-wiley/aspqsim</a>
Publicly available on GitHub at: <a href="https://github.com/timothy-wiley/aspqsim">https://github.com/timothy-wiley/aspqsim</a>
</p>

<h3>RedbackBots Soccer Code Releases</h3>
Expand Down Expand Up @@ -82,7 +83,7 @@ <h3>RedbackBots GameSight Code Releases</h3>

<footer>
<div class="container">
<time datestamp="2024-10">October 2024</time>
<time datestamp="2024-11">November 2024</time>

</div>
</footer>
Expand Down
102 changes: 102 additions & 0 deletions students.html
Original file line number Diff line number Diff line change
@@ -0,0 +1,102 @@
<!DOCTYPE html>
<html lang="en">
<head>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8" />
<title>Dr Timothy Wiley - Students</title>

<link rel="stylesheet" type="text/css" href="styles/common.css" />

<link rel="stylesheet" type="text/css" href="styles/publications.css" />


<link rel="shortcut icon" href="images/logo.png">
</head>
<body>
<header>
<div class="container">
<h1>Dr Timothy Wiley</h1>
<h5 id="quals">PhD, BSc (Computer Science)</h5>
<div id="subinfo">Lecturer</div>
<div id="subinfo">School of Computing Technologies, STEM College</div>
<div id="subinfo">RMIT University</div>
</div>
</header>

<nav>
<ul>
<li><a class="" href="index.html">Home</a></li>
<li><a class="" href="robocup.html">RoboCup</a></li>
<li><a class="" href="publications.html">Publications</a></li>
<li><a class="" href="software.html">Software</a></li>
<li><a class="selected" href="students.html">Students</a></li>
<li><a class="" href="essai.html">ESSAI'23-24</a></li>
<li><a class="" href="media.html">Media</a></li>
<!--li><a class="" href="quicklinks.html">Quick Links</a></li-->
</ul>
</nav>

<div id="page">


<section>
<div class="container">

<p>
An overview of the students, current and former, whom I have supervised highlighting their projects and completions.
</p>

<h2>Completions (Masters)</h2>

<h3>Samuel Arthur Morris (2024)</h3>
<div class="citation">
<p>
<span class="title">Policy Transfer for Deep Reinforcement Agents Using Game Entity Substitution - Applied to Infinite Mario</span> <br />
School of Computing Technologies, STEM College, RMIT University, Melbourne, Australia. <br />
<span class="supervisor">Supervisors: Dr. Timothy Wiley, Dr. Michael Dann.</span>
</p>
<p>
Abstract <br />
Deep Reinforcement Learning (DRL) agents have shown impressive ability in
mastering computer games, but notoriously take a long time to learn. As an
agent progresses through a game, it will often encounter new states containing
previously unencountered game entities, e.g., new enemies. In such situations,
DRL agents typically struggle to generalise their prior knowledge to the new
entities, owing to differences in state and object representations. In particular,
even when new entities <em>behave</em> similarly to previously encountered ones, if they
<em>appear</em> to be different then DRL agents can take a long time to adapt. <br />
Policy transfer learning offers a promising approach for allowing DRL
agents to adapt their knowledge; however, establishing the connection between
the newly presented states (the target task) and previously encountered ones
(the source task) requires guidance from a domain expert. Guidance in the
form of externally constructed mapping of state-action pairings, must be continually
maintained in response to new game entity encounters. <br />
This thesis proposes an alternative approach, where policy transfer is accomplished
by leveraging an intermediate state transformation, removing the
need for manual mapping. Each entity is mapped to a unique entity ID, and
when a new game entity is encountered, a "substitution agent" strives to learn
a mapping between the new entity ID and a previously encountered one. For
example, if the new entity is a type of enemy, the substitution agent will ideally
learn to map the new ID to a previously encountered enemy’s ID, rather
than, say, the ID of a powerup item. Experimental results show that this
approach is effective, allowing for rapid improvement of end-of-episode scores
when encountering new entity representations in the game, <em>Infinite Mario</em>.
</p>
</div>

<!-- <h2>Completions (Honours)</h2> -->


</div>
</section>

</div>

<footer>
<div class="container">
<time datestamp="2024-11">November 2024</time>

</div>
</footer>
</body>
</html>

0 comments on commit 9b8bef2

Please sign in to comment.