Skip to content

Commit 691f566

Browse files
committed
aug 8 & 9
1 parent 7b06d3a commit 691f566

File tree

1 file changed

+4
-6
lines changed

1 file changed

+4
-6
lines changed

2023/index.html

+4-6
Original file line numberDiff line numberDiff line change
@@ -197,14 +197,12 @@ <h1>2023 Tea Time Talks</h1>
197197
</p>
198198
<br>
199199

200-
<p>
201-
<b>Samuel Robertson (August 8, 2023; CSC 3-33)</b><br>
202-
<i>Computational and Statistical Bounds for Linear Bandits</i><br>
203-
</p>
204-
<br>
205-
206200
<p>
207201
<b>Homayoon Farrahi (August 9, 2023; Amii)</b><br>
202+
<i>Automatic Meta-Descent</i>
203+
<p align="justify">
204+
Online continual-learning agents need to adapt to the nonstationarities in the environment. Since the environment can combine stationary and nonstationary components, each weight of the learning agent can benefit from its own adjustable step size. In meta-descent, both the weight and step-size values are learned using gradient descent to optimize the objective of the problem. We show that the meta-step size of an existing meta-descent algorithm has to be tuned to perform reasonably on different problems. We propose a novel meta-descent algorithm using the normalization technique of Adam and empirically compare it with Adam in a continuous-control reinforcement learning task.
205+
</p>
208206
</p>
209207
<br>
210208

0 commit comments

Comments
 (0)