🚀 Feature
I suggest expanding the system's recurrent components by introducing additional recurrent neural networks (RNNs) such as the vanilla RNN and GRU, and possibly some lesser-known architectures like the LMU and ctRNN. Additionally, I propose compatibility with RL algorithms beyond PPO, specifically A2C.
Motivation
The motivation is to enhance flexibility, allowing users to choose from a diverse set of recurrent networks and RL algorithms.
Pitch
Introduce different recurrent network options for different RL algorithms such as A2C, giving users a more comprehensive toolkit for designing and experimenting with recurrent components in RL.
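One way the pitch could look in practice is a small factory that maps a config string to a recurrent module. This is only an illustrative sketch, assuming PyTorch; the helper name `make_recurrent_layer` and the `"lstm"`/`"gru"`/`"rnn"` keys are hypothetical, not part of any existing library API.

```python
import torch
import torch.nn as nn

# Hypothetical sketch: choose the recurrent layer from a config string.
# The helper and its key names are illustrative assumptions.
def make_recurrent_layer(kind: str, input_size: int, hidden_size: int) -> nn.Module:
    layers = {
        "lstm": nn.LSTM,
        "gru": nn.GRU,
        "rnn": nn.RNN,  # "vanilla" RNN with a tanh nonlinearity
    }
    if kind not in layers:
        raise ValueError(f"unknown recurrent layer: {kind!r}")
    return layers[kind](input_size, hidden_size, batch_first=True)

# Usage: a GRU over a batch of observation sequences.
layer = make_recurrent_layer("gru", input_size=8, hidden_size=16)
obs_seq = torch.zeros(4, 10, 8)   # (batch, time, features)
out, hidden = layer(obs_seq)
print(out.shape)                  # torch.Size([4, 10, 16])
```

Exotic cells like the LMU or ctRNN would need custom modules, but as long as they follow the same `(output, state)` forward convention they could plug into the same factory.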
Alternatives
Focus on LstmPPO: While effective, this limits exploration and potentially misses out on the strengths of other RNNs.
Develop custom algorithms: This is resource-intensive and may not be as widely applicable as expanding existing options.
Additional context
I have already implemented most of these features in my personal repository and successfully utilized them in my research.
Checklist
I have checked that there is no similar issue in the repo
If I'm requesting a new feature, I have proposed alternatives
introducing various recurrent neural networks (RNNs) like vanilla RNN, GRU, and maybe some lesser-known networks like LMU and ctRNN.
I have already implemented most of these features in my personal repository and successfully utilized them in my research.
Do you have a benchmark to share?
And are you willing to implement and benchmark those alternatives? (I would start with GRU only at first.)
Adding more options will add complexity to an already complex algorithm, so we should do that only if it is clearly beneficial.
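The complexity concern above is concrete: in PyTorch, an LSTM carries its recurrent state as a `(hidden, cell)` tuple, while a GRU carries a single tensor, so any rollout buffer that stores states must normalize between the two. A minimal sketch of the difference (the layer sizes here are arbitrary):

```python
import torch
import torch.nn as nn

# LSTM state is a (h_n, c_n) tuple; GRU state is a single tensor.
# Code that stores and restores recurrent states must handle both shapes.
lstm = nn.LSTM(4, 8, batch_first=True)
gru = nn.GRU(4, 8, batch_first=True)

x = torch.zeros(2, 5, 4)      # (batch, time, features)

_, lstm_state = lstm(x)       # tuple: (h_n, c_n)
_, gru_state = gru(x)         # single tensor: h_n

print(type(lstm_state))       # <class 'tuple'>
print(gru_state.shape)        # torch.Size([1, 2, 8])
```

Starting with GRU first, as suggested, keeps this surface small: it is the only change needed on top of the existing LSTM handling, and it drops the cell-state bookkeeping rather than adding a new kind of it.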