Conversation

@AlexisWis
Added pole-balancing tutorial, which currently includes:
- Simulation renderer
- Physics simulation
- Classical Q-based agent (see the sketch below)
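
For orientation, here is a minimal sketch of what such a classical tabular Q-learning agent on a pole-balancing task could look like. The physics constants, discretisation bins, and hyperparameters are illustrative assumptions and are not taken from the tutorial itself.

```python
import numpy as np

# --- Minimal cart-pole physics (hypothetical stand-in for the tutorial's simulation) ---
GRAVITY, M_CART, M_POLE, L_POLE, DT = 9.81, 1.0, 0.1, 0.5, 0.02

def step(state, force):
    """Advance (x, x_dot, theta, theta_dot) by one Euler step under the given force."""
    x, x_dot, theta, theta_dot = state
    total_m = M_CART + M_POLE
    sin_t, cos_t = np.sin(theta), np.cos(theta)
    temp = (force + M_POLE * L_POLE * theta_dot**2 * sin_t) / total_m
    theta_acc = (GRAVITY * sin_t - cos_t * temp) / (
        L_POLE * (4.0 / 3.0 - M_POLE * cos_t**2 / total_m))
    x_acc = temp - M_POLE * L_POLE * theta_acc * cos_t / total_m
    return np.array([x + DT * x_dot, x_dot + DT * x_acc,
                     theta + DT * theta_dot, theta_dot + DT * theta_acc])

# --- Tabular Q-learning agent ---
BINS = (6, 6, 12, 12)                      # discretisation per state variable
LIMITS = np.array([2.4, 3.0, 0.21, 3.0])   # symmetric bounds used for binning
N_ACTIONS = 2                              # push left / push right
Q = np.zeros(BINS + (N_ACTIONS,))

def discretize(state):
    """Map the continuous state onto Q-table indices."""
    clipped = np.clip(state, -LIMITS, LIMITS)
    scaled = (clipped + LIMITS) / (2 * LIMITS)
    return tuple((scaled * (np.array(BINS) - 1)).astype(int))

alpha, gamma, epsilon = 0.1, 0.99, 0.1
rng = np.random.default_rng(0)

for episode in range(500):
    state = rng.uniform(-0.05, 0.05, size=4)   # start near upright
    for t in range(500):
        s = discretize(state)
        # epsilon-greedy action selection
        a = rng.integers(N_ACTIONS) if rng.random() < epsilon else int(np.argmax(Q[s]))
        next_state = step(state, 10.0 if a == 1 else -10.0)
        done = abs(next_state[0]) > 2.4 or abs(next_state[2]) > 0.21
        reward = 0.0 if done else 1.0
        s_next = discretize(next_state)
        # Q-learning update: move Q(s, a) towards the bootstrapped target
        target = reward + (0.0 if done else gamma * np.max(Q[s_next]))
        Q[s + (a,)] += alpha * (target - Q[s + (a,)])
        state = next_state
        if done:
            break
```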

@clinssen changed the title from "Added Polebalancing directory in doc/tutorials" to "Add pole balancing/reinforcement learning tutorial" (Dec 16, 2024)
@clinssen (Contributor)

See https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1003024:

We would like to point out the similarity of the TD-LTP learning rule to a reward-modulated spike-timing-dependent plasticity rule we call R-STDP [6], [16], [30]–[32]. In R-STDP, the effects of classic STDP [33]–[36] are stored into an exponentially decaying, medium term (time constant τ_e), synapse-specific memory, called an eligibility trace. This trace is only imprinted into the actual synaptic weights when a global, neuromodulatory success signal is sent to the synapses. In R-STDP, the neuromodulatory signal is the reward minus a baseline, i.e., R − b. It was shown [32] that for R-STDP to maximize reward, the baseline must precisely match the mean (or expected) reward. In this sense, R − b is a reward prediction error signal; a system to compute this signal is needed. Since the TD error δ is also a reward prediction error signal, it seems natural to use δ instead of R − b. This turns the reward-modulated learning rule R-STDP into a TD error-modulated TD-STDP rule (Figure 2A, bottom). In this form, TD-STDP is very similar to TD-LTP. The major difference between the two is the influence of post-before-pre spike pairings on the learning rule: while these are ignored in TD-LTP, they cause a negative contribution to the coincidence detection in TD-STDP.
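
To make the relationship concrete, here is a schematic Python sketch (not taken from the paper or the tutorial) of how the two gating signals differ; `stdp_window`, the time constants, and the learning rate are illustrative assumptions only.

```python
import numpy as np

# Schematic per-synapse update contrasting R-STDP and TD-STDP, assuming a fixed
# simulation step DT and a pair-based STDP kernel (illustrative values throughout).
DT, TAU_E = 1e-3, 0.5   # time step [s] and eligibility-trace time constant [s]

def stdp_window(dt, a_plus=0.01, a_minus=0.012, tau=0.02):
    """Classic pair-based STDP: potentiation for pre-before-post pairings, depression otherwise."""
    return a_plus * np.exp(-dt / tau) if dt > 0 else -a_minus * np.exp(dt / tau)

def update_eligibility(e, pre_post_dt):
    """Low-pass the STDP result into an exponentially decaying eligibility trace."""
    stdp = stdp_window(pre_post_dt) if pre_post_dt is not None else 0.0
    return e + DT * (-e / TAU_E) + stdp

def r_stdp_weight_update(w, e, reward, baseline, lr=0.1):
    """R-STDP: the trace is gated by reward minus a baseline (a reward prediction error)."""
    return w + lr * (reward - baseline) * e

def td_stdp_weight_update(w, e, td_error, lr=0.1):
    """TD-STDP: same structure, but the gate is the TD error delta."""
    return w + lr * td_error * e
```

The only structural difference between the two updates is the modulating factor, R − b versus δ, matching the quoted paragraph; TD-LTP would additionally ignore post-before-pre pairings in the trace.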
