Skip to content

timurgepard/Symphony-Saya

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 

Repository files navigation

Symphony-Saya (Heaven)

Nothing is possible without our Lord and Saviour Jesus Christ. But everything is possible with Him. I was morally dying, addicted to ponrography and video-games. But He intervened, and gave me a new life. Reinfocement Learning to play with and University to support me.

This repository was created to support the 2024 draft paper. It is unification and simplification of Symphony-1.0, Symphony-2.0(2.1) and Symphony-3.0 (Draft) into single Symphony-Saya-UTD-5 version (Model-free Deterministic Algorithm)

Some ideas were dropped and some proven their worth were solidified:

⚙ No multi-agents/Without big ensemble of Critics/Model-free/Off-policy

  1. Temporal (Immediate) Advantage ✅ (though UTD-5, batch size 128>>768)

  2. Fading Replay Buffer ✅

  3. Rectified Learnable Sine Wave Activation Function ✅

  4. Rectified Huber Symmetric and Asymmetric Loss Functions ✅

  5. Seamless Actor-Critic updates ✅

  6. Silent Dropouts ✅

  7. "movement is life" concept

  8. reduced objective to learn Bellman's sum of dumped reward's variance

  9. improve reward variance through immediate Advantage

Some modules were transferred from Pytorch nn.Module to Pytorch jit.ScriptModule.

Architecture: image

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages