Skip to content
View soonjune's full-sized avatar

Highlights

  • Pro

Block or report soonjune

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. pytorch-a2c-ppo-acktr-gail pytorch-a2c-ppo-acktr-gail Public

    Forked from ikostrikov/pytorch-a2c-ppo-acktr-gail

    PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKT…

    Python

  2. pytorch-soft-actor-critic pytorch-soft-actor-critic Public

    Forked from pranz24/pytorch-soft-actor-critic

    PyTorch implementation of soft actor critic

    Python

  3. recsim_ng-forked recsim_ng-forked Public

    Forked from google-research/recsim_ng

    RecSim NG: Toward Principled Uncertainty Modeling for Recommender Ecosystems

    Jupyter Notebook

  4. TempoRL TempoRL Public

    Forked from automl/TempoRL

    Python

  5. facebookresearch/RandomizedValueFunctions facebookresearch/RandomizedValueFunctions Public archive

    Randomized Value Functions via Multiplicative Normalizing Flows

    Python 18 10

  6. twoyak_back twoyak_back Public

    Ruby