Releases: Stable-Baselines-Team/stable-baselines3-contrib
Releases · Stable-Baselines-Team/stable-baselines3-contrib
sb3-contrib v1.5.0: Bug fixes and newer gym version
sb3-contrib v1.4.0: Trust Region Policy Optimization (TRPO) and Augmented Random Search (ARS) algorithms
Breaking Changes:
- Dropped python 3.6 support
- Upgraded to Stable-Baselines3 >= 1.4.0
MaskablePPO
was updated to match latest SB3PPO
version (timeout handling and new method for the policy object)
New Features:
- Added
TRPO
(@cyprienc) - Added experimental support to train off-policy algorithms with multiple envs (note:
HerReplayBuffer
currently not supported) - Added Augmented Random Search (ARS) (@sgillen)
Others:
- Improve test coverage for
MaskablePPO
sb3-contrib v1.3.0 : PPO with invalid action masking
WARNING: This version will be the last one supporting Python 3.6 (end of life in Dec 2021).
We highly recommended you to upgrade to Python >= 3.7.
Breaking Changes:
- Removed
sde_net_arch
- Upgraded to Stable-Baselines3 >= 1.3.0
New Features:
sb3-contrib v1.2.0 : Train/Eval mode support
Breaking Changes:
- Upgraded to Stable-Baselines3 >= 1.2.0
Bug Fixes:
- QR-DQN and TQC updated so that their policies are switched between train and eval mode at the correct time (@ayeright)
Others:
- Fixed type annotation
- Added python 3.9 to CI
SB3 v1.1.0: dictionary observation support and timeout handling
Breaking Changes
- Added support for Dictionary observation spaces (cf. SB3 doc)
- Upgraded to Stable-Baselines3 >= 1.1.0
- Added proper handling of timeouts for off-policy algorithms (cf. SB3 doc)
- Updated usage of logger (cf. SB3 doc)
Bug Fixes
- Removed unused code in
TQC
Others
- SB3 docs and tests dependencies are no longer required for installing SB3 contrib
Documentation
- updated QR-DQN docs checkmark typo (@minhlong94)
Stable-Baselines3 v1.0
Blog post: https://araffin.github.io/post/sb3/
Breaking Changes
- Upgraded to Stable-Baselines3 v1.0
Bug Fixes
- Fixed a bug with
QR-DQN
predict method when usingdeterministic=False
with image space
v1.0rc1: Bug fix for QR-DQN (#21)
* Bug fix for QR-DQN * Upgrade SB3
QR-DQN, SB3 upgrade and time feature wrapper
Breaking Changes:
- Upgraded to Stable-Baselines3 >= 0.11.1
New Features:
- Added
TimeFeatureWrapper
to the wrappers - Added
QR-DQN
algorithm (@ku2482
_)
Bug Fixes:
- Fixed bug in
TQC
when saving/loading the policy only with non-default number of quantiles - Fixed bug in
QR-DQN
when calculating the target quantiles (@ku2482, @guyk1971)
Others:
- Updated
TQC
to match new SB3 version - Moved
quantile_huber_loss
tocommon/utils.py
(@ku2482)