Add nethack to sf_examples #289

BartekCupial · 2023-12-29T19:58:06Z

Overview

This PR introduces NetHack to Sample Factory.
The code is based on three repositories:

Key Contributions

Easy installation and experiments

By adding NetHack to examples in Sample Factory my main goal is to improve reproducibility and allow for easy experimentation with NetHack as I found that there were many issues with installation and experiments. For example when trying to reproduce experiments in D&D repository Dockerfile required fixing moolib library and Cmake.txt file. Additionally since D&D repo implemented APPO from scratch (implementation details in RL matter) I found that just my moving to SF I've managed to increase the score from 2k to about 2.8k.

Additional metrics

Sample Factory supports logging of additional policy stats if any are found in info["episode_extra_stats"]. I've added wrappers which log blstats and additional auxiliary scores. Look at sf_examples/nethack/utils/task_rewards.py].

`render_mode=rgb_array`

NLE natively doesn't support rgb_array. I've added rgb_array mode for rendering by using RenderCharImagesWithNumpyWrapperV2 wrapper introduced by https://github.com/Miffyli/nle-sample-factory-baseline.
By using rgb_array in enjoy we can save and examine episodes. Example: https://github.com/BartekCupial/sample-factory/assets/92169405/47884b73-beeb-4303-a72f-75d202aa87a8

Evaluation

NetHack can have very long episodes (100k env steps) and additionally since the environment is highly stochastic we usually need a lot of evaluation episodes to measure the policy performance (usually 1024). I highly recommend using recently introduced eval.py since using enjoy would be really long.

codecov-commenter · 2023-12-29T20:13:26Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Comparison is base (f69cf46) 77.96% compared to head (e97c18a) 77.96%.

❗ Your organization needs to install the Codecov GitHub app to enable full functionality.

Additional details and impacted files

@@           Coverage Diff           @@
##           master     #289   +/-   ##
=======================================
  Coverage   77.96%   77.96%           
=======================================
  Files         101      101           
  Lines        7759     7759           
=======================================
  Hits         6049     6049           
  Misses       1710     1710

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

klyuchnikova-ana

Great stuff!

A few minor comments, but overall looks super clean! 🔥

docs/09-environment-integrations/nethack.md

setup.py

sf_examples/nethack/README.md

BartekCupial · 2024-01-05T11:49:55Z

UPDATE:

Training results as well as comparison with Dungeons and Data APPO are present in docs docs/09-environment-integrations/nethack.md. I've also created model card in hugging face https://huggingface.co/LLParallax/sample_factory_human_monk (its also linked in docs).

klyuchnikova-ana

👍

BartekCupial and others added 9 commits December 28, 2023 14:48

add nethack examples

89b1253

update nle installation

09700b4

use nle tasks

822273c

handle rendering with nle

056390f

refactor wrappers, add render with rgb_array

5155218

render rgb_array

0640261

nethack integration md

c3b1461

Update nethack.md

9502585

pre-commit

4e47a07

A K added 2 commits December 30, 2023 20:41

Add conda install pybind11

5229e22

Minor pre-commit fixes

e6d7f34

klyuchnikova-ana approved these changes Dec 31, 2023

View reviewed changes

BartekCupial added 7 commits December 31, 2023 12:40

remove duplicate README for nethack

9ed63e8

rename render_utils to nethack_render_utils

47886b1

add info about python3.10

d9720e5

add report with training on human monk

2853da0

revert model

448c6c7

add help in nethack_params

640016a

finish docs, model card and evaluation

e97c18a

klyuchnikova-ana approved these changes Jan 8, 2024

View reviewed changes

precommit

f0862b2

klyuchnikova-ana merged commit d76dd09 into alex-petrenko:master Jan 9, 2024
9 of 10 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add nethack to sf_examples #289

Add nethack to sf_examples #289

BartekCupial commented Dec 29, 2023 •

edited

Loading

codecov-commenter commented Dec 29, 2023 •

edited

Loading

klyuchnikova-ana left a comment

BartekCupial commented Jan 5, 2024

klyuchnikova-ana left a comment

Add nethack to sf_examples #289

Add nethack to sf_examples #289

Conversation

BartekCupial commented Dec 29, 2023 • edited Loading

Overview

Key Contributions

Easy installation and experiments

Additional metrics

render_mode=rgb_array

Evaluation

codecov-commenter commented Dec 29, 2023 • edited Loading

Codecov Report

klyuchnikova-ana left a comment

Choose a reason for hiding this comment

BartekCupial commented Jan 5, 2024

UPDATE:

klyuchnikova-ana left a comment

Choose a reason for hiding this comment

BartekCupial commented Dec 29, 2023 •

edited

Loading

`render_mode=rgb_array`

codecov-commenter commented Dec 29, 2023 •

edited

Loading