Evaluation worker feature #192

alex-petrenko · 2022-07-27T05:36:35Z

This adds a new feature that allows real-time evaluation and visualization of agents during the training session.
The evaluation worker is supposed to run in a separate process from the training session and thus enables evaluation on a small number of agents (i.e. 1 or 64) which still leaves enough resources to train on thousands of agents.

This will be triggered in isaacgymenvs using a new flag. Alternatively player can be started with evaluation=True dir_to_monitor=/some/experiment/dir/containing/checkpoints to monitor an existing training session.

Player with evaluation=True will continuously monitor the experiment dir for new checkpoints, load them, and visualize a new policy.

alex-petrenko · 2022-07-27T05:37:06Z

@ViktorM FYI

Denys88 · 2022-07-27T05:42:38Z

@alex-petrenko will it work with IG on one gpu? as I remember I cannot create second IG on same gpu anyway?

alex-petrenko · 2022-07-28T00:12:09Z

@Denys88 it worked fine on my 1080Ti provided there's enough memory, although I only tried on my machine.
Is there a fundamental reason why two IG can't coexist?

alex-petrenko · 2022-08-22T04:13:19Z

@ViktorM this is the version we'll need to use for the demo

Denys88 · 2022-08-22T05:43:14Z

@alex-petrenko please let me know if you are going to add a few more changes or I can just merge it and refactor later.
Btw do you still need #195 this one?

alex-petrenko · 2022-08-22T23:23:26Z

@Denys88 I think it's solid and works reliably. We were able to use it with both IGE and Omniverse IsaacGym.
It should be rather safe to merge since it does not do anything unless the evaluation flag is turned on.

If you don't want the file monitor thing (watchdog) to be in the main list of dependencies, you can remove it from setup py and add a warning that it should be installed under the evaluation section in the code.

@Denys88 not sure about #195 - this is something @ArthurAllshire should know more about

ViktorM · 2022-08-23T16:20:47Z

@Denys88 is it good to go?

Denys88 · 2022-08-23T16:41:22Z

@ViktorM not yet. need to test envpool and ray vecenvs first. and update readme.
you can create a block with a new version.

Denys88 · 2022-08-01T20:13:01Z

rl_games/common/player.py

+            os.makedirs(self.eval_checkpoint_dir, exist_ok=True)
+
+            patterns = ["*.pth"]
+            from watchdog.observers import Observer


can we move this logic to the separate file?

Evaluation worker feature

fc85ec9

Denys88 reviewed Jan 15, 2023

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Evaluation worker feature #192

Evaluation worker feature #192

alex-petrenko commented Jul 27, 2022

alex-petrenko commented Jul 27, 2022

Denys88 commented Jul 27, 2022

alex-petrenko commented Jul 28, 2022 •

edited

Loading

alex-petrenko commented Aug 22, 2022

Denys88 commented Aug 22, 2022

alex-petrenko commented Aug 22, 2022 •

edited

Loading

ViktorM commented Aug 23, 2022

Denys88 commented Aug 23, 2022

Denys88 Aug 1, 2022

Evaluation worker feature #192

Are you sure you want to change the base?

Evaluation worker feature #192

Conversation

alex-petrenko commented Jul 27, 2022

alex-petrenko commented Jul 27, 2022

Denys88 commented Jul 27, 2022

alex-petrenko commented Jul 28, 2022 • edited Loading

alex-petrenko commented Aug 22, 2022

Denys88 commented Aug 22, 2022

alex-petrenko commented Aug 22, 2022 • edited Loading

ViktorM commented Aug 23, 2022

Denys88 commented Aug 23, 2022

Denys88 Aug 1, 2022

Choose a reason for hiding this comment

alex-petrenko commented Jul 28, 2022 •

edited

Loading

alex-petrenko commented Aug 22, 2022 •

edited

Loading