
README asset #5

Open
jcwleo opened this issue Nov 17, 2018 · 18 comments
@jcwleo
Owner

jcwleo commented Nov 17, 2018

[image]

@jcwleo
Owner Author

jcwleo commented Nov 20, 2018

[image: 2018-11-20 11 13 35]

@jcwleo
Owner Author

jcwleo commented Nov 20, 2018

[image]

@jcwleo
Owner Author

jcwleo commented Jan 5, 2019

[image]

@kslazarev
Contributor

https://github.com/jcwleo/random-network-distillation-pytorch/blob/master/config.conf
Is this the latest config for getting results similar to those in the images above?

I see the last pull request is about normalization; would UseNorm = True improve reward_per_epi or the speed of convergence? And what about UseNoisyNet, when is it better to use it?

@jcwleo
Owner Author

jcwleo commented Jan 11, 2019

@kslazarev
Hi, I used that config, but with NumEnv set to 128 and MaxStepPerEpisode set to 4500.
The paper's authors did not report using advantage normalization or NoisyNet,
so I disabled those options.
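
For anyone reproducing this, below is a minimal sketch of applying those overrides to config.conf with Python's configparser. Only the keys NumEnv, MaxStepPerEpisode, UseNorm, and UseNoisyNet come from this thread; the file path, section handling, and everything else are assumptions rather than the repo's actual loading code.

```python
# Sketch only: write the settings discussed above into an INI-style config.conf.
# This is NOT the repository's own config loader; section handling is assumed.
import configparser

config = configparser.ConfigParser()
config.optionxform = str  # preserve key case such as 'NumEnv' when writing back
config.read('config.conf')

# Fall back to DEFAULT if the file keeps everything in its default section.
section = config.sections()[0] if config.sections() else 'DEFAULT'

config[section]['NumEnv'] = '128'              # parallel envs used for the plots above
config[section]['MaxStepPerEpisode'] = '4500'
config[section]['UseNorm'] = 'False'           # advantage normalization off, as in the paper
config[section]['UseNoisyNet'] = 'False'       # NoisyNet off, as in the paper

with open('config.conf', 'w') as f:
    config.write(f)
```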

@kslazarev
Contributor

kslazarev commented Jan 11, 2019

Result with the config in the master branch.

MontezumaRevengeNoFrameskip-v4
[image: 2019-01-11 15 05 01 GMT+03:00]

Right now I've set NumEnv = 128 and MaxStepPerEpisode = 4500.
I'll attach the result when I reach 1200-2000 updates.

@kslazarev
Contributor

@jcwleo I see a difference in the x-axis scale between the reward_per_epi and reward_per_rollout plots.
On your MontezumaRevengeNoFrameskip-v4 image they are 1.200k and 12.00k (a 10x difference),
but on my in-progress image they are 200 and 600 (a 3x difference). Do I need to change an additional option in the config?
[image: 2019-01-11 21 58 49 GMT+03:00]

@kslazarev
Contributor

kslazarev commented Jan 11, 2019

Or does the x-axis scale (global_update and sample_episode) depend on player survival/experience, so that the scales will match at later updates?

@jcwleo
Owner Author

jcwleo commented Jan 11, 2019

@kslazarev per_rollout and per_epi are not on the same scale. per_rollout counts global updates (each call to agent.train_model()), while per_epi reports the episode info of a single one of the parallel envs.
If one episode's total step count is 1024 and Num_step (the rollout size) is 128, the two x-axes differ by a factor of 8.
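
To make that ratio concrete, here is a tiny sketch of the arithmetic, assuming (as described above) that one global update consumes one rollout of Num_step transitions from each parallel env; the variable names are illustrative, not taken from the repo.

```python
# Illustrative sketch only: why global_update (per_rollout) and sample_episode
# (per_epi) advance at different rates, using the numbers from the comment above.
num_step = 128        # rollout size per env per global update (Num_step)
episode_len = 1024    # example total steps of one episode in a single env

# While one env finishes a single episode, the trainer performs this many
# global updates, so the per_rollout x-axis runs ~8x ahead of the per_epi axis.
updates_per_episode = episode_len // num_step
print(updates_per_episode)  # -> 8
```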

@kslazarev
Contributor

kslazarev commented Jan 11, 2019

@jcwleo Yes, correct. I have a few more small questions about the code. What is the appropriate way to ask them: a new issue for every question, or should I keep asking in this issue?

@jcwleo
Owner Author

jcwleo commented Jan 12, 2019

@kslazarev Please create a separate issue for each question. :)

@kslazarev
Contributor

kslazarev commented Jan 13, 2019

NumEnv = 128 and MaxStepPerEpisode = 4500
[image: 2019-01-13 7 40 16 GMT+03:00]

Looks similar to the README. With NumEnv = 128 I stopped the process because it started using swap.

@xiaioding

Hello, can you tell me how many GPUs you used and how long it took to see this result?

@kslazarev
Contributor

Hello. Not fast. I don't remember exactly; it was 1 or 2 NVIDIA 1080 Ti cards.

@xiaioding

@kslazarev Excuse me, I'm using one 3090 with 2 envs and have run for more than 2 hours, but the reward is still 0. Is this normal? I didn't load a pre-trained model.

@kslazarev
Contributor

That was 3 years ago. I can't help much; I don't remember exactly what could have caused the problem.

@xiaioding

@kslazarev Ok, thanks

@jcwleo
Owner Author

jcwleo commented Apr 17, 2023

@kslazarev
Thank you for answering on my behalf.
