Implement optimality tightening #60

Kaixhin · 2016-12-11T21:21:15Z

Learning to Play in a Day: Faster Deep Reinforcement Learning by Optimality Tightening potentially speeds up Q-learning by an order of magnitude! Apparently not too hard to implement either.

petrosgk · 2016-12-15T21:51:50Z

I gave it a shot, however I am not sure how the discounted reward R is supposed to be used and I also need to check if future and past k-transitions are valid

https://github.com/petrosgk/Atari/tree/opt-tightening

Kaixhin · 2016-12-16T00:13:50Z

Awesome - I'll try and have a look soon or next week! Would you be able to test it to try and replicate one of the results from the paper?

I started on this myself as well, so will see how our implementations compare.

Aeroone · 2017-02-16T00:05:09Z

Hi, have you reproduced that optimality tightening results? I have tried some games based on tensorflow and openai gym but the results seem much worse than the papers' results. I am not sure whether I misunderstand something or miss some tricks in the paper. It seems that the paper doesn't include everything about their works.

DanielTea · 2017-02-16T00:16:23Z

Does anyone know wether they have published the source code for optimal tightening, from the paper?

Aeroone · 2017-02-16T00:48:31Z

No, they haven't published their code as far as I know. The tricks they use are not hard to implement but I can not still achieve their performance.

petrosgk · 2017-02-16T01:31:45Z

I have tried implementing optimality tightening (see earlier post) but the results I get are also much worse than the paper's.

Kaixhin · 2017-02-16T09:05:36Z

In my experience the smallest details in a paper can be key to reproducing results - and these may be missing or ambiguous. If anyone is reasonably confident in their implementation, you should try contacting one of the authors with specific questions.

ShibiHe · 2017-04-25T12:55:33Z

Hi guys,
I have released the code at https://github.com/ShibiHe/Q-Optimality-Tightening. Please have a look.

Best,
Shibi

Kaixhin added enhancement help wanted labels Dec 11, 2016

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement optimality tightening #60

Implement optimality tightening #60

Kaixhin commented Dec 11, 2016

petrosgk commented Dec 15, 2016 •

edited

Loading

Kaixhin commented Dec 16, 2016 •

edited

Loading

Aeroone commented Feb 16, 2017 •

edited

Loading

DanielTea commented Feb 16, 2017

Aeroone commented Feb 16, 2017 •

edited

Loading

petrosgk commented Feb 16, 2017

Kaixhin commented Feb 16, 2017

ShibiHe commented Apr 25, 2017

Implement optimality tightening #60

Implement optimality tightening #60

Comments

Kaixhin commented Dec 11, 2016

petrosgk commented Dec 15, 2016 • edited Loading

Kaixhin commented Dec 16, 2016 • edited Loading

Aeroone commented Feb 16, 2017 • edited Loading

DanielTea commented Feb 16, 2017

Aeroone commented Feb 16, 2017 • edited Loading

petrosgk commented Feb 16, 2017

Kaixhin commented Feb 16, 2017

ShibiHe commented Apr 25, 2017

petrosgk commented Dec 15, 2016 •

edited

Loading

Kaixhin commented Dec 16, 2016 •

edited

Loading

Aeroone commented Feb 16, 2017 •

edited

Loading

Aeroone commented Feb 16, 2017 •

edited

Loading