Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Network is not learning when convolutional layers are applied. #14

Open
DBaller opened this issue Jan 9, 2022 · 2 comments
Open

Network is not learning when convolutional layers are applied. #14

DBaller opened this issue Jan 9, 2022 · 2 comments

Comments

@DBaller
Copy link

DBaller commented Jan 9, 2022

Hey Phil! Thanks for the course. I'm really enjoying it so far.

I've implemented the first real Deep Q Network, and it is not learning. Whenever I take off the convolutional layers and just use the fully connected layers and test it on the CartPole-v1 it is able to learn, however, whenever I test it on Pong or Breakout with the convolutional layers it does not work. I've gone through all of my code many times. I don't know what it is I messed up. I've checked the wrapper, the network, the agent, even the main loop. Could it possibly be my imports?

I'm not sure what's the best way to upload code. Let me know if there is a better way.
ExperienceReplay.txt
GymWrapper.txt
TrainAgent.txt
DeepQNetwork.txt

@DBaller
Copy link
Author

DBaller commented Jan 11, 2022

Hey Phil,

I did some more experimenting and am finding that my observations are returning values of 0. It is not an issue for the game pong, but it is for all other atari games.

BreakoutObservationOutput
BoxingObservationOutput
PongObservationOutput

Any reason as to why this may be?

@DBaller
Copy link
Author

DBaller commented Jan 11, 2022

The first 2 screen shots are of breakout and boxing, the last one (with the filled tensors) is from Pong

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant