Possible mistake in Deep Q Learning Space Invaders notebook #51

karolisjan · 2019-04-22T10:48:39Z

Hey. Shouldn't self.Q = tf.reduce_sum(tf.multiply(self.output, self.actions_)) in DQN class be self.Q = tf.reduce_sum(tf.multiply(self.output, self.actions_), axis=1), i.e. reduced along columns so that the output length of self.Q is equal to the batch size? If not then self.Q will be a scalar while self.target_Q will be a vector of batch size length.

The text was updated successfully, but these errors were encountered:

ali-ehsan · 2019-05-02T03:11:20Z

@karolisjan I agree.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Possible mistake in Deep Q Learning Space Invaders notebook #51

Possible mistake in Deep Q Learning Space Invaders notebook #51

karolisjan commented Apr 22, 2019

ali-ehsan commented May 2, 2019

Possible mistake in Deep Q Learning Space Invaders notebook #51

Possible mistake in Deep Q Learning Space Invaders notebook #51

Comments

karolisjan commented Apr 22, 2019

ali-ehsan commented May 2, 2019