@bjg2 bjg2 commented Feb 6, 2019

This PR improves stats gathering in AsyncSamplesOptimizer. Previously, samples_throughput and train_throughput reported bad values (mostly 0, with periodic peaks) instead of a steady graph showing throughput per second.
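For readers following along, here is a minimal sketch of one way to get a steady per-second throughput: accumulate counts between reports and divide by the elapsed time when the stats are read. This is not the code merged in this PR. The class name `ThroughputStats`, the `record` method, and the injectable `clock` parameter are hypothetical; only the name `get_mean_stats_and_reset` and the two stat keys come from the PR itself.

```python
import time


class ThroughputStats(object):
    """Hypothetical helper: accumulates step counts between reports."""

    def __init__(self, clock=time.time):
        # `clock` is injectable so the helper can be tested deterministically.
        self._clock = clock
        self._last_reset = clock()
        self._sampled = 0
        self._trained = 0

    def record(self, sampled=0, trained=0):
        # Called whenever steps are sampled or trained on.
        self._sampled += sampled
        self._trained += trained

    def get_mean_stats_and_reset(self):
        # Average over the whole interval since the last report, so the
        # value does not depend on when individual batches happened to land.
        now = self._clock()
        elapsed = max(now - self._last_reset, 1e-9)  # avoid division by zero
        stats = {
            "samples_throughput": self._sampled / elapsed,
            "train_throughput": self._trained / elapsed,
        }
        self._sampled = 0
        self._trained = 0
        self._last_reset = now
        return stats
```

Averaging over the full reporting interval is what smooths out the "mostly 0, with periodic peaks" pattern: a batch that arrives late in the interval still contributes to that interval's mean.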

@bjg2 bjg2 changed the title WINGMAN: Improved stats changes in AsyncSamplesOptimizer [WINGMAN -> rllib] Improved stats changes in AsyncSamplesOptimizer Feb 6, 2019
@bjg2 bjg2 changed the title [WINGMAN -> rllib] Improved stats changes in AsyncSamplesOptimizer [wingman -> rllib] Improved stats changes in AsyncSamplesOptimizer Feb 6, 2019
@bjg2 bjg2 mentioned this pull request Feb 6, 2019
@AmplabJenkins

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-PRB/11615/
Test FAILed.

assert len(self.remote_evaluators) > 0

# Stats
self.timers = {k: TimerStat() for k in ["train", "sample"]}
Contributor

Does it also make sense to remove the timers?

Contributor Author

Well, I think so. Replaced self.timers with self._optimizer_step_timer.

@AmplabJenkins

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-PRB/11639/
Test FAILed.

@AmplabJenkins

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-PRB/11642/
Test FAILed.

@ericl ericl left a comment

Just tried it out, looks good.

@AmplabJenkins

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-PRB/11657/
Test PASSed.

@AmplabJenkins

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-PRB/11689/
Test PASSed.

@AmplabJenkins

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-PRB/11694/
Test FAILed.

ericl commented Feb 8, 2019

Pushed some lint fixes. Note that the number of remote_evaluators must still be > 0, since the async samples optimizer has no fallback code for the 0-worker case like other optimizers do.

@AmplabJenkins

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-PRB/11699/
Test FAILed.

@ericl ericl self-assigned this Feb 8, 2019
"timing_breakdown": timing,
"learner_queue": self.learner.learner_queue_size.stats(),
}
}, **self.get_mean_stats_and_reset())
Contributor

This syntax is needed for Python 2.
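For context, the snippet above appears to merge two dicts with `dict(base, **extra)`. The dict-literal unpacking form `{**a, **b}` is only available from Python 3.5 onward, so it would be a syntax error on Python 2. A minimal sketch of the portable form follows; the helper name `merge_stats` and the example keys and values are made up for illustration.

```python
def merge_stats(base, extra):
    # dict(base, **extra) copies `base` and overlays `extra`.
    # It works on both Python 2 and 3, as long as the keys of
    # `extra` are strings (they are passed as keyword arguments).
    return dict(base, **extra)


# Illustrative values only, not taken from the PR.
base = {
    "timing_breakdown": {"optimizer_step_time_ms": 1.5},
    "learner_queue": {"size_mean": 3.0},
}
extra = {"samples_throughput": 120.0}

merged = merge_stats(base, extra)
```

The call returns a new dict and leaves `base` untouched, which matters if the base stats dict is reused elsewhere.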

@AmplabJenkins

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-PRB/11710/
Test FAILed.

ericl commented Feb 9, 2019

jenkins retest this please

@AmplabJenkins

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-PRB/11729/
Test PASSed.

@ericl ericl merged commit e703b9f into ray-project:master Feb 10, 2019

ericl commented Feb 10, 2019

Merged, thanks!

@bjg2 bjg2 deleted the optimizer-stats branch February 25, 2019 09:50