Add test: running a driver for twice. #2464

surehb · 2018-07-24T10:31:08Z

What do these changes do?

Add a test for the PR "Use different serialization context for each driver."

Related issue number

#2406

surehb · 2018-07-24T10:31:44Z

@robertnishihara, please take a look. Thanks.

AmplabJenkins · 2018-07-24T11:40:12Z

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-PRB/6795/
Test PASSed.

robertnishihara · 2018-07-25T03:28:16Z

test/multi_node_test.py

+        ray.shutdown()
+
+    def testRunDriverForTwice(self):
+        # We used to have issue 2165 and 2288: driver will hang when we run it


Can you include the full URL instead of just the issue number? This will make it easier to navigate to the issue.

robertnishihara · 2018-07-25T03:37:00Z

test/multi_node_test.py

+    def tearDown(self):
+        ray.shutdown()
+
+    def testRunDriverForTwice(self):


I'd suggest changing this test to not use Ray tune (since the implementation of Ray tune may change). Instead, I'd consider doing something like the following:

    driver1 = """
    ...
    def serializer(obj):
        return 1

    def deserializer(serialized_obj):
        assert serialized_obj == 1
        return 1

    class Foo(object):
        pass

    ray.register_custom_serializer(serializer=serializer, deserializer=deserializer)

    @ray.remote
    def f(x):
        assert x == 1
        return Foo()

    for _ in range(10):
        time.sleep(0.1)
        assert ray.get([f.remote() for _ in range(10)]) == 1
    """

In driver2 all of the 1s should be replaced by 2.

I'd make num_cpus=1 so that all tasks run on the same worker.

This is just an example, but I think it isolates the lower-level behavior a little more clearly. What do you think?

Also, please make sure that the test fails without the patch.

I would prefer to keep the existing approach, for these reasons:

This test covers whether we can run a driver twice successfully. I believe we will need this kind of test sooner or later.

Ray tune is the typical case that used to run into this problem, so it makes sense to use it to test the fix.

I understand that the current test is more like an end-to-end test, and what you propose is more like a unit test. I agree that it would be better for us to have complete unit tests. For example, we need to test: registering a type on a worker, then verifying it on the driver and the other workers; and registering a type on the driver, then verifying it on all workers. I can do that later.

Ok, I think having a unit test at some point would be valuable, but I agree that this test makes sense.
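
The lower-level behavior discussed in this thread, two drivers registering different custom serializers for a class with the same name, can be modeled without Ray. What follows is a toy sketch only: `SerializationRegistry`, `register`, `roundtrip`, and `run_driver` are hypothetical names rather than Ray APIs, and the "first registration wins" rule is an assumption made for illustration. It does not reflect how Ray implements the fix in #2406; it only shows why keying registrations by driver keeps a second driver from picking up the first driver's serializer.

```python
class SerializationRegistry:
    """Toy registry mapping (driver_id, class name) to a serializer pair."""

    def __init__(self, per_driver):
        self.per_driver = per_driver
        self._entries = {}

    def _key(self, driver_id, cls_name):
        # With per_driver=False, every driver shares one slot per class name.
        return (driver_id if self.per_driver else None, cls_name)

    def register(self, driver_id, cls_name, serializer, deserializer):
        # Assumption for illustration: the first registration for a key wins.
        self._entries.setdefault(
            self._key(driver_id, cls_name), (serializer, deserializer)
        )

    def roundtrip(self, driver_id, cls_name, obj):
        serializer, deserializer = self._entries[self._key(driver_id, cls_name)]
        return deserializer(serializer(obj))


def run_driver(registry, driver_id, marker):
    """Register a serializer that maps any Foo to `marker`, then round-trip."""
    registry.register(driver_id, "Foo", lambda obj: marker, lambda data: data)
    return registry.roundtrip(driver_id, "Foo", object())


shared = SerializationRegistry(per_driver=False)
isolated = SerializationRegistry(per_driver=True)

# The second driver expects 2, but the shared registry still holds driver1's pair.
shared_results = [run_driver(shared, "driver1", 1), run_driver(shared, "driver2", 2)]
# With one context per driver, each driver sees its own serializer.
isolated_results = [run_driver(isolated, "driver1", 1), run_driver(isolated, "driver2", 2)]
```

In this model the shared registry hands the second driver a stale result, which is the kind of cross-driver leakage a unit test along the lines proposed above would be looking for.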

robertnishihara · 2018-07-25T03:37:39Z

test/multi_node_test.py

+
+        for i in range(2):
+            out = run_string_as_driver(driver_script)
+            self.assertIn("success", out)


How long does this test take to run? If it's quick then maybe we want both.

It takes about 10 seconds to run. What do you mean by "both"?

I meant "this test" and "the unit test I proposed above."

Anyway, 10 seconds is on the long side. It's ok in this case, but in general we should keep tests as short as possible.
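
The loop in the diff above runs the same driver script twice and checks each run's output. A minimal stand-alone sketch of that pattern follows, assuming each run happens in a fresh Python interpreter. `run_script_as_subprocess` is a hypothetical stand-in for `run_string_as_driver`, whose implementation is not shown in this thread, and the one-line `driver_script` is a placeholder rather than the script used in the actual test.

```python
import subprocess
import sys


def run_script_as_subprocess(script):
    """Run `script` in a fresh Python interpreter and return its stdout."""
    result = subprocess.run(
        [sys.executable, "-c", script],
        stdout=subprocess.PIPE,
        stderr=subprocess.PIPE,
        universal_newlines=True,
        check=True,  # raise if the script exits with a non-zero status
    )
    return result.stdout


# Placeholder driver; the real test's driver script is not shown here.
driver_script = 'print("success")'

# Run the same script twice, as the test does, and check both outputs.
outputs = [run_script_as_subprocess(driver_script) for _ in range(2)]
for out in outputs:
    assert "success" in out
```

Running each driver in its own process is what makes the second run meaningful: any state left behind by the first run has to live outside the driver process for it to affect the second.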

surehb · 2018-07-25T07:22:10Z

Thank you @robertnishihara!

AmplabJenkins · 2018-07-25T07:54:28Z

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-PRB/6826/
Test FAILed.

surehb · 2018-07-26T02:55:49Z

@robertnishihara, could you help merge this? Thank you!

jinjiang and others added 3 commits July 24, 2018 11:47

merge from ray

13ad9b8

Revert "merge from ray"

166df51

This reverts commit 32b181e.

Add test for PR2406: "Use different serialization context for each dr…

25862ae

…iver."

robertnishihara reviewed Jul 25, 2018

Add issue link in the comment.

8e999f3

surehb force-pushed the RunDriverTwiceTest branch from b9d844b to 8e999f3 Compare July 25, 2018 06:28

robertnishihara approved these changes Jul 25, 2018

robertnishihara merged commit 29451cc into ray-project:master Jul 27, 2018

4 participants