rxin commented Dec 5, 2014

It's been reported that when the number of disks is large and the number of nodes is small, Netty network throughput is low compared with NIO. We suspect the problem is that, because connections are reused, only a small number of disks are utilized to serve shuffle files at any given point. This patch adds a new config parameter that specifies the number of concurrent connections between two peer nodes; it defaults to 2.
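
To make the idea concrete, here is a minimal sketch of a per-peer connection pool. It is not the code in this patch: the class `PeerConnectionPool`, the `connectionsPerPeer` parameter, and the `connector` function are hypothetical names chosen for illustration. The sketch assumes each peer gets a fixed number of slots, a slot is picked at random on every request, and a connection is created lazily with a double check around the lock so that concurrent callers do not open duplicates.

```java
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ThreadLocalRandom;
import java.util.concurrent.atomic.AtomicReferenceArray;
import java.util.function.Function;

// Hypothetical sketch, not the class added by this patch.
final class PeerConnectionPool<C> {

  // Per-peer state: one connection slot and one lock object per allowed connection.
  private static final class Slots<T> {
    final AtomicReferenceArray<T> clients;
    final Object[] locks;

    Slots(int size) {
      clients = new AtomicReferenceArray<>(size);
      locks = new Object[size];
      for (int i = 0; i < size; i++) {
        locks[i] = new Object();
      }
    }
  }

  // Assumed config value (the PR description mentions a default of 2).
  private final int connectionsPerPeer;
  // Assumed callback that opens a new connection to the given peer.
  private final Function<String, C> connector;
  private final ConcurrentHashMap<String, Slots<C>> pools = new ConcurrentHashMap<>();

  PeerConnectionPool(int connectionsPerPeer, Function<String, C> connector) {
    if (connectionsPerPeer < 1) {
      throw new IllegalArgumentException("connectionsPerPeer must be >= 1");
    }
    this.connectionsPerPeer = connectionsPerPeer;
    this.connector = connector;
  }

  // Returns one of up to connectionsPerPeer connections to the peer.
  C get(String peer) {
    Slots<C> slots = pools.computeIfAbsent(peer, p -> new Slots<>(connectionsPerPeer));
    int i = ThreadLocalRandom.current().nextInt(connectionsPerPeer);

    // First check, without the lock: the common case once the slot is filled.
    C client = slots.clients.get(i);
    if (client != null) {
      return client;
    }
    // Second check, under the slot's lock: only one thread opens the connection.
    synchronized (slots.locks[i]) {
      client = slots.clients.get(i);
      if (client == null) {
        client = connector.apply(peer);
        slots.clients.set(i, client);
      }
      return client;
    }
  }
}
```

Locking per slot rather than per pool means that opening one connection does not block callers that land on a slot that is already filled. A real pool would also need to detect and replace dead connections, which this sketch leaves out.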

SparkQA commented Dec 5, 2014

Test build #24193 has started for PR 3625 at commit 3e1306c.

  • This patch merges cleanly.

SparkQA commented Dec 5, 2014

Test build #24193 has finished for PR 3625 at commit 3e1306c.

  • This patch fails Spark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@AmplabJenkins

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/24193/

We can make this a private static class if we make this a constructor parameter.

2 spaces for indent

SparkQA commented Dec 5, 2014

Test build #24198 has started for PR 3625 at commit 9076b4a.

  • This patch merges cleanly.

aarondav commented Dec 5, 2014

Looks mostly good to me, with a few remaining synchronization issues. I'll take another long look after you address all comments. I'd really appreciate a test, though, if we can get one in: we really don't want to regress at this point, and we also really want to make sure we're fixing the issue.

rxin changed the title from "[SPARK-4740] [WIP] Create multiple concurrent connections between two peer nodes in Netty." to "[SPARK-4740] Create multiple concurrent connections between two peer nodes in Netty." on Dec 5, 2014

SparkQA commented Dec 6, 2014

Test build #24199 has started for PR 3625 at commit 41dfcb2.

  • This patch merges cleanly.

SparkQA commented Dec 6, 2014

Test build #24198 has finished for PR 3625 at commit 9076b4a.

  • This patch fails Spark unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@AmplabJenkins

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/24198/

SparkQA commented Dec 6, 2014

Test build #24200 has started for PR 3625 at commit 0fefabb.

  • This patch merges cleanly.

SparkQA commented Dec 6, 2014

Test build #24199 has finished for PR 3625 at commit 41dfcb2.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@AmplabJenkins

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/24199/

SparkQA commented Dec 6, 2014

Test build #24200 has finished for PR 3625 at commit 0fefabb.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@AmplabJenkins

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/24200/

Q: Can we use JavaUtils.closeQuietly(client) here?
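
For readers unfamiliar with the pattern behind this question, a "close quietly" helper closes a resource and reports, rather than propagates, any `IOException`, so cleanup code does not need its own try/catch. The sketch below is a hypothetical stand-in written for illustration: the class name `QuietClose` and the boolean return value are assumptions, and it is not the implementation of Spark's `JavaUtils.closeQuietly`.

```java
import java.io.Closeable;
import java.io.IOException;

// Hypothetical helper for illustration; not Spark's JavaUtils.
final class QuietClose {
  private QuietClose() {}

  // Closes the resource if it is non-null. Returns false if close() threw,
  // after reporting the failure, instead of propagating the exception.
  static boolean closeQuietly(Closeable closeable) {
    if (closeable == null) {
      return true;
    }
    try {
      closeable.close();
      return true;
    } catch (IOException e) {
      System.err.println("Failed to close resource: " + e.getMessage());
      return false;
    }
  }
}
```

The null check lets callers pass a field that may never have been initialized, which is common in error-handling paths.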

SparkQA commented Dec 6, 2014

Test build #24206 has started for PR 3625 at commit ad4241a.

  • This patch merges cleanly.

@AmplabJenkins

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/24205/

aarondav commented Dec 6, 2014

LGTM

rxin commented Dec 6, 2014

Jenkins, retest this please.

SparkQA commented Dec 6, 2014

Test build #24207 has started for PR 3625 at commit ad4241a.

  • This patch merges cleanly.

SparkQA commented Dec 6, 2014

Test build #24206 has finished for PR 3625 at commit ad4241a.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@AmplabJenkins

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/24206/

SparkQA commented Dec 6, 2014

Test build #24207 has finished for PR 3625 at commit ad4241a.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@AmplabJenkins

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/24207/

rxin commented Dec 10, 2014

I'm merging this into master and branch-1.2.

asfgit pushed a commit that referenced this pull request Dec 10, 2014
…nodes in Netty.

It's been reported that when the number of disks is large and the number of nodes is small, Netty network throughput is low compared with NIO. We suspect the problem is that only a small number of disks are utilized to serve shuffle files at any given point, due to connection reuse. This patch adds a new config parameter to specify the number of concurrent connections between two peer nodes, default to 2.

Author: Reynold Xin <[email protected]>

Closes #3625 from rxin/SPARK-4740 and squashes the following commits:

ad4241a [Reynold Xin] Updated javadoc.
f33c72b [Reynold Xin] Code review feedback.
0fefabb [Reynold Xin] Use double check in synchronization.
41dfcb2 [Reynold Xin] Added test case.
9076b4a [Reynold Xin] Fixed two NPEs.
3e1306c [Reynold Xin] Minor style fix.
4f21673 [Reynold Xin] [SPARK-4740] Create multiple concurrent connections between two peer nodes in Netty.

(cherry picked from commit 2b9b726)
Signed-off-by: Reynold Xin <[email protected]>

asfgit closed this in 2b9b726 Dec 10, 2014