HDFS-16320. Datanode retrieve slownode information from NameNode #3654
|
🎊 +1 overall
This message was automatically generated. |
|
@symious Thanks. |
|
@aajisaka @Hexiaoqiao @jojochuang Would you give any advice? |
|
@ferhui Thank you for the review. I have added a document to better illustrate the idea: https://docs.google.com/document/d/10-qrEJ6n-wVCSKlJXsykvKzcS3UBOi6EkuKFm4Z-k40/edit?usp=sharing |
|
@symious Thanks for your document, I think it's a good solution. |
|
@symious Thanks for submitting the PR and sharing your document. I think it's a good solution too.
I'd like to clarify "the clients already using the slownode". Does it mean the case where the DataNode is not a slownode when the client starts writing the file, but becomes one while the client is still writing? Does that happen often? |
|
@tasanuma Thanks for the review.
Yes. Normally it shouldn't happen very often if we enable the config, but the consequences may be unacceptable. In our cluster, a client complained that writing a file took 1.5 hours, which is not acceptable, especially for a streaming job. |
|
@symious Thank you for your reply. That definitely should be fixed. And getting such information from the datanodes makes sense to me. |
|
@symious Please fix the checkstyle issues. If there are no objections, you can continue with the follow-up work mentioned in your document. |
|
@ferhui Sure. I have updated the patch. Please take a look. |
|
💔 -1 overall
This message was automatically generated. |
|
🎊 +1 overall
This message was automatically generated. |
|
@ferhui @tasanuma @Hexiaoqiao Thank you for the review. The follow-up ticket is https://issues.apache.org/jira/browse/HDFS-16348. Please help review it. |
Description of PR
Currently, slownode information is reported by the reportingNode and stored in the NameNode.
This ticket lets the slownode itself retrieve that information from the NameNode, so that it can take further performance-improvement actions based on it.
Jira ticket: https://issues.apache.org/jira/browse/HDFS-16320
Document: https://docs.google.com/document/d/10-qrEJ6n-wVCSKlJXsykvKzcS3UBOi6EkuKFm4Z-k40/edit?usp=sharing
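The flow in the description can be illustrated with a small, self-contained sketch. It is not the code from this patch: `SlowNodeRegistry`, `DataNodeSketch`, the `poll` method, and the `minReporters` threshold are all hypothetical names and assumptions made up for illustration. It only shows the idea that peers report a slow node to a central store, and the reported node later asks that store about its own status.

```java
import java.util.HashMap;
import java.util.HashSet;
import java.util.Map;
import java.util.Set;

/** Hypothetical stand-in for the NameNode-side store of slownode reports. */
class SlowNodeRegistry {
    // Maps a reported (slow) node id to the set of nodes that reported it.
    private final Map<String, Set<String>> reports = new HashMap<>();
    // Assumption: a node counts as slow once this many distinct peers report it.
    private final int minReporters;

    SlowNodeRegistry(int minReporters) {
        this.minReporters = minReporters;
    }

    /** Called on behalf of a reporting node that observed a slow peer. */
    void report(String reportingNode, String slowNode) {
        reports.computeIfAbsent(slowNode, k -> new HashSet<>()).add(reportingNode);
    }

    /** Answers the question a node asks about itself. */
    boolean isSlow(String nodeId) {
        Set<String> reporters = reports.get(nodeId);
        return reporters != null && reporters.size() >= minReporters;
    }
}

/** Hypothetical stand-in for a DataNode that learns its own slownode status. */
class DataNodeSketch {
    private final String id;
    private volatile boolean flaggedSlow;

    DataNodeSketch(String id) {
        this.id = id;
    }

    /** Assumption: the status is fetched on a periodic poll of the registry. */
    void poll(SlowNodeRegistry registry) {
        flaggedSlow = registry.isSlow(id);
    }

    /** Local view that later performance-improvement actions could consult. */
    boolean isFlaggedSlow() {
        return flaggedSlow;
    }
}
```

In this sketch the node keeps only a boolean copy of its status, so any local action (for example, refusing new writers) can check `isFlaggedSlow()` without another round trip; what the node actually does with the flag is left to the follow-up work.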
How was this patch tested?
unit test
For code changes:
LICENSE, LICENSE-binary, NOTICE-binary files?