HDDS-13445. Make `ozone debug replicas chunk-info` stream json output between datanode calls #8914

Gargi-jais11 · 2025-08-07T08:22:54Z

What changes were proposed in this pull request?

`ozone debug replicas chunk-info` prints chunk information from all replicas for all chunks of all blocks within a file. It gathers all of this information from the datanodes, holds it in memory, and then prints it all at once. For large files in the GB range, this can leave a large amount of information buffered in client memory before anything is printed. It is better to print the information one block at a time, between successive getBlock calls to the datanodes. The JSON structure can remain the same.
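As a rough illustration of the idea, the sketch below writes one block's JSON as soon as that block's data has been fetched, instead of collecting everything first. It is not the code from this patch: a later commit in this PR says the patch uses Jackson, whereas this sketch hand-writes JSON with only the JDK so that it stays self-contained. The class name, `fetchChunkNames`, and the field names `blocks`, `blockId`, and `chunks` are all hypothetical.

```java
import java.io.IOException;
import java.io.OutputStreamWriter;
import java.io.Writer;
import java.nio.charset.StandardCharsets;
import java.util.List;
import java.util.function.LongFunction;

// Illustrative sketch only; names and JSON fields are assumptions,
// not the real ChunkKeyHandler output format.
class StreamingChunkInfoSketch {

  // Hypothetical stand-in for one getBlock call to a datanode.
  public static List<String> fetchChunkNames(long blockId) {
    return List.of("chunk_" + blockId + "_1", "chunk_" + blockId + "_2");
  }

  // Writes {"blocks":[...]} one block at a time. Only the chunks of the
  // block currently being written are held in memory.
  public static void streamBlocks(List<Long> blockIds,
      LongFunction<List<String>> fetcher, Writer out) throws IOException {
    out.write("{\"blocks\":[");
    boolean firstBlock = true;
    for (long blockId : blockIds) {
      // Fetch a single block, then emit it before fetching the next one.
      List<String> chunks = fetcher.apply(blockId);
      if (!firstBlock) {
        out.write(",");
      }
      firstBlock = false;
      out.write("{\"blockId\":" + blockId + ",\"chunks\":[");
      for (int i = 0; i < chunks.size(); i++) {
        if (i > 0) {
          out.write(",");
        }
        // Assumes chunk names contain no characters that need JSON escaping.
        out.write("\"" + chunks.get(i) + "\"");
      }
      out.write("]}");
      // Flush so the block is visible to the user before the next fetch.
      out.flush();
    }
    out.write("]}");
    out.flush();
  }

  public static void main(String[] args) throws IOException {
    Writer out = new OutputStreamWriter(System.out, StandardCharsets.UTF_8);
    streamBlocks(List.of(1L, 2L), StreamingChunkInfoSketch::fetchChunkNames, out);
  }
}
```

Because the enclosing object and array are opened before the loop and closed after it, the final document has the same shape as one built in memory and printed at once; only the timing of the writes changes.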

What is the link to the Apache JIRA

https://issues.apache.org/jira/browse/HDDS-13445

How was this patch tested?

Ran manually on a Docker cluster.

adoroszlai · 2025-08-07T08:33:31Z

@Gargi-jais11 the PR description does not seem to match the actual change, and the branch references HDDS-13445, not HDDS-12998

Gargi-jais11 · 2025-08-07T08:56:53Z

> @Gargi-jais11 the PR description does not seem to match the actual change, and the branch references HDDS-13445, not HDDS-12998

So sorry. I have updated the PR description.
Thank you for pointing it out.

...-ozone/tools/src/main/java/org/apache/hadoop/ozone/debug/replicas/chunk/ChunkKeyHandler.java

errose28

Thanks for the improvement. Mostly looks good. I tested it manually as well. Just two minor comments.

hadoop-ozone/tools/pom.xml

...-ozone/tools/src/main/java/org/apache/hadoop/ozone/debug/replicas/chunk/ChunkKeyHandler.java

sarvekshayr

Tested locally and verified the streaming output. Other than Ethan’s comments, overall LGTM.

errose28

LGTM

Tejaskriya · 2025-08-26T17:06:04Z

Thanks @Gargi-jais11 for the patch, and @errose28 @sarvekshayr for the reviews

Make stream json output between datanode calls

f7e7e96

Gargi-jais11 changed the title ~~HDDS-12998. Bring real container size in pb message when exporting/importing containers~~ HDDS-13445. Make ozone debug replicas chunk-info stream json output between datanode calls Aug 7, 2025

errose28 added the tools label Aug 7, 2025

errose28 reviewed Aug 7, 2025

...-ozone/tools/src/main/java/org/apache/hadoop/ozone/debug/replicas/chunk/ChunkKeyHandler.java (outdated)

use jackson to stream json output

03eaad3

Gargi-jais11 force-pushed the HDDS-13445 branch from 7dc7c36 to 03eaad3 August 11, 2025 10:16

Gargi-jais11 marked this pull request as ready for review August 11, 2025 11:57

Gargi-jais11 requested a review from errose28 August 11, 2025 11:57

errose28 reviewed Aug 19, 2025

hadoop-ozone/tools/pom.xml

...-ozone/tools/src/main/java/org/apache/hadoop/ozone/debug/replicas/chunk/ChunkKeyHandler.java

sarvekshayr reviewed Aug 20, 2025

Gargi Jaiswal added 2 commits August 20, 2025 12:15

removed jackson-core dependency and added a newline

8cf69c9

added dependency again

fc743c8

Gargi-jais11 requested a review from errose28 August 20, 2025 08:12

errose28 approved these changes Aug 22, 2025

Tejaskriya merged commit 9946ac6 into apache:master Aug 26, 2025
42 checks passed
