Skip to content

Conversation

@duongkame
Copy link
Contributor

@duongkame duongkame commented May 3, 2024

What changes were proposed in this pull request?

See HDDS-10780.

We have NPE for the following scenarios. Let's say a pipeline p1 with dn1, dn2 and dn3.

Some chunks are written to p1 -> flush 1 -> watchCommit 1 timed out because dn3 hasn't caught up to the latest index. -> dn3 is removed from commitInfoMap.

Some other chunks are written to p1 -> flush 2 -> watchCommit 2 -> latest index has been committed to all dn1, dn2 and dn3 -> NPE because dn3 has been removed from commitInfoMap.

What is the link to the Apache JIRA

https://issues.apache.org/jira/browse/HDDS-10780

How was this patch tested?

CI.

@duongkame duongkame requested a review from szetszwo May 3, 2024 04:51
Copy link
Contributor

@szetszwo szetszwo left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

+1 the change looks good.

@szetszwo szetszwo merged commit e1d2d9c into apache:master May 3, 2024
jojochuang pushed a commit to jojochuang/ozone that referenced this pull request May 29, 2024
(cherry picked from commit e1d2d9c)

 Conflicts:
	hadoop-hdds/client/src/main/java/org/apache/hadoop/hdds/scm/XceiverClientRatis.java

Change-Id: Ifca0361f4e24bf6bac9b8ce806c3e6f1f501c15a
xichen01 pushed a commit to xichen01/ozone that referenced this pull request Oct 16, 2024
@duongkame duongkame deleted the HDDS-10780 branch April 12, 2025 00:10
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants