Skip to content

Conversation

@siddhantsangwan
Copy link
Contributor

What changes were proposed in this pull request?

getContainerReplicationHealth() in ReplicationManager does not set pendingOps and maintenanceRedundancy when building ContainerCheckRequest. This leads to errors such as:

scm_1         | java.lang.NullPointerException
scm_1         | 	at java.base/java.util.Collections$UnmodifiableCollection.<init>(Collections.java:1030)
scm_1         | 	at java.base/java.util.Collections$UnmodifiableList.<init>(Collections.java:1303)
scm_1         | 	at java.base/java.util.Collections.unmodifiableList(Collections.java:1290)
scm_1         | 	at org.apache.hadoop.hdds.scm.container.replication.ContainerCheckRequest.<init>(ContainerCheckRequest.java:45)
scm_1         | 	at org.apache.hadoop.hdds.scm.container.replication.ContainerCheckRequest.<init>(ContainerCheckRequest.java:31)
scm_1         | 	at org.apache.hadoop.hdds.scm.container.replication.ContainerCheckRequest$Builder.build(ContainerCheckRequest.java:119)
scm_1         | 	at org.apache.hadoop.hdds.scm.container.replication.ReplicationManager.getContainerReplicationHealth(ReplicationManager.java:923)
scm_1         | 	at org.apache.hadoop.hdds.scm.container.balancer.MoveManager.move(MoveManager.java:253)
...

What is the link to the Apache JIRA

https://issues.apache.org/jira/browse/HDDS-8359

How was this patch tested?

Added UT.

… that it builds ContainerCheckRequest correctly
@adoroszlai adoroszlai merged commit c90c4cb into apache:master Apr 4, 2023
errose28 added a commit to errose28/ozone that referenced this pull request Apr 6, 2023
* master: (155 commits)
  update readme (apache#4535)
  HDDS-8374. Disable flaky unit test: TestContainerStateCounts
  HDDS-8016. updated the ozone doc for linked bucket and deletion async limitation (apache#4526)
  HDDS-8237. [Snapshot] loadDb() used by SstFiltering service creates extraneous directories. (apache#4446)
  HDDS-8035. Intermittent timeout in TestOzoneManagerHAWithData.testOMHAMetrics (apache#4362)
  HDDS-8039. Allow container inspector to run from ozone debug. (apache#4337)
  HDDS-8304. [Snapshot] Reduce flakiness in testSkipTrackingWithZeroSnapshot (apache#4487)
  HDDS-7974. [Snapshot] KeyDeletingService to be aware of Ozone snapshots (apache#4486)
  HDDS-8368. ReplicationManager: Create ContainerReplicaOp with correct target Datanode (apache#4532)
  HDDS-8358. Fix the space usage comparator in ContainerBalancerSelectionCriteria (apache#4527)
  HDDS-8359. ReplicationManager: Fix getContainerReplicationHealth() so that it builds ContainerCheckRequest correctly (apache#4528)
  HDDS-8361. Useless object in TestOzoneBlockTokenIdentifier (apache#4517)
  HDDS-8325. Consolidate and refine RocksDB metrics of services (apache#4506)
  HDDS-8135. Incorrect synchronization during certificate renewal in DefaultCertificateClient. (apache#4381)
  HDDS-8127. Exclude deleted containers from Recon container count (apache#4440)
  HDDS-8364. ReadReplicas may give wrong results with topology-aware read enabled (apache#4522)
  HDDS-8354. Avoid WARNING about ObjectEndpoint#get (apache#4515)
  HDDS-8324. DN data cache gets removed randomly asking for data from disk (apache#4499)
  HDDS-8291. Upgrade to Hadoop 3.3.5 (apache#4484)
  HDDS-8355. Mark TestOMRatisSnapshots#testInstallSnapshot as flaky
  ...
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants