Skip to content

Conversation

@slfan1989
Copy link
Contributor

@slfan1989 slfan1989 commented Sep 29, 2024

What changes were proposed in this pull request?

Background

During our use of deletion, I noticed that it can be very slow, especially after we switched to the EC policy.

Our Ozone01 cluster currently has about 1K machines. Initially, we chose to use a Ratis-3Replica strategy, but for cost
considerations, we gradually switched to the EC-6-3 strategy in July.

The following chart shows the deletion speed for Ratis-3Replica .

image

The following chart shows the deletion speed for EC-6-3.

image

By reviewing the code and analyzing the logs, we found that the following situation can cause deletion to be very slow. We will illustrate this with an example.

Example

We want to delete data from an EC container with ContainerId = 1000. Since it is EC-6-3, there are 9 replicas (DN1, DN2, DN3, ... DN9).

Before deletion, we first select a batch of DNs; at this time, we may only select DN1 to DN6. We then send the deletion command to these 6 DNs, and the command executes normally, successfully deleting 6 blocks. However, if DN7 to DN9 are not selected, our deletion process will get stuck.

I came up with a possible solution to eliminate this stuck situation. We require that all replicas of the container to be deleted must be present in the selected DN list simultaneously. Otherwise, we will skip that container.

private void getTransaction(DeletedBlocksTransaction tx,
      DatanodeDeletedBlockTransactions transactions,
      Set<DatanodeDetails> dnList, Set<ContainerReplica> replicas,
      Map<UUID, Map<Long, CmdStatus>> commandStatus) {
    DeletedBlocksTransaction updatedTxn =
        DeletedBlocksTransaction.newBuilder(tx)
            .setCount(transactionStatusManager.getOrDefaultRetryCount(
              tx.getTxID(), 0))
            .build();
    
     // Requiring that replicas must be present in the DN list simultaneously ensures that the deletion commands for all 
     // replicas of the same container can be issued at once, avoiding situations where some replicas of the container are 
     // deleted while others are not.
    for (ContainerReplica replica : replicas) {
      DatanodeDetails datanodeDetails = replica.getDatanodeDetails();
      if (!dnList.contains(datanodeDetails)) {
        return;
      }
    }

    for (ContainerReplica replica : replicas) {
      DatanodeDetails details = replica.getDatanodeDetails();
      if (!dnList.contains(details)) {
        continue;
      }
      if (!transactionStatusManager.isDuplication(
          details, updatedTxn.getTxID(), commandStatus)) {
        transactions.addTransactionToDN(details.getUuid(), updatedTxn);
      }
    }
  }

Internal deployment results.

We rolled out this improvement internally in the SCM around 7 PM on September 26th, and we observed a significant enhancement in deletion efficiency, with 50 million blocks being fully processed within 5 hours.

The core aspect of this improvement is to ensure that all DNs within the same container receive the delete command simultaneously. When they send their ACKs, they can reach the SCM at approximately the same time, which facilitates the confirmation of block deletions.

image

What is the link to the Apache JIRA

JIRA: HDDS-11498. Improve SCM deletion efficiency.

How was this patch tested?

Production environment validation.

@slfan1989 slfan1989 marked this pull request as ready for review September 29, 2024 12:41
@whbing whbing requested a review from xichen01 September 29, 2024 13:57
@slfan1989
Copy link
Contributor Author

@adoroszlai @xichen01 Could you help review this PR? Thank you very much! Based on actual usage, it indeed improves the overall deletion speed. I also made some changes to the logs and removed a piece of code that will not be called.

Copy link
Contributor

@xichen01 xichen01 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@slfan1989 Thanks for your patch, some comments for you refer.

@adoroszlai adoroszlai requested a review from sodonnel September 30, 2024 13:06
@slfan1989
Copy link
Contributor Author

@sodonnel Could you spare some time to help review this PR? Thank you very much! This PR improves the deletion efficiency of SCM to some extent.

@adoroszlai adoroszlai requested a review from sumitagrawl October 3, 2024 09:54
@slfan1989
Copy link
Contributor Author

@sumitagrawl Could you help review this PR? Thank you very much! The purpose of this PR is to enhance the efficiency of SCM deletion.

@slfan1989
Copy link
Contributor Author

@xichen01 Do you have any other comments? If so, I will further improve this PR. Thank you very much!

@xichen01
Copy link
Contributor

xichen01 commented Oct 7, 2024

LGTM +1

@slfan1989
Copy link
Contributor Author

slfan1989 commented Oct 9, 2024

LGTM +1

Thank you for your response! I would like to provide an update on the execution status. We have achieved significant efficiency improvements in both EC-6-3 and Ratis-3 replicas. We are in the process of converting the cluster's data from Ratis-3 replicas to EC-6-3, primarily due to server cost considerations. EC-6-3 allows us to reduce our logical replication rate to 1.5 replicas, which necessitates a large-scale deletion of Ratis-3 replicas. After implementing this PR, we are able to delete 4PB of logical data each day, and the deletion efficiency is in line with our expectations.

I sincerely hope the community accepts this change, as I believe this PR will help more users achieve better deletion performance.

Ratis-3 Cluster

image

EC-6-3 Cluster

image

cc: @adoroszlai

@adoroszlai
Copy link
Contributor

@ashishkumar50 @sumitagrawl please review

@slfan1989
Copy link
Contributor Author

@xichen01 @adoroszlai Could we continue to move forward with this PR? I'm concerned that other team members might not have noticed it. Our core logic ensures that the deleted container is in the initially selected DN list, while the other changes mainly involve improving logs and cleaning up some unused code.

// feedback to SCM at roughly the same time.
// This avoids the issue of deletion blocking,
// where some replicas of a Container are deleted while others do not receive the delete command.
long containerId = tx.getContainerID();
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We have already provided relevant explanations regarding the core logic, which clarifies the purpose of this PR.
Line 289-307

@slfan1989
Copy link
Contributor Author

@sodonnel @sumitagrawl @ashishkumar50 I wonder if you could help review this PR? Thank you very much!

Copy link
Contributor

@sumitagrawl sumitagrawl left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@slfan1989 Thanks for working over this. The case as getting fix refers case where dnList do not have the DNs. This is possible if DN is not healthy OR overloaded, and hence not considered.
Earlier thought was If replica is there means DNs are healthy, but with overload condition, this might not be true.

Please check if this is same scenario where this is happening?

Overall changes looks good with few minor comments.

@ashishkumar50
Copy link
Contributor

@slfan1989 Thanks for the improvement.

We then send the deletion command to these 6 DNs, and the command executes normally, successfully deleting 6 blocks. However, if DN7 to DN9 are not selected, our deletion process will get stuck.

In your case whether any DN was slow or unhealthy? causing not to be selected for deletion.
With current behaviour SCM will keep track of remaining DN7 to DN9, until it waits to be successfully deleted.
After your change it requires all DN1 to DN9 to be in healthy and normal state, so that SCM will send deletion request in one time and process it.
But in certain case(may have happened in your environment as well) when some DNs are unhealthy or slow. In this case SCM will keep waiting and space reclaim from all DNs will halt for some time because of particular unhealthy DN. Do we need to consider this?

@slfan1989
Copy link
Contributor Author

@slfan1989 Thanks for working over this. The case as getting fix refers case where dnList do not have the DNs. This is possible if DN is not healthy OR overloaded, and hence not considered. Earlier thought was If replica is there means DNs are healthy, but with overload condition, this might not be true.

Please check if this is same scenario where this is happening?

Overall changes looks good with few minor comments.

@sumitagrawl @ashishkumar50

Thank you very much for your feedback and suggestions !

The main issue we are facing is the uncertainty of the deletion rate: sometimes it is relatively fast, but most of the time it is very slow, particularly during certain periods.

Regarding the "dnList do not have the DNs" situation, it may be due to some DNs being unhealthy or under heavy load. In distributed systems, it is common for some DNs to be in an unhealthy state. While the replicas in these unhealthy DNs cannot be deleted, this should not affect the deletion efficiency of data in other healthy DNs.

Based on our monitoring metrics, here is the deletion monitoring data from October 4, 2024, 00:00 to October 7, 2024, 00:00:

Overall monitoring shows that during the periods of October 4, 2024, from 00:00 to 04:00 and from 04:30 to 08:30, the deletion operations were relatively fast. However, during other time periods, the deletion efficiency significantly decreased, making it nearly impossible to complete deletion tasks.

image

To better compare the deletion efficiency, I will provide two additional monitoring reports for the periods of October 4, 2024, from 04:30 to 08:30 and October 6, 2024, from 04:30 to 08:30.

We can observe that during the period of October 4, 2024, from 04:30 to 08:30, the ratio of NumBlockDeletionTransactionSuccess to NumBlockDeletionTransactionCompleted is 3:1. In contrast, during the period of October 6, 2024, from 04:30 to 08:30, this ratio is significantly higher.

2024-10-04 04:30:00 ~ 2024-10-04 08:30:00

image

2024-10-06 04:30:00 ~ 2024-10-06 08:30:00

image

From the monitoring data for October 6, 2024, from 04:30 to 08:30, we can infer that a significant number of container replicas within the SCM were partially deleted, preventing the SCM from completing the deletion confirmation.

Our cluster consists of 1,500 machines, and the number of malfunctioning machines does not exceed 10. If more than 10 machines fail, we receive an alert and begin manual intervention for these DNs. In my personal view, the slow deletion issue arises from a combination of multiple factors, and it should not be attributed solely to unhealthy DNs or high load.

The contributing factors I personally speculate are as follows:

  1. The OM continuously issues deletion requests for blocks to the SCM, which alters the order in which the SCM selects transaction iterators.
  2. The replicas of a certain container are only sent to the DNs included in the DN list.
  3. Due to some unhealthy DNs or high load on DNs, the DN list is constantly changing.

Based on the current monitoring observations, once some DNs in the replicas of a certain container (A) do not receive the deletion command, even if those DNs later recover, they may struggle to quickly receive the deletion command for container (A) due to potential changes in the SCM's transaction iterator (as the OM continuously issues new deletion requests). As a result, these DNs may take a long time to receive the deletion command, leading to a slow deletion process.

The actual purpose of this PR is to make deletions orderly. If a certain container does not meet the conditions for deletion, we will ignore it until it satisfies the deletion conditions (i.e., when all DNs are healthy).

@slfan1989
Copy link
Contributor Author

@slfan1989 Thanks for the improvement.

We then send the deletion command to these 6 DNs, and the command executes normally, successfully deleting 6 blocks. However, if DN7 to DN9 are not selected, our deletion process will get stuck.

In your case whether any DN was slow or unhealthy? causing not to be selected for deletion. With current behaviour SCM will keep track of remaining DN7 to DN9, until it waits to be successfully deleted. After your change it requires all DN1 to DN9 to be in healthy and normal state, so that SCM will send deletion request in one time and process it. But in certain case(may have happened in your environment as well) when some DNs are unhealthy or slow. In this case SCM will keep waiting and space reclaim from all DNs will halt for some time because of particular unhealthy DN. Do we need to consider this?

@ashishkumar50 Thank you for your question! In my previous response, I provided some detailed thoughts on this matter. Overall, I believe that the slow deletion might be related to some unhealthy DNs, but their unhealthiness is just one of the factors involved. In this PR, we will skip containers that do not meet the conditions, meaning that if a certain DN is unhealthy, no deletions will be executed for all replicas of the containers above it.

@ashishkumar50
Copy link
Contributor

The actual purpose of this PR is to make deletions orderly. If a certain container does not meet the conditions for deletion, we will ignore it until it satisfies the deletion conditions (i.e., when all DNs are healthy).

This makes sense when we have enough containers to delete(during SCM delete iteration) for which all DNs are healthy, my only point is at some point when when there are only containers left which doesn't satisfy this condition, whether we should wait for all DNs to become healthy or just proceed with the deletion for the available healthy DNs at that point?

@slfan1989
Copy link
Contributor Author

The actual purpose of this PR is to make deletions orderly. If a certain container does not meet the conditions for deletion, we will ignore it until it satisfies the deletion conditions (i.e., when all DNs are healthy).

This makes sense when we have enough containers to delete(during SCM delete iteration) for which all DNs are healthy, my only point is at some point when when there are only containers left which doesn't satisfy this condition, whether we should wait for all DNs to become healthy or just proceed with the deletion for the available healthy DNs at that point?

@ashishkumar50 I believe waiting for all data nodes (DNs) to be ready before proceeding with deletion is a better choice. In the case of 3 replicas, it's acceptable for SCM to wait, but for EC-6-3, it’s less friendly. If one DN has an issue, the deletion of containers on the other 8 DNs will be affected. Issuing delete commands to these 8 DNs will occupy their deletion queues, leading to slower deletion of other containers.

@ashishkumar50
Copy link
Contributor

@slfan1989 Thanks for the explanation, LGTM.

@sumitagrawl
Copy link
Contributor

@slfan1989 two of comment is still pending...

@slfan1989
Copy link
Contributor Author

@slfan1989 two of comment is still pending...

@sumitagrawl Could you please help review this PR again? Thank you very much!

@slfan1989
Copy link
Contributor Author

@slfan1989 Thanks for the explanation, LGTM.

@ashishkumar50 Thank you very much for helping to review the code!

Copy link
Contributor

@sumitagrawl sumitagrawl left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@slfan1989 Plz find comment for still open cases

Copy link
Contributor

@sumitagrawl sumitagrawl left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@adoroszlai adoroszlai merged commit 4670a5e into apache:master Oct 17, 2024
@adoroszlai
Copy link
Contributor

Thanks @slfan1989 for the patch, @ashishkumar50, @sumitagrawl, @xichen01 for the review.

@slfan1989
Copy link
Contributor Author

@ashishkumar50, @sumitagrawl, @xichen01 Thank you very much for helping to review the code! I appreciate your suggestions.

@adoroszlai Thank you very much for merging this PR and for helping to find members to review it!

xichen01 pushed a commit to xichen01/ozone that referenced this pull request Oct 21, 2024
ptlrs pushed a commit to ptlrs/ozone that referenced this pull request Nov 21, 2024
aswinshakil added a commit to aswinshakil/ozone that referenced this pull request Dec 19, 2024
…239-container-reconciliation-merge

Commits:
0066526 HDDS-11869. Enable OM Ratis in TestOzoneDelegationTokenSecretManager (apache#7594)
4fe166d HDDS-11957. Make breadcrumb scrollable for long path names in DiskUsage page (apache#7590)
a523e17 HDDS-11846. [Recon] Recon Schema version_number column is always set as -1. (apache#7554)
f5ff2f0 HDDS-11868. Enable OM Ratis in TestQuotaRepairTask (apache#7593)
3a0e3b5 HDDS-11845. Extract k8s definitions for HttpFS and Recon from getting-started example (apache#7523)
6e0c753 HDDS-11509. logging improvements for directory deletion (apache#7314)
1f29e05 HDDS-11934. Split compat suite to old/new (apache#7578)
bde8cf4 HDDS-11759. Remove LegacyReplicationManager (apache#7580)
a27e4ec HDDS-11779. Add DN metrics to show deletion progress (apache#7552)
976e45f HDDS-11711. Add SCM metrics for delete commands sent and response received per datanode (apache#7522)
c28e16e HDDS-11950. Enable sortpom in dev-support module. (apache#7586)
dae388b HDDS-11907. OzoneSecretKey does not need to implement Writable (apache#7574)
8bb0587 HDDS-11712. Process other DeletedBlocksTransaction before retrying failed one. (apache#7532)
3648b59 HDDS-11906. Add sortpom dependency, sort root POM. (apache#7555)
54f0272 HDDS-11807. Make callId different for each request in openKeyCleanupService (apache#7551)
c523825 HDDS-11926 - Rename bucket name for bucket info/ls for linked buckets (apache#7581)
daf2f9f HDDS-11863. Speed up TestFSORepairTool (apache#7561)
f5e5493 HDDS-11927. Fix flaky TestContainerBalancerStatusInfo.testGetCurrentStatisticsWhileBalancingInProgress (apache#7579)
bef2415 HDDS-11940. Bump jline to 3.28.0 (apache#7576)
1453fd9 HDDS-11935. Bump develocity-maven-extension to 1.23 (apache#7577)
202b0c7 HDDS-11860. Improve BufferUtils.writeFully. (apache#7564)
008f9a6 HDDS-11852. Reduce duplication in some GenericCli subclasses (apache#7553)
7a46080 HDDS-11914. Snapshot diff should not filter SST Files based by reading SST file reader (apache#7563)
1835326 HDDS-11927. Mark testGetCurrentStatisticsWhileBalancingInProgress as flaky
66ccc25 HDDS-11908. Snapshot diff DAG traversal should not skip node based on prefix presence (apache#7567)
16ba289 Revert "HDDS-11413. PipelineManagerImpl lock optimization reduces AllocateBlock latency (apache#7160)"
bf6f323 HDDS-11413. PipelineManagerImpl lock optimization reduces AllocateBlock latency (apache#7160)
853d657 HDDS-11893. Fix full snapshot diff fallback logic because of DAG pruning (apache#7549)
b5d04e2 HDDS-11915. Netty OpenSsl not available in acceptance tests on arm64 (apache#7570)
8536490 HDDS-11367. Fix flaky balancer robot test (apache#7569)
745ed1c HDDS-11367. Improve ozone balancing status command output (apache#7139)
6b9cbe0 HDDS-11909. Intermittent timeout building Hadoop in s3a test (apache#7559)
eea5600 HDDS-11911. Return consistent error code when snapshot is not found in the DB or Snapshot Chain. (apache#7557)
e8f3b25 HDDS-11873. Skip old-only xcompat read tests (apache#7534)
ec348a7 HDDS-11889. Include Maven dependencies for hdds-rocks-native in cache (apache#7546)
aa37ae8 HDDS-11885. Download Hadoop for S3A test from mirrors if available (apache#7545)
befd64e HDDS-11694. Safemode Improvement: Introduce factory class to create safemode rules. (apache#7433)
a46153d HDDS-11872. Disable Apache snapshots repo (apache#7536)
092b000 HDDS-11890. Update project description in GitHub (apache#7547)
80c6446 HDDS-8101. Add tool to repair broken FSO tree (apache#7368)
23197e2 HDDS-11605. Directory deletion service should support multiple threads (apache#7349)
055b13c HDDS-11751. Use Java 21 in CI (apache#7458)
d0d82c5 HDDS-11886. Bump license-maven-plugin to 2.5.0 (apache#7539)
f7fe30a HDDS-11691. Support object tags in ObjectEndpointStreaming#put (apache#7543)
9854591 HDDS-11882. Make BOM, not aggregate one (apache#7544)
af345b2 HDDS-11877. Enable Maven cache for more checks (apache#7538)
51c6ed6 HDDS-11830. Subcommands should not extend GenericCli .(apache#7537)
959a39d HDDS-11851. Finer-grained subcommand interface for OzoneDebug and OzoneRepair. (apache#7526)
d4c41e5 HDDS-11334. Improve EC xcompat acceptance test (apache#7492)
8526f2e HDDS-11826. Interactive mode for ozone shell. (apache#7515)
fc63710 HDDS-11833. Return NotImplemented for S3 put-object-acl request. (apache#7531)
e8ad7ad HDDS-11782. ozone debug ldb --with-keys defaults to false instead of true (apache#7521)
cb0a402 HDDS-11859. Remove mention of fuse from s3 interface docs page (apache#7530)
e17f92c HDDS-10821. Ensure ChunkBuffer fully writes buffer to FileChannel (apache#6652)
d66c088 HDDS-11848. Serialization bug in Recon listKeys API (apache#7524)
faab1e8 HDDS-11656. Default native ACL limits to user and user's primary group (apache#7455)
8a1967e HDDS-11719. Remove dependency on server components from ozonefs-common (apache#7438)
9fcecc1 HDDS-11855. Mark TestContainerBalancerDatanodeNodeLimit#checkIterationResultException as flaky
65df308 HDDS-11794. Display HostName in OM / SCM Overview. (apache#7482)
0bde3a2 HDDS-11410. Refactoring more tests from TestContainerBalancerTask (apache#7156)
98e070e HDDS-11728. Refactor subcommand layouts of ozone debug and repair (apache#7489)
1c5676b HDDS-11849. Mark TestBlockOutputStreamWithFailures.test2DatanodesFailure as flaky
9dd8bb9 HDDS-11847. Mark TestSnapshotDeletingServiceIntegrationTest#testParallelExcecutionOfKeyDeletionAndSnapshotDeletion as flaky
b27714f HDDS-11266. Update proto.lock for Ozone 1.4.1 (apache#7504)
9b61937 HDDS-11806. Add HttpFS and Recon in getting-started k8s example (apache#7485)
77ce962 HDDS-10568. When the ldb command is executed, it is output by line (apache#7467)
69538b0 HDDS-11687. Robot warning: replace "is not" with "!=" (apache#7516)
b60d897 HDDS-11820. Create test principals at test run time (apache#7507)
6ba309a HDDS-11831. Finer-grained interface for dynamically registered subcommands (apache#7514)
db36e39 HDDS-11822. Register subcommands in OzoneShell (apache#7513)
e6bd3f5 HDDS-11829. Bump zstd-jni to 1.5.6-8 (apache#7510)
09a4a90 HDDS-11827. Bump exec-maven-plugin to 3.5.0 (apache#7512)
ad867bc HDDS-11824. Bump sqlite-jdbc to 3.47.1.0 (apache#7511)
850306d HDDS-11823. Bump cyclonedx-maven-plugin to 2.9.1 (apache#7508)
34f9d9e HDDS-11742. Update metrics with leaderId if known when starting SCM (apache#7471)
2c6c116 HDDS-11265. Add Ozone 1.4.1 to compatibility acceptance tests (apache#7503)
ebcdc6a HDDS-11810. Secure acceptance test on arm64 fails with LoginException: Checksum failed (apache#7498)
7f40624 HDDS-11821. Mark TestECKeyOutputStream#testECKeyCreatetWithDatanodeIdChange as unhealthy
9b26156 HDDS-11811. rocksdbjni deleted on exit could be used by other components apache#7493
3f92663 HDDS-11785. DataNode aborts ContainerStateMachine if it does not know any follower next index (apache#7480)
f0a2c87 HDDS-11773. Prevent frequent DataNode Ratis snapshotting. (apache#7473)
1383c18 HDDS-11718. Some CI checks passing despite error (apache#7483)
f98eac2 HDDS-11561. Refactor Open Key Search Endpoint and Consolidate with OmDBInsightEndpoint Using StartPrefix Parameter. (apache#7336)
a99ab27 HDDS-11243. SCM SafeModeRule Support EC. (apache#7008)
6871547 HDDS-11716. Address Incomplete Upgrade Scenario in Recon Upgrade Framework (apache#7452)
9bc9145 HDDS-10411. Support incremental ChunkBuffer checksum calculation (apache#7189)
579a38e HDDS-11723. Tool to better micro benchmark hbase performance in Ozone (apache#7463)
cc1a374 HDDS-11704. Hadoop test leaves running containers in case of failure (apache#7435)
c4b2056 Update documentation to mention that container schemaV3 is default (apache#7481)
20c4cfa HDDS-11386. Multithreading bug in ContainerBalancerTask (apache#7339)
b090312 HDDS-11780. Increase client write retry when SCM is in safe mode (apache#7470)
f8e4db9 HDDS-11793. Bump maven-checkstyle-plugin to 3.6.0 (apache#7476)
6a1ff84 HDDS-11791. Bump commons-io to 2.18.0 (apache#7478)
d7f8235 HDDS-11788. Bump log4j2 to 2.24.2 (apache#7479)
6547de7 HDDS-11790. Bump commons-lang3 to 3.17.0 (apache#7475)
1d8abd6 HDDS-11789. Bump zstd-jni to 1.5.6-7 (apache#7477)
6ca7230 HDDS-11769. Add tools folder into ozone src package. (apache#7466)
3b8ed58 HDDS-11682. Bump maven-resources-plugin to 3.3.0 (apache#7384)
f4a9ee0 HDDS-11702. Merge test_bucket_encryption into robot compatibility test (apache#7451)
d6a5488 HDDS-11713. Use seek to reach the start transaction. (apache#7460)
d52615a HDDS-11733. Remove okio versioning and unused okhttp dependency (apache#7447)
1a49991 HDDS-11617. Update hadoop to 3.4.1 (apache#7376)
9945de6 HDDS-11667. Validating DatanodeID on any request to the datanode (apache#7418)
fc6a2ea HDDS-11650. ContainerId list to track all containers created in a datanode (apache#7402)
a8db9cd HDDS-11749. Extract moveToTrash implementation to client code (apache#7453)
3ba3474 HDDS-11755. mktemp --suffix does not work on Mac (apache#7457)
433c7bb HDDS-11729. Update skipRecon property to skip only frontend build (apache#7454)
6b40003 HDDS-11739. Extract generic unmarshaller for S3 requests (apache#7449)
c7f65e7 HDDS-11740. Add debug command to show internal component versions (apache#7450)
0f7104e HDDS-11708. Recon ListKeys API should return a proper http response status code if NSSummary rebuild is in progress. (apache#7437)
0e0d5e9 HDDS-11163. Improve Heatmap page UI (apache#7420)
2cef393 HDDS-11696. Limit max number of entries in list keys/status response (apache#7431)
e96e314 HDDS-11697. Integrate Ozone Filesystem Implementation with Ozone ListStatusLight API (apache#7440)
ebcbce7 HDDS-11644. Close OMLayoutVersionManager (apache#7445)
20e4969 HDDS-11737. UnsupportedOperationException in S3 setBucketAcl (apache#7448)
b252181 HDDS-10804. Include only limited set of ports in Pipeline proto (apache#6655)
79ca956 HDDS-8829. Symmetric Keys for Delegation Tokens (apache#7394)
3e798e6 HDDS-11698. Use hadoop images from GitHub in CI (apache#7432)
3e278b7 HDDS-10655. Support PutObjectTagging, GetObjectTagging, and DeleteObjectTagging (apache#6756)
036e727 HDDS-11732. Fix ACL check on bucket resolution while reading from snapshot (apache#7446)
dbda703 HDDS-11736. Bump maven-javadoc-plugin to 3.11.1 (apache#7444)
238f232 HDDS-11692. Skip spotbugs for modules with only generated code. (apache#7428)
f60ad61 HDDS-11705. Snapshot operations on linked buckets should work on actual underlying bucket (apache#7434)
dd22dbe HDDS-11615. Add Upgrade Action for Initial Schema Constraints for Unhealthy Container Table in Recon. (apache#7372)
4066c7c HDDS-117. Add convenience methods for port management in DatanodeDetails (apache#7408)
12419fa HDDS-11695. SCM follower should not log NotLeaderException during Pipeline Report processing. (apache#7430)
5275ded HDDS-10133. Add a method to check key name in OMKeyRequest (apache#6012)
fd5c6d8 HDDS-11689. Extract scheduled workflow for populate-cache (apache#7429)
889ba80 HDDS-11653. Bump Ratis to 3.1.2 (apache#7427)
47ec4dd HDDS-11671. Refer to website for supported versions (apache#7412)
10cac80 HDDS-11686. Use ozone image from GitHub in CI (apache#7425)
aa6da3e HDDS-9781. Limited maxOpenFiles, disabled enableCompactionDag, and createCheckpointDirs when creating OMMetadataManager instance for bootstrapping (apache#7095)
9dd6a83 HDDS-11645. Mark TestReconScmSnapshot#testExplicitRemovalOfNode as flaky
8e617dc HDDS-11672. Mark TestSnapshotBackgroundServices#testCompactionLogBackgroundService as flaky
d09e6d4 HDDS-11646. Mark TestXceiverClientMetrics#testMetrics as flaky
8e4a508 HDDS-11668. Recon List Keys API: Reuse key prefix if parentID is the same (apache#7410)
ee63232 HDDS-11684. Remove suppression of HiddenField (apache#7423)
a33d8a3 HDDS-10166. Replace GenericTestUtils temporary directories with `@TempDir` (apache#7399)
3a18a9d HDDS-11664. Hadoop download failure not reported as error (apache#7421)
47c2409 HDDS-64. OzoneClientException should extend IOException. (apache#7403)
27fcd0c HDDS-11685. Use ozone-testkrb5 from GitHub (apache#7424)
5663971 HDDS-11665. Minor optimizations on the write path (apache#7407)
cb81f0c HDDS-11683. Skip shade in most integration checks (apache#7422)
952e0ec HDDS-11681. Bump Bouncy Castle to 1.79 (apache#7387)
2797c45 HDDS-11677. Bump sqlite-jdbc to 3.47.0.0 (apache#7413)
358534b HDDS-11675. Bump maven-site-plugin to 3.21.0 (apache#7414)
cf79245 HDDS-11674. Bump junit to 5.11.3 (apache#7415)
6dd566f HDDS-11583. Use ozone-runner from GitHub in CI (apache#7409)
ef2bf98 HDDS-11669. In OmUtils.normalizeKey isDebugEnabled should be evaluated first (apache#7411)
a7e3014 HDDS-11660. Recon List Key API: Reduce object creation and buffering memory (apache#7405)
5d18b9c HDDS-11659. Improve HSync compatibility test (apache#7404)
4e603aa HDDS-11462. Enhancing DataNode I/O Monitoring Capabilities. (apache#7206)
18f6e8a HDDS-11311. Added Compatibility test for HSync (apache#7400)
0415c0b HDDS-11649. Recon ListKeys API: Simplify filter predicates (apache#7395)
2547ac0 HDDS-11652. Fix SCM start command in SCM-HA doc (apache#7398)
c045839 HDDS-11623. Improve OM Ratis Configuration change log message (apache#7388)
2b1524b HDDS-11609. Switch to Recon v2 UI as the default UI (apache#7358)
efe5892 HDDS-11641. Allow testing Hadoop with custom docker images (apache#7393)
c055036 HDDS-11637. Compile failure is ignored in build check (apache#7389)
67e5261 HDDS-11563. Display OM/SCM service ID as Namespace in web UI (apache#7321)
0fb5e50 HDDS-11587. Ozone Manager not processing file put requests with multi-tenancy enabled (apache#7316)
786bb49 HDDS-11642. MutableQuantiles should be stopped (apache#7392)
76ec9b9 HDDS-11639. Upgrade ozone-runner to Rocky Linux 9.3 (apache#7391)
3bc3b8a HDDS-11621. Fix missing HADOOP_ variables in MR acceptance test (apache#7375)
6f9db61 HDDS-11200. Hsync client-side metrics (apache#7371)
58d1443 HDDS-10240. Cleanup zero-copy EC (apache#7340)
5b065d8 HDDS-11638. Bump cyclonedx-maven-plugin to 2.9.0 (apache#7383)
c7a196f HDDS-11635. Memory leak when using Ozone FS via Hadoop FileContext API (apache#7382)
c9956a1 HDDS-11601. Intermittent failure in acceptance balancer test. (apache#7343)
a737fc3 HDDS-11619. Remove dependency on hadoop-shaded-guava (apache#7373)
c4d6857 HDDS-11584. Document ozone debug ldb command (apache#7313)
dded26e HDDS-11588. Add main artifact jar to classpath file (apache#7324)
afed6d9 HDDS-11558. Make OM client retry idempotent (apache#7329)
e85b32d HDDS-11591. Copy dependencies when building each module (apache#7325)
72e56d7 HDDS-11601. Disable flaky EC balancer acceptance test
ab16cbe HDDS-11507. Add error information to log while handling ServiceException. (apache#7367)
980b960 HDDS-11380. Make node decommission error message more comprehensive (apache#7155)
61c094f HDDS-11614. Speed up TestTransferLeadershipShell (apache#7370)
91188b3 HDDS-11352. Remove Flaky annotation from TestOzoneManagerHAWithStoppedNodes using Ratis 3.1.1
faf133d HDDS-11220. Initialize block length using the chunk list from DataNode before seek (apache#7221)
91d41a0 HDDS-11465. Introducing Schema Versioning for Recon to Handle Fresh Installs and Upgrades. (apache#7213)
7a27db2 HDDS-11134. Create compatibility test for FSO bucket usage (apache#7350)
30906d1 HDDS-11612. Bump jnr-posix to 3.1.20 (apache#7360)
bed4aef HDDS-11611. Bump docker-maven-plugin to 0.45.1 (apache#7362)
e2c3d57 HDDS-11610. Bump maven-dependency-plugin to 3.8.1 (apache#7361)
24c1000 HDDS-11041. Add admin request filter for S3 requests and UGI support for GrpcOmTransport (apache#7268)
0b84998 HDDS-11594. Update batchPut buffer log for rocksdb. (apache#7356)
c013516 HDDS-11608. Client should not retry invalid protobuf request (apache#7354)
32a8c09 HDDS-11160. Improve Insights page UI (apache#7327)
782ad62 HDDS-11388. Fix unnecessary call to the DB for ContainerBalancer#getBalancerStatusInfo (apache#7224)
35b6a3a HDDS-11600. Intermittent failure in repro due to ordering differences in builddef.lst (apache#7342)
e7bf154 HDDS-11132. Revert client version bump done as part of HDDS-10983 (apache#7348)
ea5cbff HDDS-11602. Bump ozone-runner to 20241022-jdk17-1 (apache#7347)
3f98df5 HDDS-11580. Validate 'hdds.datanode.dir.du.reserved' property (apache#7328)
8568075 HDDS-11570. Fix HDDS Docs build failure with Hugo v0.135.0 (apache#7337)
721ae58 HDDS-11057. Enable reproducible builds (apache#6856)
86b7aae HDDS-11205. Implement a search feature for users to locate keys pending Deletion within the OM Deleted Keys Insights section (apache#6969)
f7b428d HDDS-11503. Add Robot test to verify Container Balancer for EC containers. (apache#7311)
85eb89b HDDS-11483. Make s3g object get and put operation buffer configurable (apache#7233)
515977a HDDS-11582. Bump body-parser to 1.20.3 (apache#7307)
9b66267 HDDS-11589. ReconSCMDBDefinition should be singleton. (apache#7323)
f784a84 HDDS-11578. Unify constants for RATIS_SNAPSHOT_DIR (apache#7310)
4670a5e HDDS-11498. Improve SCM deletion efficiency. (apache#7249)
3fb2cf0 HDDS-11108. Extract keywords for multipart upload tests (apache#7318)
4b24aa9 HDDS-11545. [UI] Add OM and SCM ID information (apache#7287)
860e269 HDDS-11538. Let coverage report link to java sources (apache#7280)
2139367 HDDS-11581. Remove duplicate ContainerStateMachine#RaftGroupId (apache#7312)
64e035d HDDS-11573. Remove lib/gson-2.10.1.jar (apache#7309)
ce07a3c HDDS-11456. Require successful dependency/licence checks for acceptance/compile/kubernetes (apache#7209)
c579d06 HDDS-11574. Ozone client leak in TestS3SDKV1 (apache#7308)
c044b79 HDDS-10390. MiniOzoneCluster to support S3 Gateway (apache#7281)
8eef589 HDDS-11557. Simplify DBColumnFamilyDefinition. (apache#7298)
4c77f6b HDDS-11562. Parameterize TestSCMNodeManager#testProcessLayoutVersion (apache#7300)
b51c4b3 HDDS-11572. Bump commons-io to 2.17.0 (apache#7305)
3a37870 HDDS-11571. Bump log4j2 to 2.24.1 (apache#7301)
494798c HDDS-11564. Mark TestBlockOutputStream#testWriteExactly... as flaky
fabf512 HDDS-11569. Bump restrict-imports-enforcer-rule from 2.5.0 to 2.6.0 (apache#7303)
1e62a0a HDDS-11568. Bump commons-codec to 1.17.1 (apache#7304)
e9f92a7 HDDS-11567. Bump common-custom-user-data-maven-extension to 2.0.1 (apache#7302)
cb44d5e HDDS-11555. SCMDBDefinition should be singleton. (apache#7296)
d473134 HDDS-11486. Reduce log level for NativeLibraryNotLoadedException in SnapshotDiffManager (apache#7290)
3348d91 HDDS-11564. Mark TestBlockOutputStream as flaky
e2f2aeb HDDS-11548. Add some logging to the StateMachine (apache#7291)
523c860 HDDS-11439. De-duplicate code for ReplicatedFileChecksumHelper and ECFileChecksumHelper (apache#7264)
05a409e HDDS-11519. Clean up unused lines in BlockOutputStream
ffe7198 HDDS-11544. Improve work with arrays (apache#7286)
5657604 HDDS-11556. Add a getTypeClass method to Codec. (apache#7295)
256aad9 HDDS-11546. Add regex operation to filter option of ldb scan command. (apache#7289)
7ef7de2 HDDS-11482. EC Checksum throws IllegalArgumentException because the buffer limit is negative (apache#7230)
77c17df HDDS-11551. Provide details about integration check failure (apache#7294)
911a583 HDDS-8188. Support max allowed length in response of ozone admin container list (apache#7181)
7f2e0e3 HDDS-11554. OMDBDefinition should be singleton. (apache#7292)
170761c HDDS-11547. Make MAVEN_OPTS optional (apache#7288)
4846e97 HDDS-11543. Track OzoneClient object leaks via LeakDetector framework. (apache#7285)
e00f7ae HDDS-11159. Improve Containers page UI (apache#7267)
cfda951 HDDS-11520. Fix Delete pending directories key mapping (apache#7269)
2e3de8a HDDS-11476. Implement lesser/greater operation for --filter option of ldb scan command (apache#7222)
06ccdb3 HDDS-11526. Fix hdds.datanode.metadata.rocksdb.cache.size default value mismatch (apache#7284)
b3afaec HDDS-11535. Incomplete SCM roles table header (apache#7278)
ed2a073 HDDS-11536. Bump macOS runner version to macos-13 (apache#7279)
1887f83 HDDS-11537. Bump frontend-maven-plugin to 1.15.1 (apache#7276)
1f1e618 HDDS-6776. Cleanup TestSCMSafeModeManager (apache#7272)
4bee3e9 HDDS-11534. Bump cyclonedx-maven-plugin to 2.8.2 (apache#7277)
789fb53 HDDS-11533. Bump maven-gpg-plugin to 3.2.7 (apache#7275)
eb26677 HDDS-11268. Add --table mode for OM/SCM Roles CLI (apache#7016)
28ea480 HDDS-11527. Avoid unnecessary duplicate build (apache#7270)
30da31f HDDS-3498. Shutdown datanode if address is already in use (apache#7256)
2401d27 HDDS-11046. Coverage decreased due to running tests with Java 17 (apache#7263)
78d8418 HDDS-11524. Bump snappy-java to 1.1.10.7 (apache#7202)
8747c0e HDDS-11518. Recon OmDB Insights show isKey=true for directories (apache#7260)
5d2bbc3 HDDS-11480. Refactor OM volume response tests (apache#7265)
31f9f2c HDDS-11517. Update version to 2.0.0-SNAPSHOT (apache#7258)
a0f0872 HDDS-11444. Make Datanode Command metrics consistent across all commands (apache#7191)
d3b63c6 HDDS-11492. Directory deletion get stuck having millions of directory (apache#7254)
f52f0af HDDS-11127. [hsync] Improve test coverage for XceiverClientRatis.java (apache#7225)
360fea5 HDDS-11494. Improve the duration option of freon ombg (apache#7246)
10d3b21 HDDS-11504. Update Ratis to 3.1.1. (apache#7257)
ce46297 HDDS-11162. Improve Disk Usage page UI (apache#7214)
c91f1c7 HDDS-11491. Avoid sharing clientId among deleting services (apache#7250)
b0943d5 HDDS-11501. Improve logging in XceiverServerRatis (apache#7252)
55925ab HDDS-11502. Class path contains multiple SLF4J providers (apache#7255)
d0ad836 HDDS-11472. Avoid recreating external access authorizer on OM state reload (apache#7238)
254db9e HDDS-11500. RootCARotationManager cancelling wrong task in notifyStatusChanged (apache#7251)
1e6e4b3 HDDS-11499. Remove redundant code from ECReconstructionCoordinator. (apache#7248)
adb2821 HDDS-11490. Bump rollup to 3.29.5 (apache#7232)
189a9fe HDDS-11484. Validate javadoc in CI (apache#7245)
64a29c6 HDDS-11497. Bump commons-configuration2 to 2.11.0 (apache#7242)
95cfadd HDDS-11496. Bump maven-install-plugin to 3.1.3 (apache#7244)
0a999cf HDDS-11493. Bump sqlite-jdbc to 3.46.1.3 (apache#7243)
a214a31 HDDS-11329. Update Ozone images to Rocky Linux-based runner (apache#7241)
56ddb85 HDDS-11371. Handle cases where OM does not have getServerDefaults() implemented. (apache#7130)
b5097c7 HDDS-11347. Add rocks_tools_native lib check in Ozone CLI checknative subcommand (apache#7101)
fb0bf77 HDDS-11489. Bump maven-site-plugin to 3.20.0 (apache#7226)
70e6e40 HDDS-11122. Fix javadoc warnings (apache#7234)
acf3fdc HDDS-11458. Selective checks: trigger checkstyle for properties file changes (apache#7196)
6b87207 HDDS-11469. Statistics of Pipeline and Container (apache#7217)
1b8468b HDDS-11411. Snapshot garbage collection should not run when the keys are moved from a deleted snapshot to the next snapshot in the chain (apache#7193)
1f86ce8 HDDS-10617. Unexpected number of files in ITestS3AContractGetFileStatusV1List (apache#7208)
73a3bcc HDDS-11467. Bump vite to 4.5.5 (apache#7212)
d45aa1d HDDS-11460. Bump express to 4.21.0 (apache#7197)
e2e30b8 HDDS-11354. Intermittent failure in TestOzoneManagerSnapshotAcl#testLookupKeyWithNotAllowedUserForPrefixAcl (apache#7205)
0fcb645 HDDS-11477. [doc] Add configuration description for datanode docs (apache#7223)
3598ee3 HDDS-11464. Removed unused constants from OzoneConsts. (apache#7207)
8c0b54e HDDS-11408. Snapshot rename table entries are propagated incorrectly on snapshot deletes (apache#7200)
719bdf9 HDDS-11396. NPE due to empty Handler#clusterId (apache#7145)
40c4001 HDDS-10479. Add ozone admin ratis local raftMetaConf (apache#7170)
45f9138 HDDS-11394. Fix pipeline close --all command (apache#7138)
2b196d1 HDDS-11468. Enabled DB sync button (apache#7216)
d3899d2 Clean up files created after TestKeyValueHandlerWithUnhealthyContainer#testMarkContainerUnhealthyInFailedVolume (apache#7219)
70b8dd5 HDDS-11157. Improve Datanodes page UI (apache#7168)
151709a HDDS-11446. Downgrade picocli to 4.7.5 due to regression (apache#7215)
7a26aff HDDS-11158. Improve Pipelines page UI (apache#7171)
c365aa0 HDDS-11181. Cleanup of unnecessary try-catch blocks (apache#7210)
88dd436 HDDS-11423. Implement equals operation for --filter option to ozone ldb scan (apache#7167)
e0060a8 HDDS-11196. Improve SCM WebUI Display (apache#6960)
22ddfb9 Revert "HDDS-11456. Require successful dependency/licence checks for acceptance/compile/kubernetes (apache#7192)"
9f5bf43 HDDS-11457. Internal error on S3 CompleteMultipartUpload if parts are not specified (apache#7195)
10c47a1 HDDS-11459. Bump develocity-maven-extension to 1.22.1 (apache#7201)
50f2563 HDDS-11419. Fix waitForCheckpointDirectoryExist log message (apache#7199)
a7d7e37 HDDS-11456. Require successful dependency/licence checks for acceptance/compile/kubernetes (apache#7192)
5feb9ea HDDS-11453. OmSnapshotPurge should be in a different ozone manager double buffer batch (apache#7188)
703c4d5 HDDS-10984. Tool to restore SCM certificates from RocksDB. (apache#6781)
d221065 HDDS-11440. Add a lastTransactionInfo field in SnapshotInfo to check for transactions in flight on the snapshot (apache#7179)
e573701 HDDS-11448. Improve documentation in ContainerStateMachine (apache#7183)
0e49f7a HDDS-11449. Remove unnecessary log from client console. (apache#7184)
cd251f2 HDDS-11438. Ensure DataInputBuffer is closed in OMPBHelper#convert (apache#7182)
4b47812 HDDS-11389. Incorrect number of deleted containers shown in Recon UI. (apache#7149)
0915f0b HDDS-10985. EC Reconstruction failed because the size of currentChunks was not equal to checksumBlockDataChunks. (apache#7009)
0f16195 HDDS-11416. refactor ratis submit request avoid code duplicate (apache#7166)
86fe920 HDDS-11376. Improve ReplicationSupervisor to record replication metrics (apache#7140)
883a63f HDDS-11441. ozone sh key put should only accept positive expectedGeneration (apache#7180)
33dbd4a HDDS-11357. Datanode Usageinfo Support Display Pipeline. (apache#7105)
9477aa6 HDDS-11436. Minor update in Recon API handling. (apache#7178)
8ca33c7 HDDS-11414. Key listing for FSO buckets fails with forward client (apache#7161)
f1ebd39 HDDS-11435. Bump sqlite-jdbc to 3.46.1.0 (apache#7174)
aaf8bd0 HDDS-11434. Bump log4j2 to 2.24.0 (apache#7176)
3510ce7 HDDS-11433. Bump Jetty to 9.4.56.v20240826 (apache#7175)
0047cd2 HDDS-11400. Bump maven-core to 3.9.9 (apache#7144)
274da83 HDDS-10488. Datanode OOM due to run out of mmap handler (apache#6690)
7a452ca HDDS-11391. Addendum to fix test failure.
7e1d9b0 HDDS-11145. ozone admin om cancelprepare --service-id improvement (apache#7159)
6888cf2 HDDS-11383. Improve read key dashboard to include add the read key related OM metrics. (apache#7131)
3e0d76c HDDS-11369. [hsync] Remove KeyOutputStreamSemaphore logs (apache#7136)
b23981c HDDS-11342. [hsync] Add a config as HBase-related features master switch (apache#7126)
3e1188a HDDS-11285. cli to trigger quota repair and status (apache#7104)
2e33978 HDDS-11401. Code cleanup in DatanodeStateMachine (apache#7146)
f563d67 HDDS-11391. Frequent Ozone DN Crashes During OM + DN Decommission with Freon (apache#7154)
18b28d2 HDDS-11312. [hsync] Added upgrade tests (apache#7110)
b29beb3 HDDS-11350. NullPointerException thrown on checking container balancer status (apache#7134)
111b9df HDDS-11407. Use OMLayoutFeature.HBASE_SUPPORT for HSYNC (apache#7152)
966b8d0 HDDS-11390. Removed hsync and hflush capability check in ContentGenerator (apache#7153)
877504a HDDS-11156. Improve Buckets page UI (apache#7100)
814f78f HDDS-11392. ChecksumByteBufferImpl's static initializer fails with java 17+ (apache#7135)
b5e1a8b HDDS-11398. Bump commons-compress to 1.27.1 (apache#7142)
a8e3ea9 HDDS-11397. Bump Jersey2 to 2.45 (apache#7141)
5992837 HDDS-11399. Bump maven-deploy-plugin to 3.1.3 (apache#7143)
47564bb HDDS-11359. Intermittent timeout in TestPipelineManagerMXBean#testPipelineInfo (apache#7132)
cc4e026 HDDS-11304. Make up for the missing functionality in CommandDispatcher (apache#7062)
2d372f6 HDDS-11339. Let PrometheusServlet rely on periodically published metrics (apache#7092)
f22c6f8 HDDS-11164. Improve Navbar UI (apache#7088)
23211c1 HDDS-11381. Adding logging for sortByDistanceCost in NetworkTopologyImpl (apache#7133)
3e9cdb6 HDDS-11378. Allow disabling OM version-specific feature via config (apache#7129)
23f3e5b HDDS-11152. OMDoubleBuffer error when handling snapshot's background operations (apache#7112)
5659b7e HDDS-11375. DN startup fails due to illegal configuration of raft.grpc.message.size.max (apache#7128)
41d8147 HDDS-11368. Remove dependency on Babel in Vite (apache#7119)
0bd8ba1 HDDS-11372. No coverage for org.apache.ozone packages (apache#7124)
3bd237d HDDS-11325. (addendum) Intermittent failure in TestBlockOutputStreamWithFailures#testContainerClose (apache#7121)
8306290 HDDS-11373. Log for EC reconstruction command lists the missing indexes as ASCII control characters (apache#7123)
dab1538 HDDS-11216. Replace HAUtils#buildCAX509List usages with other direct usages (apache#6981)
51a5fb9 Revert "HDDS-11235. Spare InfoBucket RPC call in FileSystem#mkdir() call. (apache#6990)" (apache#7122)
fab56b4 HDDS-11229. Chain optionals in Recon Insight (apache#7064)
2236041 HDDS-11365. Fix the NOTICE file (apache#7120)
2e30dc1 HDDS-11190. Add --fields option to ldb scan command (apache#6976)
0b75cb0 HDDS-11251. Deprecate definitions and remove listTrash and recoverTrash APIs (apache#7060)
be34303 HDDS-9198. Maintain local cache in OMSnapshotPurgeRequest to get updated snapshotInfo and pass the same to OMSnapshotPurgeResponse (apache#7045)
c07b408 HDDS-11208. Change RatisBlockOutputStream to use HDDS-11174. (apache#7072)
8f8d809 HDDS-11309. Increase CONTAINER_STATE Column Length in UNHEALTHY_CONTAINERS to Avoid Truncation (apache#7071)
350a340 HDDS-11364. Bump jgraphx to 3.9.12 (apache#7116)
45b7056 HDDS-11363. Bump develocity-maven-extension to 1.22 (apache#7115)
9dd18f1 HDDS-11362. Bump snappy-java to 1.1.10.6 (apache#7114)
637cb91 HDDS-11361. Bump Jersey2 to 2.44 (apache#7113)

Conflicts:
hadoop-hdds/container-service/src/main/java/org/apache/hadoop/ozone/container/keyvalue/KeyValueHandler.java
hadoop-hdds/container-service/src/main/java/org/apache/hadoop/ozone/container/ozoneimpl/OzoneContainer.java
hadoop-hdds/container-service/src/test/java/org/apache/hadoop/ozone/container/keyvalue/TestKeyValueHandlerWithUnhealthyContainer.java
hadoop-ozone/dist/src/main/smoketest/admincli/container.robot

Modified during conflict:
hadoop-hdds/container-service/src/main/java/org/apache/hadoop/ozone/container/checksum/DNContainerOperationClient.java
hadoop-hdds/container-service/src/main/java/org/apache/hadoop/ozone/container/checksum/ReconcileContainerTask.java
hadoop-ozone/integration-test/src/test/java/org/apache/hadoop/ozone/client/rpc/TestECKeyOutputStream.java
aswinshakil added a commit to aswinshakil/ozone that referenced this pull request Dec 19, 2024
…239-container-reconciliation-merge

Commits:
0066526 HDDS-11869. Enable OM Ratis in TestOzoneDelegationTokenSecretManager (apache#7594)
4fe166d HDDS-11957. Make breadcrumb scrollable for long path names in DiskUsage page (apache#7590)
a523e17 HDDS-11846. [Recon] Recon Schema version_number column is always set as -1. (apache#7554)
f5ff2f0 HDDS-11868. Enable OM Ratis in TestQuotaRepairTask (apache#7593)
3a0e3b5 HDDS-11845. Extract k8s definitions for HttpFS and Recon from getting-started example (apache#7523)
6e0c753 HDDS-11509. logging improvements for directory deletion (apache#7314)
1f29e05 HDDS-11934. Split compat suite to old/new (apache#7578)
bde8cf4 HDDS-11759. Remove LegacyReplicationManager (apache#7580)
a27e4ec HDDS-11779. Add DN metrics to show deletion progress (apache#7552)
976e45f HDDS-11711. Add SCM metrics for delete commands sent and response received per datanode (apache#7522)
c28e16e HDDS-11950. Enable sortpom in dev-support module. (apache#7586)
dae388b HDDS-11907. OzoneSecretKey does not need to implement Writable (apache#7574)
8bb0587 HDDS-11712. Process other DeletedBlocksTransaction before retrying failed one. (apache#7532)
3648b59 HDDS-11906. Add sortpom dependency, sort root POM. (apache#7555)
54f0272 HDDS-11807. Make callId different for each request in openKeyCleanupService (apache#7551)
c523825 HDDS-11926 - Rename bucket name for bucket info/ls for linked buckets (apache#7581)
daf2f9f HDDS-11863. Speed up TestFSORepairTool (apache#7561)
f5e5493 HDDS-11927. Fix flaky TestContainerBalancerStatusInfo.testGetCurrentStatisticsWhileBalancingInProgress (apache#7579)
bef2415 HDDS-11940. Bump jline to 3.28.0 (apache#7576)
1453fd9 HDDS-11935. Bump develocity-maven-extension to 1.23 (apache#7577)
202b0c7 HDDS-11860. Improve BufferUtils.writeFully. (apache#7564)
008f9a6 HDDS-11852. Reduce duplication in some GenericCli subclasses (apache#7553)
7a46080 HDDS-11914. Snapshot diff should not filter SST Files based by reading SST file reader (apache#7563)
1835326 HDDS-11927. Mark testGetCurrentStatisticsWhileBalancingInProgress as flaky
66ccc25 HDDS-11908. Snapshot diff DAG traversal should not skip node based on prefix presence (apache#7567)
16ba289 Revert "HDDS-11413. PipelineManagerImpl lock optimization reduces AllocateBlock latency (apache#7160)"
bf6f323 HDDS-11413. PipelineManagerImpl lock optimization reduces AllocateBlock latency (apache#7160)
853d657 HDDS-11893. Fix full snapshot diff fallback logic because of DAG pruning (apache#7549)
b5d04e2 HDDS-11915. Netty OpenSsl not available in acceptance tests on arm64 (apache#7570)
8536490 HDDS-11367. Fix flaky balancer robot test (apache#7569)
745ed1c HDDS-11367. Improve ozone balancing status command output (apache#7139)
6b9cbe0 HDDS-11909. Intermittent timeout building Hadoop in s3a test (apache#7559)
eea5600 HDDS-11911. Return consistent error code when snapshot is not found in the DB or Snapshot Chain. (apache#7557)
e8f3b25 HDDS-11873. Skip old-only xcompat read tests (apache#7534)
ec348a7 HDDS-11889. Include Maven dependencies for hdds-rocks-native in cache (apache#7546)
aa37ae8 HDDS-11885. Download Hadoop for S3A test from mirrors if available (apache#7545)
befd64e HDDS-11694. Safemode Improvement: Introduce factory class to create safemode rules. (apache#7433)
a46153d HDDS-11872. Disable Apache snapshots repo (apache#7536)
092b000 HDDS-11890. Update project description in GitHub (apache#7547)
80c6446 HDDS-8101. Add tool to repair broken FSO tree (apache#7368)
23197e2 HDDS-11605. Directory deletion service should support multiple threads (apache#7349)
055b13c HDDS-11751. Use Java 21 in CI (apache#7458)
d0d82c5 HDDS-11886. Bump license-maven-plugin to 2.5.0 (apache#7539)
f7fe30a HDDS-11691. Support object tags in ObjectEndpointStreaming#put (apache#7543)
9854591 HDDS-11882. Make BOM, not aggregate one (apache#7544)
af345b2 HDDS-11877. Enable Maven cache for more checks (apache#7538)
51c6ed6 HDDS-11830. Subcommands should not extend GenericCli .(apache#7537)
959a39d HDDS-11851. Finer-grained subcommand interface for OzoneDebug and OzoneRepair. (apache#7526)
d4c41e5 HDDS-11334. Improve EC xcompat acceptance test (apache#7492)
8526f2e HDDS-11826. Interactive mode for ozone shell. (apache#7515)
fc63710 HDDS-11833. Return NotImplemented for S3 put-object-acl request. (apache#7531)
e8ad7ad HDDS-11782. ozone debug ldb --with-keys defaults to false instead of true (apache#7521)
cb0a402 HDDS-11859. Remove mention of fuse from s3 interface docs page (apache#7530)
e17f92c HDDS-10821. Ensure ChunkBuffer fully writes buffer to FileChannel (apache#6652)
d66c088 HDDS-11848. Serialization bug in Recon listKeys API (apache#7524)
faab1e8 HDDS-11656. Default native ACL limits to user and user's primary group (apache#7455)
8a1967e HDDS-11719. Remove dependency on server components from ozonefs-common (apache#7438)
9fcecc1 HDDS-11855. Mark TestContainerBalancerDatanodeNodeLimit#checkIterationResultException as flaky
65df308 HDDS-11794. Display HostName in OM / SCM Overview. (apache#7482)
0bde3a2 HDDS-11410. Refactoring more tests from TestContainerBalancerTask (apache#7156)
98e070e HDDS-11728. Refactor subcommand layouts of ozone debug and repair (apache#7489)
1c5676b HDDS-11849. Mark TestBlockOutputStreamWithFailures.test2DatanodesFailure as flaky
9dd8bb9 HDDS-11847. Mark TestSnapshotDeletingServiceIntegrationTest#testParallelExcecutionOfKeyDeletionAndSnapshotDeletion as flaky
b27714f HDDS-11266. Update proto.lock for Ozone 1.4.1 (apache#7504)
9b61937 HDDS-11806. Add HttpFS and Recon in getting-started k8s example (apache#7485)
77ce962 HDDS-10568. When the ldb command is executed, it is output by line (apache#7467)
69538b0 HDDS-11687. Robot warning: replace "is not" with "!=" (apache#7516)
b60d897 HDDS-11820. Create test principals at test run time (apache#7507)
6ba309a HDDS-11831. Finer-grained interface for dynamically registered subcommands (apache#7514)
db36e39 HDDS-11822. Register subcommands in OzoneShell (apache#7513)
e6bd3f5 HDDS-11829. Bump zstd-jni to 1.5.6-8 (apache#7510)
09a4a90 HDDS-11827. Bump exec-maven-plugin to 3.5.0 (apache#7512)
ad867bc HDDS-11824. Bump sqlite-jdbc to 3.47.1.0 (apache#7511)
850306d HDDS-11823. Bump cyclonedx-maven-plugin to 2.9.1 (apache#7508)
34f9d9e HDDS-11742. Update metrics with leaderId if known when starting SCM (apache#7471)
2c6c116 HDDS-11265. Add Ozone 1.4.1 to compatibility acceptance tests (apache#7503)
ebcdc6a HDDS-11810. Secure acceptance test on arm64 fails with LoginException: Checksum failed (apache#7498)
7f40624 HDDS-11821. Mark TestECKeyOutputStream#testECKeyCreatetWithDatanodeIdChange as unhealthy
9b26156 HDDS-11811. rocksdbjni deleted on exit could be used by other components apache#7493
3f92663 HDDS-11785. DataNode aborts ContainerStateMachine if it does not know any follower next index (apache#7480)
f0a2c87 HDDS-11773. Prevent frequent DataNode Ratis snapshotting. (apache#7473)
1383c18 HDDS-11718. Some CI checks passing despite error (apache#7483)
f98eac2 HDDS-11561. Refactor Open Key Search Endpoint and Consolidate with OmDBInsightEndpoint Using StartPrefix Parameter. (apache#7336)
a99ab27 HDDS-11243. SCM SafeModeRule Support EC. (apache#7008)
6871547 HDDS-11716. Address Incomplete Upgrade Scenario in Recon Upgrade Framework (apache#7452)
9bc9145 HDDS-10411. Support incremental ChunkBuffer checksum calculation (apache#7189)
579a38e HDDS-11723. Tool to better micro benchmark hbase performance in Ozone (apache#7463)
cc1a374 HDDS-11704. Hadoop test leaves running containers in case of failure (apache#7435)
c4b2056 Update documentation to mention that container schemaV3 is default (apache#7481)
20c4cfa HDDS-11386. Multithreading bug in ContainerBalancerTask (apache#7339)
b090312 HDDS-11780. Increase client write retry when SCM is in safe mode (apache#7470)
f8e4db9 HDDS-11793. Bump maven-checkstyle-plugin to 3.6.0 (apache#7476)
6a1ff84 HDDS-11791. Bump commons-io to 2.18.0 (apache#7478)
d7f8235 HDDS-11788. Bump log4j2 to 2.24.2 (apache#7479)
6547de7 HDDS-11790. Bump commons-lang3 to 3.17.0 (apache#7475)
1d8abd6 HDDS-11789. Bump zstd-jni to 1.5.6-7 (apache#7477)
6ca7230 HDDS-11769. Add tools folder into ozone src package. (apache#7466)
3b8ed58 HDDS-11682. Bump maven-resources-plugin to 3.3.0 (apache#7384)
f4a9ee0 HDDS-11702. Merge test_bucket_encryption into robot compatibility test (apache#7451)
d6a5488 HDDS-11713. Use seek to reach the start transaction. (apache#7460)
d52615a HDDS-11733. Remove okio versioning and unused okhttp dependency (apache#7447)
1a49991 HDDS-11617. Update hadoop to 3.4.1 (apache#7376)
9945de6 HDDS-11667. Validating DatanodeID on any request to the datanode (apache#7418)
fc6a2ea HDDS-11650. ContainerId list to track all containers created in a datanode (apache#7402)
a8db9cd HDDS-11749. Extract moveToTrash implementation to client code (apache#7453)
3ba3474 HDDS-11755. mktemp --suffix does not work on Mac (apache#7457)
433c7bb HDDS-11729. Update skipRecon property to skip only frontend build (apache#7454)
6b40003 HDDS-11739. Extract generic unmarshaller for S3 requests (apache#7449)
c7f65e7 HDDS-11740. Add debug command to show internal component versions (apache#7450)
0f7104e HDDS-11708. Recon ListKeys API should return a proper http response status code if NSSummary rebuild is in progress. (apache#7437)
0e0d5e9 HDDS-11163. Improve Heatmap page UI (apache#7420)
2cef393 HDDS-11696. Limit max number of entries in list keys/status response (apache#7431)
e96e314 HDDS-11697. Integrate Ozone Filesystem Implementation with Ozone ListStatusLight API (apache#7440)
ebcbce7 HDDS-11644. Close OMLayoutVersionManager (apache#7445)
20e4969 HDDS-11737. UnsupportedOperationException in S3 setBucketAcl (apache#7448)
b252181 HDDS-10804. Include only limited set of ports in Pipeline proto (apache#6655)
79ca956 HDDS-8829. Symmetric Keys for Delegation Tokens (apache#7394)
3e798e6 HDDS-11698. Use hadoop images from GitHub in CI (apache#7432)
3e278b7 HDDS-10655. Support PutObjectTagging, GetObjectTagging, and DeleteObjectTagging (apache#6756)
036e727 HDDS-11732. Fix ACL check on bucket resolution while reading from snapshot (apache#7446)
dbda703 HDDS-11736. Bump maven-javadoc-plugin to 3.11.1 (apache#7444)
238f232 HDDS-11692. Skip spotbugs for modules with only generated code. (apache#7428)
f60ad61 HDDS-11705. Snapshot operations on linked buckets should work on actual underlying bucket (apache#7434)
dd22dbe HDDS-11615. Add Upgrade Action for Initial Schema Constraints for Unhealthy Container Table in Recon. (apache#7372)
4066c7c HDDS-117. Add convenience methods for port management in DatanodeDetails (apache#7408)
12419fa HDDS-11695. SCM follower should not log NotLeaderException during Pipeline Report processing. (apache#7430)
5275ded HDDS-10133. Add a method to check key name in OMKeyRequest (apache#6012)
fd5c6d8 HDDS-11689. Extract scheduled workflow for populate-cache (apache#7429)
889ba80 HDDS-11653. Bump Ratis to 3.1.2 (apache#7427)
47ec4dd HDDS-11671. Refer to website for supported versions (apache#7412)
10cac80 HDDS-11686. Use ozone image from GitHub in CI (apache#7425)
aa6da3e HDDS-9781. Limited maxOpenFiles, disabled enableCompactionDag, and createCheckpointDirs when creating OMMetadataManager instance for bootstrapping (apache#7095)
9dd6a83 HDDS-11645. Mark TestReconScmSnapshot#testExplicitRemovalOfNode as flaky
8e617dc HDDS-11672. Mark TestSnapshotBackgroundServices#testCompactionLogBackgroundService as flaky
d09e6d4 HDDS-11646. Mark TestXceiverClientMetrics#testMetrics as flaky
8e4a508 HDDS-11668. Recon List Keys API: Reuse key prefix if parentID is the same (apache#7410)
ee63232 HDDS-11684. Remove suppression of HiddenField (apache#7423)
a33d8a3 HDDS-10166. Replace GenericTestUtils temporary directories with `@TempDir` (apache#7399)
3a18a9d HDDS-11664. Hadoop download failure not reported as error (apache#7421)
47c2409 HDDS-64. OzoneClientException should extend IOException. (apache#7403)
27fcd0c HDDS-11685. Use ozone-testkrb5 from GitHub (apache#7424)
5663971 HDDS-11665. Minor optimizations on the write path (apache#7407)
cb81f0c HDDS-11683. Skip shade in most integration checks (apache#7422)
952e0ec HDDS-11681. Bump Bouncy Castle to 1.79 (apache#7387)
2797c45 HDDS-11677. Bump sqlite-jdbc to 3.47.0.0 (apache#7413)
358534b HDDS-11675. Bump maven-site-plugin to 3.21.0 (apache#7414)
cf79245 HDDS-11674. Bump junit to 5.11.3 (apache#7415)
6dd566f HDDS-11583. Use ozone-runner from GitHub in CI (apache#7409)
ef2bf98 HDDS-11669. In OmUtils.normalizeKey isDebugEnabled should be evaluated first (apache#7411)
a7e3014 HDDS-11660. Recon List Key API: Reduce object creation and buffering memory (apache#7405)
5d18b9c HDDS-11659. Improve HSync compatibility test (apache#7404)
4e603aa HDDS-11462. Enhancing DataNode I/O Monitoring Capabilities. (apache#7206)
18f6e8a HDDS-11311. Added Compatibility test for HSync (apache#7400)
0415c0b HDDS-11649. Recon ListKeys API: Simplify filter predicates (apache#7395)
2547ac0 HDDS-11652. Fix SCM start command in SCM-HA doc (apache#7398)
c045839 HDDS-11623. Improve OM Ratis Configuration change log message (apache#7388)
2b1524b HDDS-11609. Switch to Recon v2 UI as the default UI (apache#7358)
efe5892 HDDS-11641. Allow testing Hadoop with custom docker images (apache#7393)
c055036 HDDS-11637. Compile failure is ignored in build check (apache#7389)
67e5261 HDDS-11563. Display OM/SCM service ID as Namespace in web UI (apache#7321)
0fb5e50 HDDS-11587. Ozone Manager not processing file put requests with multi-tenancy enabled (apache#7316)
786bb49 HDDS-11642. MutableQuantiles should be stopped (apache#7392)
76ec9b9 HDDS-11639. Upgrade ozone-runner to Rocky Linux 9.3 (apache#7391)
3bc3b8a HDDS-11621. Fix missing HADOOP_ variables in MR acceptance test (apache#7375)
6f9db61 HDDS-11200. Hsync client-side metrics (apache#7371)
58d1443 HDDS-10240. Cleanup zero-copy EC (apache#7340)
5b065d8 HDDS-11638. Bump cyclonedx-maven-plugin to 2.9.0 (apache#7383)
c7a196f HDDS-11635. Memory leak when using Ozone FS via Hadoop FileContext API (apache#7382)
c9956a1 HDDS-11601. Intermittent failure in acceptance balancer test. (apache#7343)
a737fc3 HDDS-11619. Remove dependency on hadoop-shaded-guava (apache#7373)
c4d6857 HDDS-11584. Document ozone debug ldb command (apache#7313)
dded26e HDDS-11588. Add main artifact jar to classpath file (apache#7324)
afed6d9 HDDS-11558. Make OM client retry idempotent (apache#7329)
e85b32d HDDS-11591. Copy dependencies when building each module (apache#7325)
72e56d7 HDDS-11601. Disable flaky EC balancer acceptance test
ab16cbe HDDS-11507. Add error information to log while handling ServiceException. (apache#7367)
980b960 HDDS-11380. Make node decommission error message more comprehensive (apache#7155)
61c094f HDDS-11614. Speed up TestTransferLeadershipShell (apache#7370)
91188b3 HDDS-11352. Remove Flaky annotation from TestOzoneManagerHAWithStoppedNodes using Ratis 3.1.1
faf133d HDDS-11220. Initialize block length using the chunk list from DataNode before seek (apache#7221)
91d41a0 HDDS-11465. Introducing Schema Versioning for Recon to Handle Fresh Installs and Upgrades. (apache#7213)
7a27db2 HDDS-11134. Create compatibility test for FSO bucket usage (apache#7350)
30906d1 HDDS-11612. Bump jnr-posix to 3.1.20 (apache#7360)
bed4aef HDDS-11611. Bump docker-maven-plugin to 0.45.1 (apache#7362)
e2c3d57 HDDS-11610. Bump maven-dependency-plugin to 3.8.1 (apache#7361)
24c1000 HDDS-11041. Add admin request filter for S3 requests and UGI support for GrpcOmTransport (apache#7268)
0b84998 HDDS-11594. Update batchPut buffer log for rocksdb. (apache#7356)
c013516 HDDS-11608. Client should not retry invalid protobuf request (apache#7354)
32a8c09 HDDS-11160. Improve Insights page UI (apache#7327)
782ad62 HDDS-11388. Fix unnecessary call to the DB for ContainerBalancer#getBalancerStatusInfo (apache#7224)
35b6a3a HDDS-11600. Intermittent failure in repro due to ordering differences in builddef.lst (apache#7342)
e7bf154 HDDS-11132. Revert client version bump done as part of HDDS-10983 (apache#7348)
ea5cbff HDDS-11602. Bump ozone-runner to 20241022-jdk17-1 (apache#7347)
3f98df5 HDDS-11580. Validate 'hdds.datanode.dir.du.reserved' property (apache#7328)
8568075 HDDS-11570. Fix HDDS Docs build failure with Hugo v0.135.0 (apache#7337)
721ae58 HDDS-11057. Enable reproducible builds (apache#6856)
86b7aae HDDS-11205. Implement a search feature for users to locate keys pending Deletion within the OM Deleted Keys Insights section (apache#6969)
f7b428d HDDS-11503. Add Robot test to verify Container Balancer for EC containers. (apache#7311)
85eb89b HDDS-11483. Make s3g object get and put operation buffer configurable (apache#7233)
515977a HDDS-11582. Bump body-parser to 1.20.3 (apache#7307)
9b66267 HDDS-11589. ReconSCMDBDefinition should be singleton. (apache#7323)
f784a84 HDDS-11578. Unify constants for RATIS_SNAPSHOT_DIR (apache#7310)
4670a5e HDDS-11498. Improve SCM deletion efficiency. (apache#7249)
3fb2cf0 HDDS-11108. Extract keywords for multipart upload tests (apache#7318)
4b24aa9 HDDS-11545. [UI] Add OM and SCM ID information (apache#7287)
860e269 HDDS-11538. Let coverage report link to java sources (apache#7280)
2139367 HDDS-11581. Remove duplicate ContainerStateMachine#RaftGroupId (apache#7312)
64e035d HDDS-11573. Remove lib/gson-2.10.1.jar (apache#7309)
ce07a3c HDDS-11456. Require successful dependency/licence checks for acceptance/compile/kubernetes (apache#7209)
c579d06 HDDS-11574. Ozone client leak in TestS3SDKV1 (apache#7308)
c044b79 HDDS-10390. MiniOzoneCluster to support S3 Gateway (apache#7281)
8eef589 HDDS-11557. Simplify DBColumnFamilyDefinition. (apache#7298)
4c77f6b HDDS-11562. Parameterize TestSCMNodeManager#testProcessLayoutVersion (apache#7300)
b51c4b3 HDDS-11572. Bump commons-io to 2.17.0 (apache#7305)
3a37870 HDDS-11571. Bump log4j2 to 2.24.1 (apache#7301)
494798c HDDS-11564. Mark TestBlockOutputStream#testWriteExactly... as flaky
fabf512 HDDS-11569. Bump restrict-imports-enforcer-rule from 2.5.0 to 2.6.0 (apache#7303)
1e62a0a HDDS-11568. Bump commons-codec to 1.17.1 (apache#7304)
e9f92a7 HDDS-11567. Bump common-custom-user-data-maven-extension to 2.0.1 (apache#7302)
cb44d5e HDDS-11555. SCMDBDefinition should be singleton. (apache#7296)
d473134 HDDS-11486. Reduce log level for NativeLibraryNotLoadedException in SnapshotDiffManager (apache#7290)
3348d91 HDDS-11564. Mark TestBlockOutputStream as flaky
e2f2aeb HDDS-11548. Add some logging to the StateMachine (apache#7291)
523c860 HDDS-11439. De-duplicate code for ReplicatedFileChecksumHelper and ECFileChecksumHelper (apache#7264)
05a409e HDDS-11519. Clean up unused lines in BlockOutputStream
ffe7198 HDDS-11544. Improve work with arrays (apache#7286)
5657604 HDDS-11556. Add a getTypeClass method to Codec. (apache#7295)
256aad9 HDDS-11546. Add regex operation to filter option of ldb scan command. (apache#7289)
7ef7de2 HDDS-11482. EC Checksum throws IllegalArgumentException because the buffer limit is negative (apache#7230)
77c17df HDDS-11551. Provide details about integration check failure (apache#7294)
911a583 HDDS-8188. Support max allowed length in response of ozone admin container list (apache#7181)
7f2e0e3 HDDS-11554. OMDBDefinition should be singleton. (apache#7292)
170761c HDDS-11547. Make MAVEN_OPTS optional (apache#7288)
4846e97 HDDS-11543. Track OzoneClient object leaks via LeakDetector framework. (apache#7285)
e00f7ae HDDS-11159. Improve Containers page UI (apache#7267)
cfda951 HDDS-11520. Fix Delete pending directories key mapping (apache#7269)
2e3de8a HDDS-11476. Implement lesser/greater operation for --filter option of ldb scan command (apache#7222)
06ccdb3 HDDS-11526. Fix hdds.datanode.metadata.rocksdb.cache.size default value mismatch (apache#7284)
b3afaec HDDS-11535. Incomplete SCM roles table header (apache#7278)
ed2a073 HDDS-11536. Bump macOS runner version to macos-13 (apache#7279)
1887f83 HDDS-11537. Bump frontend-maven-plugin to 1.15.1 (apache#7276)
1f1e618 HDDS-6776. Cleanup TestSCMSafeModeManager (apache#7272)
4bee3e9 HDDS-11534. Bump cyclonedx-maven-plugin to 2.8.2 (apache#7277)
789fb53 HDDS-11533. Bump maven-gpg-plugin to 3.2.7 (apache#7275)
eb26677 HDDS-11268. Add --table mode for OM/SCM Roles CLI (apache#7016)
28ea480 HDDS-11527. Avoid unnecessary duplicate build (apache#7270)
30da31f HDDS-3498. Shutdown datanode if address is already in use (apache#7256)
2401d27 HDDS-11046. Coverage decreased due to running tests with Java 17 (apache#7263)
78d8418 HDDS-11524. Bump snappy-java to 1.1.10.7 (apache#7202)
8747c0e HDDS-11518. Recon OmDB Insights show isKey=true for directories (apache#7260)
5d2bbc3 HDDS-11480. Refactor OM volume response tests (apache#7265)
31f9f2c HDDS-11517. Update version to 2.0.0-SNAPSHOT (apache#7258)
a0f0872 HDDS-11444. Make Datanode Command metrics consistent across all commands (apache#7191)
d3b63c6 HDDS-11492. Directory deletion get stuck having millions of directory (apache#7254)
f52f0af HDDS-11127. [hsync] Improve test coverage for XceiverClientRatis.java (apache#7225)
360fea5 HDDS-11494. Improve the duration option of freon ombg (apache#7246)
10d3b21 HDDS-11504. Update Ratis to 3.1.1. (apache#7257)
ce46297 HDDS-11162. Improve Disk Usage page UI (apache#7214)
c91f1c7 HDDS-11491. Avoid sharing clientId among deleting services (apache#7250)
b0943d5 HDDS-11501. Improve logging in XceiverServerRatis (apache#7252)
55925ab HDDS-11502. Class path contains multiple SLF4J providers (apache#7255)
d0ad836 HDDS-11472. Avoid recreating external access authorizer on OM state reload (apache#7238)
254db9e HDDS-11500. RootCARotationManager cancelling wrong task in notifyStatusChanged (apache#7251)
1e6e4b3 HDDS-11499. Remove redundant code from ECReconstructionCoordinator. (apache#7248)
adb2821 HDDS-11490. Bump rollup to 3.29.5 (apache#7232)
189a9fe HDDS-11484. Validate javadoc in CI (apache#7245)
64a29c6 HDDS-11497. Bump commons-configuration2 to 2.11.0 (apache#7242)
95cfadd HDDS-11496. Bump maven-install-plugin to 3.1.3 (apache#7244)
0a999cf HDDS-11493. Bump sqlite-jdbc to 3.46.1.3 (apache#7243)
a214a31 HDDS-11329. Update Ozone images to Rocky Linux-based runner (apache#7241)
56ddb85 HDDS-11371. Handle cases where OM does not have getServerDefaults() implemented. (apache#7130)
b5097c7 HDDS-11347. Add rocks_tools_native lib check in Ozone CLI checknative subcommand (apache#7101)
fb0bf77 HDDS-11489. Bump maven-site-plugin to 3.20.0 (apache#7226)
70e6e40 HDDS-11122. Fix javadoc warnings (apache#7234)
acf3fdc HDDS-11458. Selective checks: trigger checkstyle for properties file changes (apache#7196)
6b87207 HDDS-11469. Statistics of Pipeline and Container (apache#7217)
1b8468b HDDS-11411. Snapshot garbage collection should not run when the keys are moved from a deleted snapshot to the next snapshot in the chain (apache#7193)
1f86ce8 HDDS-10617. Unexpected number of files in ITestS3AContractGetFileStatusV1List (apache#7208)
73a3bcc HDDS-11467. Bump vite to 4.5.5 (apache#7212)
d45aa1d HDDS-11460. Bump express to 4.21.0 (apache#7197)
e2e30b8 HDDS-11354. Intermittent failure in TestOzoneManagerSnapshotAcl#testLookupKeyWithNotAllowedUserForPrefixAcl (apache#7205)
0fcb645 HDDS-11477. [doc] Add configuration description for datanode docs (apache#7223)
3598ee3 HDDS-11464. Removed unused constants from OzoneConsts. (apache#7207)
8c0b54e HDDS-11408. Snapshot rename table entries are propagated incorrectly on snapshot deletes (apache#7200)
719bdf9 HDDS-11396. NPE due to empty Handler#clusterId (apache#7145)
40c4001 HDDS-10479. Add ozone admin ratis local raftMetaConf (apache#7170)
45f9138 HDDS-11394. Fix pipeline close --all command (apache#7138)
2b196d1 HDDS-11468. Enabled DB sync button (apache#7216)
d3899d2 Clean up files created after TestKeyValueHandlerWithUnhealthyContainer#testMarkContainerUnhealthyInFailedVolume (apache#7219)
70b8dd5 HDDS-11157. Improve Datanodes page UI (apache#7168)
151709a HDDS-11446. Downgrade picocli to 4.7.5 due to regression (apache#7215)
7a26aff HDDS-11158. Improve Pipelines page UI (apache#7171)
c365aa0 HDDS-11181. Cleanup of unnecessary try-catch blocks (apache#7210)
88dd436 HDDS-11423. Implement equals operation for --filter option to ozone ldb scan (apache#7167)
e0060a8 HDDS-11196. Improve SCM WebUI Display (apache#6960)
22ddfb9 Revert "HDDS-11456. Require successful dependency/licence checks for acceptance/compile/kubernetes (apache#7192)"
9f5bf43 HDDS-11457. Internal error on S3 CompleteMultipartUpload if parts are not specified (apache#7195)
10c47a1 HDDS-11459. Bump develocity-maven-extension to 1.22.1 (apache#7201)
50f2563 HDDS-11419. Fix waitForCheckpointDirectoryExist log message (apache#7199)
a7d7e37 HDDS-11456. Require successful dependency/licence checks for acceptance/compile/kubernetes (apache#7192)
5feb9ea HDDS-11453. OmSnapshotPurge should be in a different ozone manager double buffer batch (apache#7188)
703c4d5 HDDS-10984. Tool to restore SCM certificates from RocksDB. (apache#6781)
d221065 HDDS-11440. Add a lastTransactionInfo field in SnapshotInfo to check for transactions in flight on the snapshot (apache#7179)
e573701 HDDS-11448. Improve documentation in ContainerStateMachine (apache#7183)
0e49f7a HDDS-11449. Remove unnecessary log from client console. (apache#7184)
cd251f2 HDDS-11438. Ensure DataInputBuffer is closed in OMPBHelper#convert (apache#7182)
4b47812 HDDS-11389. Incorrect number of deleted containers shown in Recon UI. (apache#7149)
0915f0b HDDS-10985. EC Reconstruction failed because the size of currentChunks was not equal to checksumBlockDataChunks. (apache#7009)
0f16195 HDDS-11416. refactor ratis submit request avoid code duplicate (apache#7166)
86fe920 HDDS-11376. Improve ReplicationSupervisor to record replication metrics (apache#7140)
883a63f HDDS-11441. ozone sh key put should only accept positive expectedGeneration (apache#7180)
33dbd4a HDDS-11357. Datanode Usageinfo Support Display Pipeline. (apache#7105)
9477aa6 HDDS-11436. Minor update in Recon API handling. (apache#7178)
8ca33c7 HDDS-11414. Key listing for FSO buckets fails with forward client (apache#7161)
f1ebd39 HDDS-11435. Bump sqlite-jdbc to 3.46.1.0 (apache#7174)
aaf8bd0 HDDS-11434. Bump log4j2 to 2.24.0 (apache#7176)
3510ce7 HDDS-11433. Bump Jetty to 9.4.56.v20240826 (apache#7175)
0047cd2 HDDS-11400. Bump maven-core to 3.9.9 (apache#7144)
274da83 HDDS-10488. Datanode OOM due to run out of mmap handler (apache#6690)
7a452ca HDDS-11391. Addendum to fix test failure.
7e1d9b0 HDDS-11145. ozone admin om cancelprepare --service-id improvement (apache#7159)
6888cf2 HDDS-11383. Improve read key dashboard to include add the read key related OM metrics. (apache#7131)
3e0d76c HDDS-11369. [hsync] Remove KeyOutputStreamSemaphore logs (apache#7136)
b23981c HDDS-11342. [hsync] Add a config as HBase-related features master switch (apache#7126)
3e1188a HDDS-11285. cli to trigger quota repair and status (apache#7104)
2e33978 HDDS-11401. Code cleanup in DatanodeStateMachine (apache#7146)
f563d67 HDDS-11391. Frequent Ozone DN Crashes During OM + DN Decommission with Freon (apache#7154)
18b28d2 HDDS-11312. [hsync] Added upgrade tests (apache#7110)
b29beb3 HDDS-11350. NullPointerException thrown on checking container balancer status (apache#7134)
111b9df HDDS-11407. Use OMLayoutFeature.HBASE_SUPPORT for HSYNC (apache#7152)
966b8d0 HDDS-11390. Removed hsync and hflush capability check in ContentGenerator (apache#7153)
877504a HDDS-11156. Improve Buckets page UI (apache#7100)
814f78f HDDS-11392. ChecksumByteBufferImpl's static initializer fails with java 17+ (apache#7135)
b5e1a8b HDDS-11398. Bump commons-compress to 1.27.1 (apache#7142)
a8e3ea9 HDDS-11397. Bump Jersey2 to 2.45 (apache#7141)
5992837 HDDS-11399. Bump maven-deploy-plugin to 3.1.3 (apache#7143)
47564bb HDDS-11359. Intermittent timeout in TestPipelineManagerMXBean#testPipelineInfo (apache#7132)
cc4e026 HDDS-11304. Make up for the missing functionality in CommandDispatcher (apache#7062)
2d372f6 HDDS-11339. Let PrometheusServlet rely on periodically published metrics (apache#7092)
f22c6f8 HDDS-11164. Improve Navbar UI (apache#7088)
23211c1 HDDS-11381. Adding logging for sortByDistanceCost in NetworkTopologyImpl (apache#7133)
3e9cdb6 HDDS-11378. Allow disabling OM version-specific feature via config (apache#7129)
23f3e5b HDDS-11152. OMDoubleBuffer error when handling snapshot's background operations (apache#7112)
5659b7e HDDS-11375. DN startup fails due to illegal configuration of raft.grpc.message.size.max (apache#7128)
41d8147 HDDS-11368. Remove dependency on Babel in Vite (apache#7119)
0bd8ba1 HDDS-11372. No coverage for org.apache.ozone packages (apache#7124)
3bd237d HDDS-11325. (addendum) Intermittent failure in TestBlockOutputStreamWithFailures#testContainerClose (apache#7121)
8306290 HDDS-11373. Log for EC reconstruction command lists the missing indexes as ASCII control characters (apache#7123)
dab1538 HDDS-11216. Replace HAUtils#buildCAX509List usages with other direct usages (apache#6981)
51a5fb9 Revert "HDDS-11235. Spare InfoBucket RPC call in FileSystem#mkdir() call. (apache#6990)" (apache#7122)
fab56b4 HDDS-11229. Chain optionals in Recon Insight (apache#7064)
2236041 HDDS-11365. Fix the NOTICE file (apache#7120)
2e30dc1 HDDS-11190. Add --fields option to ldb scan command (apache#6976)
0b75cb0 HDDS-11251. Deprecate definitions and remove listTrash and recoverTrash APIs (apache#7060)
be34303 HDDS-9198. Maintain local cache in OMSnapshotPurgeRequest to get updated snapshotInfo and pass the same to OMSnapshotPurgeResponse (apache#7045)
c07b408 HDDS-11208. Change RatisBlockOutputStream to use HDDS-11174. (apache#7072)
8f8d809 HDDS-11309. Increase CONTAINER_STATE Column Length in UNHEALTHY_CONTAINERS to Avoid Truncation (apache#7071)
350a340 HDDS-11364. Bump jgraphx to 3.9.12 (apache#7116)
45b7056 HDDS-11363. Bump develocity-maven-extension to 1.22 (apache#7115)
9dd18f1 HDDS-11362. Bump snappy-java to 1.1.10.6 (apache#7114)
637cb91 HDDS-11361. Bump Jersey2 to 2.44 (apache#7113)

Conflicts:
hadoop-hdds/container-service/src/main/java/org/apache/hadoop/ozone/container/keyvalue/KeyValueHandler.java
hadoop-hdds/container-service/src/main/java/org/apache/hadoop/ozone/container/ozoneimpl/OzoneContainer.java
hadoop-hdds/container-service/src/test/java/org/apache/hadoop/ozone/container/keyvalue/TestKeyValueHandlerWithUnhealthyContainer.java
hadoop-ozone/dist/src/main/smoketest/admincli/container.robot

Modified during conflict:
hadoop-hdds/container-service/src/main/java/org/apache/hadoop/ozone/container/checksum/DNContainerOperationClient.java
hadoop-hdds/container-service/src/main/java/org/apache/hadoop/ozone/container/checksum/ReconcileContainerTask.java
hadoop-ozone/integration-test/src/test/java/org/apache/hadoop/ozone/client/rpc/TestECKeyOutputStream.java
swamirishi pushed a commit to swamirishi/ozone that referenced this pull request Dec 3, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants