HDDS-13003. [Design Doc] Snapshot Defragmentation to reduce storage footprint #8514
Conversation
Change-Id: Ieb6a1145c732ffbbbc6811565734a78bd12e30ef
Pull Request Overview
This PR introduces a design document explaining the snapshot compaction strategy in Apache Ozone to reduce the storage footprint of snapshots.
- Introduces new metadata fields (e.g., needsCompaction, lastCompactionTime, sstFiles) for managing snapshot compaction.
- Details the proposed workflow for compaction including directory restructuring and diff computation.
Comments suppressed due to low confidence (1)
hadoop-hdds/docs/content/feature/SnapshotCompaction.md:20
- The type `List<Longs>` appears to be inconsistent; if the intention is to store a list of Long values, consider using `List<Long>` for clarity and consistency.
A new list of Map<String, List<Longs>> (`sstFiles`) also needs to be added to snapshot info; this would be storing the original list of sst files in the uncompacted copy of the snapshot corresponding to keyTable/fileTable/DirectoryTable.
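For concreteness, a minimal sketch of what those SnapshotInfo additions could look like (the class name and field types below are illustrative assumptions, not the actual Ozone class; the doc's `List<Longs>` is the inconsistency flagged above):

```java
import java.util.List;
import java.util.Map;

// Hedged sketch only; not the actual Ozone SnapshotInfo class. Field
// types are assumptions based on the design doc's description above.
public class SnapshotInfoCompactionFields {
  // True while the snapshot still awaits a defragmentation pass.
  private boolean needsCompaction;

  // When the snapshot was last compacted (epoch millis assumed).
  private long lastCompactionTime;

  // Original SST files of the uncompacted snapshot copy, keyed by table
  // name (keyTable / fileTable / directoryTable); List<Long> assumed to
  // hold SST file numbers, per the type-consistency comment above.
  private Map<String, List<Long>> sstFiles;
}
```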
jojochuang left a comment:
The Markdown is rendered from the design doc attached to the jira: https://docs.google.com/document/d/1Xw1AtKAlDm97UiLXd8egjeLIaYq4rpClv1xD7x5Xvww/edit?usp=sharing
jojochuang left a comment:
Please check if the following workflow chart looks correct.
Change-Id: I29d688c22cdf29aa55cf2bbce6b750720887f6a0
Change-Id: I9761006b9b9697f8392aab68c01b400793996d06
> 3. ### Directory Structure Changes
>
> Snapshots currently reside in the `db.checkpoints` directory. The proposal introduces a `db.checkpoints.compacted` directory for compacted snapshots. The directory format should be as follows:
clarification
Suggested change:
```diff
- Snapshots currently reside in the `db.checkpoints` directory. The proposal introduces a `db.checkpoints.compacted` directory for compacted snapshots. The directory format should be as follows:
+ Snapshots currently reside in the `db.checkpoints` directory. The proposal introduces a `db.checkpoints.compacted` directory for compacted snapshots. And the OM DB checkpoint directory name format inside the `db.checkpoints.compacted` directory should be as follows:
```
Actually, snapshots are under db.snapshots. e.g.:
```
/var/lib/hadoop-ozone/om/data/db.snapshots
/var/lib/hadoop-ozone/om/data/db.snapshots/diffState
/var/lib/hadoop-ozone/om/data/db.snapshots/diffState/compaction-sst-backup
/var/lib/hadoop-ozone/om/data/db.snapshots/diffState/snapDiff
/var/lib/hadoop-ozone/om/data/db.snapshots/diffState/compaction-log
/var/lib/hadoop-ozone/om/data/db.snapshots/checkpointState
```
Newly created FS snapshots would be under ./db.snapshots/checkpointState
> 5. ### Snapshot Compaction Workflow
>
> Snapshot compaction should only occur once the snapshot has undergone SST filtering. The following steps outline the process:
Thanks @swamirishi for working on this. Does this mean the SstFilteringService would run it?
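Whichever service ends up running it, a minimal sketch of the ordering constraint in the quoted step might look like the following (every name below is an illustrative assumption, not the actual Ozone API):

```java
import java.util.List;

// Hedged sketch: compaction runs strictly after SST filtering. All
// interfaces here are illustrative stand-ins, not Ozone classes.
public class SnapshotCompactionTask implements Runnable {

  /** Minimal stand-in for per-snapshot metadata. */
  interface SnapshotRecord {
    boolean isSstFiltered();   // set by SstFilteringService (getter assumed)
    boolean needsCompaction(); // proposed flag from this design
  }

  /** Minimal stand-in for snapshot metadata access and the compactor. */
  interface SnapshotStore {
    List<SnapshotRecord> listSnapshots();
    void compact(SnapshotRecord snapshot);
  }

  private final SnapshotStore store;

  SnapshotCompactionTask(SnapshotStore store) {
    this.store = store;
  }

  @Override
  public void run() {
    for (SnapshotRecord snap : store.listSnapshots()) {
      // Precondition from the design: skip snapshots that the
      // SstFilteringService has not yet processed.
      if (snap.isSstFiltered() && snap.needsCompaction()) {
        store.compact(snap);
      }
    }
  }
}
```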
I suggest adding a section that summarises how each table is handled during compaction. The information is scattered now; grouping it together might make the doc easier to read.
> 2. **Flush changed objects** into separate SST files using the SST file writer, categorizing them by table type.
> 3. **Ingest these SST files** into the RocksDB checkpoint using the `ingestFile` API.
> 3. Truncate `deletedTable`, `deletedDirectoryTable`, `snapshotRenamedTable`, etc. (all tables except keyTable/fileTable/directoryTable) in the checkpointed RocksDB and ingest the entire `deletedTable` and `deletedDirectoryTable` from the current snapshot RocksDB.
> 4. **Acquire the snapshot cache lock** to prevent snapshot access during directory updates.
Note: Grab write lock as implemented in #8474
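For steps 2 and 3 quoted above, here is a minimal sketch against the RocksDB Java API, assuming Ozone's `ingestFile` wrapper ultimately delegates to `ingestExternalFile`; the method, paths, and handles are illustrative, not Ozone's actual code:

```java
import java.util.Arrays;
import org.rocksdb.ColumnFamilyHandle;
import org.rocksdb.EnvOptions;
import org.rocksdb.IngestExternalFileOptions;
import org.rocksdb.Options;
import org.rocksdb.RocksDB;
import org.rocksdb.RocksDBException;
import org.rocksdb.SstFileWriter;

// Hedged sketch of workflow steps 2-3: write changed entries for one table
// into a fresh SST file, then ingest it into the checkpointed RocksDB.
public final class FlushAndIngestSketch {
  // Note: SstFileWriter requires keys to be added in ascending order.
  static void flushAndIngest(RocksDB checkpointDb, ColumnFamilyHandle table,
      byte[][] sortedKeys, byte[][] values, String sstPath)
      throws RocksDBException {
    try (EnvOptions envOptions = new EnvOptions();
         Options options = new Options();
         SstFileWriter writer = new SstFileWriter(envOptions, options)) {
      writer.open(sstPath);
      for (int i = 0; i < sortedKeys.length; i++) {
        writer.put(sortedKeys[i], values[i]); // step 2: flush changed objects
      }
      writer.finish();
    }
    try (IngestExternalFileOptions ingestOpts =
             new IngestExternalFileOptions()) {
      ingestOpts.setMoveFiles(true); // move, rather than copy, the SST file
      // Step 3: ingest into the checkpoint's column family for this table.
      checkpointDb.ingestExternalFile(table, Arrays.asList(sstPath),
          ingestOpts);
    }
  }
}
```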
Thanks @chungen0126 for reviewing the doc. I will update the doc shortly, capturing this info.
jojochuang left a comment:
What conditions trigger a snapshot compaction? For sure not when there are no snapshots. But does it get triggered when there is one snapshot? Or at a predefined threshold of snapshots / diffs between snapshots?
One potential race condition would be when the snapshot compaction process is ongoing and a new OM node bootstraps and tries to download the DBs. In such a case I believe we have an existing lock in place to prevent that, but we need to double-check.
Co-authored-by: Wei-Chiu Chuang <[email protected]>
I am going to push an update to this design doc, mostly about the change of terms.
This PR has been marked as stale due to 21 days of inactivity. Please comment or remove the stale label to keep it open. Otherwise, it will be automatically closed in 7 days.
Thank you for your contribution. This PR is being closed due to inactivity. If needed, feel free to reopen it.
What changes were proposed in this pull request?
In Apache Ozone, snapshots currently take a checkpoint of the Active Object Store (AOS) RocksDB each time a snapshot is created and track the compaction of SST files over time. This model works efficiently when snapshots are short-lived, as they merely serve as hard links to the AOS RocksDB. However, over time, if an older snapshot persists while significant churn occurs in the AOS RocksDB (due to compactions and writes), the snapshot RocksDB may diverge significantly from both the AOS RocksDB and other snapshot RocksDB instances. This divergence increases storage requirements linearly with the number of snapshots.
The primary inefficiency in the current snapshotting mechanism stems from constant RocksDB compactions in AOS, which can cause a key, file, or directory entry to appear in multiple SST files. Ideally, each unique key, file, or directory entry should reside in only one SST file, eliminating redundant storage and mitigating the multiplier effect caused by snapshots. If implemented correctly, the total RocksDB size would be proportional to the total number of unique keys in the system rather than the number of snapshots.
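As a rough illustration (numbers assumed, not from the doc): with U unique entries and N long-lived snapshots whose SST files have fully diverged from the AOS, worst-case storage approaches N × U entries on disk; if each unique entry instead lives in exactly one shared SST file, the footprint falls back toward U plus the comparatively small per-snapshot deltas.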
What is the link to the Apache JIRA?
https://issues.apache.org/jira/browse/HDDS-13003
How was this patch tested?
No tests required