Skip to content

Conversation

@swamirishi
Copy link
Contributor

What changes were proposed in this pull request?

Currently while checking file links, the exclude sst file list & files already present in the current tarball is checked in the entries by sequentially iterating through the entries, for each and every file in the om data directory (snapshot directory, active om.db, compaction backup sst file). Now if the exclude list or files present in the tarball is really long order of 1000s and the total number of sst files are in the order of millions, the bootstrap is going to read timeout and might take hours. We need to optimize and not perform this unnecessary iteration to avoid this n^2 operation and do it in O

What is the link to the Apache JIRA

https://issues.apache.org/jira/browse/HDDS-12064

How was this patch tested?

Existing unit tests modification

…le links

Change-Id: I6871db471adc1790ac3a0ff295a4db6eeb7608ad
Change-Id: If6f300d6068c4be2c8da99fdef3ae8495680d5ea
@swamirishi swamirishi marked this pull request as ready for review January 10, 2025 22:52
Copy link
Contributor

@adoroszlai adoroszlai left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks @swamirishi for the patch.

Change-Id: Ic2b623cdb5ea6cbdcfad2b82ebb11bad62caa6d2
Change-Id: I03befbcab5d08add580c44cc7ee52dbfaeb101ba
@swamirishi
Copy link
Contributor Author

@adoroszlai @prashantpogde if you don't have any issues with the patch. Can you give me a +1 for this patch?

Copy link
Contributor

@prashantpogde prashantpogde left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The changes look good to me. Thank you for making these changes @swamirishi

@swamirishi
Copy link
Contributor Author

Thanks for the review on the patch @prashantpogde @adoroszlai

@swamirishi swamirishi merged commit 6c41a9a into apache:master Jan 15, 2025
42 checks passed
@adoroszlai
Copy link
Contributor

@swamirishi please trim extended details when merging:

    HDDS-12064. Optimize bootstrap logic to reduce loop while checking file links (#7676)
    
    * HDDS-12064. Optimize bootstrap logic to reduce loop while checking file links
    
    Change-Id: I6871db471adc1790ac3a0ff295a4db6eeb7608ad
    
    * HDDS-12064. Fix findbugs
    
    Change-Id: If6f300d6068c4be2c8da99fdef3ae8495680d5ea
    
    * HDDS-12064. Address review comments
    
    Change-Id: Ic2b623cdb5ea6cbdcfad2b82ebb11bad62caa6d2
    
    * HDDS-12064. Address review comments
    
    Change-Id: I03befbcab5d08add580c44cc7ee52dbfaeb101ba

@nandakumar131
Copy link
Contributor

@swamirishi please trim extended details when merging:

    HDDS-12064. Optimize bootstrap logic to reduce loop while checking file links (#7676)
    
    * HDDS-12064. Optimize bootstrap logic to reduce loop while checking file links
    
    Change-Id: I6871db471adc1790ac3a0ff295a4db6eeb7608ad
    
    * HDDS-12064. Fix findbugs
    
    Change-Id: If6f300d6068c4be2c8da99fdef3ae8495680d5ea
    
    * HDDS-12064. Address review comments
    
    Change-Id: Ic2b623cdb5ea6cbdcfad2b82ebb11bad62caa6d2
    
    * HDDS-12064. Address review comments
    
    Change-Id: I03befbcab5d08add580c44cc7ee52dbfaeb101ba

+1

ptlrs pushed a commit to ptlrs/ozone that referenced this pull request Mar 8, 2025
…pache#7627) (apache#190)

* CDPD-77427. HDDS-11997. Duplicate snapshot purge request causes NPE (apache#7627)

(cherry picked from commit ad108c8)
Change-Id: I02dae0a1994bafa6dc95bd2a9a38ecb82ac45a07

* CDPD-76953. HDDS-11893. Fix full snapshot diff fallback logic because of DAG pruning (apache#7549)

(cherry picked from commit 853d657)
Change-Id: Idbdcb724c2b2da23614c2be6a4e2a88f7a4f5a71
(cherry picked from commit 4cff8f827a9cfc810f48c1be767963dad428f1c5)

* CDPD-77091. HDDS-11908. Snapshot diff DAG traversal should not skip node based on prefix presence (apache#7567)

(cherry picked from commit 66ccc25)
Change-Id: Id1960cfadc9a983c89b379ccb87a4fa6d2586203
(cherry picked from commit b10b7b0d0a1b4c7fc3d0bbeda457a8ff0a0e2313)

* CDPD-77085. HDDS-11914. Snapshot diff should not filter SST Files based by reading SST file reader (apache#7563)

(cherry picked from commit 7a46080)
Change-Id: I7788f8b962985c12ba2cec9b8cab537a392a5b65
(cherry picked from commit b8601fe948a993632f8028a8947ee53dde6e4a5e)

* CDPD-67962. HDDS-10562. Fix infinite loop in ozone fs -ls /volume (apache#6416)

(cherry picked from commit 4ca8edb)
Change-Id: I015f524950ebf15491bcd8136caa11deea60027a
(cherry picked from commit 206353c44479761c6d416704c1b01aebd66b9922)

* CDPD-77818. HDDS-12064. Optimize bootstrap logic to reduce loop while checking file links (apache#7676)

Change-Id: Ibee016b338abdbb0bee6e62af89cc2560fb8f350

---------

Co-authored-by: Swaminathan Balachandran <[email protected]>
Co-authored-by: Swaminathan Balachandran <[email protected]>
Co-authored-by: Symious <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants