
Conversation

@bshashikant
Contributor

What changes were proposed in this pull request?

The issue occurs when the client first seeks to an offset and reads, then seeks to a different offset and reads again, with the two reads covering an overlapping set of chunks. After a seek, the chunkPosition inside each BlockInputStream is not reset to 0. The chunk that the seek offset falls into is read correctly, but every subsequent chunk reports its remaining length as 0, so all reads for those chunks return no data. The fix is to reset the position of all subsequent chunks, across all subsequent blocks, to 0 after a seek, so that reading starts from the beginning of each of those chunks.
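As an illustration only, here is a minimal, self-contained sketch of that idea; the classes and fields below are simplified stand-ins and do not reflect Ozone's actual KeyInputStream/BlockInputStream APIs. After a seek, the stream containing the target offset keeps its in-stream position, while every later stream is reset to 0 so its next read starts from the beginning of its data.

```java
import java.util.ArrayList;
import java.util.List;

/** Simplified stand-in for a per-block stream; 'position' plays the role of chunkPosition. */
class SimpleBlockStream {
  final long length;   // bytes available in this block
  long position;       // current read position inside the block

  SimpleBlockStream(long length) {
    this.length = length;
  }

  void seek(long pos) {
    this.position = pos;
  }
}

/** Simplified stand-in for a key-level stream composed of several block streams. */
class SimpleKeyStream {
  private final List<SimpleBlockStream> blockStreams = new ArrayList<>();
  private int currentIndex = 0;

  void addBlock(long length) {
    blockStreams.add(new SimpleBlockStream(length));
  }

  /** Seek to an absolute offset within the key. */
  void seek(long offset) {
    long remaining = offset;
    for (int i = 0; i < blockStreams.size(); i++) {
      SimpleBlockStream block = blockStreams.get(i);
      if (remaining < block.length) {
        // The target offset falls inside this block.
        block.seek(remaining);
        currentIndex = i;
        // The fix described above: every block after the seek target starts from 0.
        // Without this reset, a stale position left over from an earlier read makes
        // the remaining length of those blocks appear to be 0.
        for (int j = i + 1; j < blockStreams.size(); j++) {
          blockStreams.get(j).seek(0);
        }
        return;
      }
      remaining -= block.length;
    }
    throw new IllegalArgumentException("Offset beyond end of key: " + offset);
  }
}
```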

What is the link to the Apache JIRA

https://issues.apache.org/jira/browse/HDDS-2359

How was this patch tested?

The patch was tested by adding unit tests that reliably reproduce the issue. It was also deployed on the real cluster where the issue was first discovered, and the fix was verified there.

Thanks @fapifta for discovering the issue and helping to verify the fix. Thanks @bharatviswa504 and @hanishakoneru for contributing to the fix.

@hanishakoneru
Contributor

Thank you @bshashikant for working on this.
LGTM. +1 pending CI.

@lokeshj1703
Contributor

The changes look good to me. Can you please verify the test failures? There is a failure in TestKeyInputStream.

@mukul1987
Contributor

/retest

@bshashikant
Contributor Author

Thanks @lokeshj1703 for having a look. The failure in TestKeyInputStream happens during the write: the write chunk request counter does not match the expected value because the request was retried. The test passes when I run it locally, so the failure is not related to the patch itself.

@bharatviswa504
Contributor

bharatviswa504 commented Nov 1, 2019

> Thanks @lokeshj1703 for having a look. The failure in TestKeyInputStream happens during the write: the write chunk request counter does not match the expected value because the request was retried. The test passes when I run it locally, so the failure is not related to the patch itself.

To avoid this flakiness, can we change the check from equality to >= writeChunkCount + 3, so that retries are accounted for? (Or, if we have some way to know whether a retry happened, we could apply the > check only in that case.)
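For illustration, here is a minimal sketch of what such a relaxed check could look like, using a simple in-test counter; the class and method names below are hypothetical and are not the identifiers used in TestKeyInputStream.

```java
import java.util.concurrent.atomic.AtomicLong;

/** Illustrative sketch only: the counter is a stand-in for whatever write-chunk metric the real test reads. */
class WriteChunkAssertionSketch {
  private final AtomicLong writeChunkCount = new AtomicLong();

  void onWriteChunk() {
    // A retried request increments the counter again, which is why an
    // equality check on the final count is flaky.
    writeChunkCount.incrementAndGet();
  }

  void assertAtLeastThreeWriteChunks(long baseline) {
    long observed = writeChunkCount.get() - baseline;
    // >= instead of == so that retried write-chunk requests do not fail the test.
    if (observed < 3) {
      throw new AssertionError("Expected at least 3 write-chunk requests, saw " + observed);
    }
  }
}
```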

@bshashikant
Contributor Author

Thanks @bharatviswa504 for the review. There are multiple flaky client test failures that occur intermittently because of random retries during test execution. Can we address this in a separate JIRA altogether?

@bharatviswa504
Contributor

bharatviswa504 commented Nov 6, 2019

> Thanks @bharatviswa504 for the review. There are multiple flaky client test failures that occur intermittently because of random retries during test execution. Can we address this in a separate JIRA altogether?

Sure. We can open a new Jira to address this.

@bharatviswa504 merged commit 9565cc5 into apache:master Nov 6, 2019
@bharatviswa504
Contributor

Thank you @bshashikant for the contribution and all for the reviews.

ptlrs pushed a commit to ptlrs/ozone that referenced this pull request Mar 8, 2025