HDDS-10587. Reset the thread-local MessageDigest instance during exception #6435

ivandika3 · 2024-03-25T10:42:24Z

What changes were proposed in this pull request?

Currently, the MessageDigest instance is a thread local variable (one per S3G Jetty thread). MessageDigest requires the call to either MessageDigest#digest or MessageDigest#reset to reset the digest.

In normal ObjectEndpoint#put flow, MessageDigest#digest is called after the data has been written to the datanodes, before the key is committed. However, if an IOException happens (e.g. EOFException due to client cancelling during the write), the digest will not be reset and remains in the inconsistent state. This will affect the subsequent request that uses the same thread and therefore the ETag generated will be completely different from the md5 hash of the object causing AWS S3 SDK to detect inconsistent hash when downloading the object.

The issue can be replicated using an S3G with a few threads and doing three put-object operations for the same key and same payload. You can set the hadoop.http.max.threads in ozone-site.xml to a small value (e.g. 4) to increase the chance of the same thread handling the request.

1st put-object: cancel the operation before it put-object operation can finish, ensure the EOFException is thrown in the S3Gateway logs
2nd put-object: let the put-object finish. The resulting ETag will not be the same as the md5 digest of the payload (you might need to do this for a few time since the S3G thread might not be the same from the previous call)
3rd put-object: also let the put-object finish. Since the previous put-object reset the digest, the resulting ETag will be correct.

This patch adds a call to MessageDigest#reset in ObjectEndpoint#put to reset the digest in case of exception. Another valid alternative is to call the MessageDigest#reset just after the DigestInputStream initialization.

What is the link to the Apache JIRA

https://issues.apache.org/jira/browse/HDDS-10587

How was this patch tested?

Manual test from Ozone Intellij IDE setup as shown in the description.

Ref: https://cwiki.apache.org/confluence/display/OZONE/Run+Ozone+cluster+from+IDE

Clean CI run: https://github.com/ivandika3/ozone/actions/runs/8421982154

…ption

ivandika3 · 2024-03-25T20:57:59Z

@vtutrinov @myskov Could you help take a look when you have time?

vtutrinov · 2024-03-26T07:58:33Z

@ivandika3 I'd like to see a set of unit tests to check that the digest message will be resetted in case of exception. An example is:

ObjectEndpoint endpoint = spy(...);
try (MockedStatic staticMock = mockStatic(ObjectEndpointStreaming.class)) {
    staticMock.when(() -> ObjectEndpointStreaming.put(...)).thenThrow(IOException.class);
    endpoint.put(...);
    verify(endpoint.getETagProvider()).reset();
}

ivandika3 · 2024-03-26T10:26:40Z

@vtutrinov Thank you for the review and unit test idea. I have added unit tests for put, copy, and MPU part upload cases. PTAL.

vtutrinov · 2024-03-26T11:02:05Z

@ivandika3 thanks for the PR and the unit tests, LGTM, +1

adoroszlai · 2024-03-26T17:38:58Z

Thanks @ivandika3 for the patch, @vtutrinov for the review.

ivandika3 · 2024-03-27T01:13:09Z

Thanks @vtutrinov for the review and @adoroszlai for the merge.

…tion (apache#6435) (cherry picked from commit c6c611f)

ivandika3 added 5 commits March 25, 2024 18:06

HDDS-10587. Reset the thread-local MessageDigest instance during exce…

a01ba8c

…ption

Add comments

b5c3423

Use a new variable for digestInputStream

4acd034

Reset for createMultipartKey

9fb62c0

Unnecessary casting

e8b055c

ivandika3 marked this pull request as ready for review March 25, 2024 20:57

Add unit test to check digest message is reset during exception

d7d3343

ivandika3 mentioned this pull request Mar 26, 2024

HDDS-10574. Improve TestObjectPut #6426

Merged

adoroszlai approved these changes Mar 26, 2024

View reviewed changes

adoroszlai merged commit c6c611f into apache:master Mar 26, 2024

ivandika3 mentioned this pull request Apr 5, 2024

[DO NOT MERGE] Backport some fixes from master to ozone-1.4 #6479

Merged

ivandika3 added the s3 S3 Gateway label Apr 16, 2024

ivandika3 added a commit to ivandika3/ozone that referenced this pull request Apr 18, 2024

HDDS-10587. Reset ETag's thread-local MessageDigest instance on excep…

fc28224

…tion (apache#6435) (cherry picked from commit c6c611f)

ivandika3 mentioned this pull request Apr 19, 2024

[DO NOT MERGE] Backport ETag improvements and bug fixes from master to ozone-1.4 #6557

Merged

ivandika3 self-assigned this Apr 23, 2024

jojochuang pushed a commit to jojochuang/ozone that referenced this pull request May 29, 2024

HDDS-10587. Reset ETag's thread-local MessageDigest instance on excep…

4fe32d2

…tion (apache#6435) (cherry picked from commit c6c611f)

ivandika3 mentioned this pull request Sep 12, 2024

RATIS-2147. Md5 mismatch when snapshot install apache/ratis#1142

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

HDDS-10587. Reset the thread-local MessageDigest instance during exception #6435

HDDS-10587. Reset the thread-local MessageDigest instance during exception #6435

Uh oh!

ivandika3 commented Mar 25, 2024 •

edited

Loading

Uh oh!

ivandika3 commented Mar 25, 2024

Uh oh!

vtutrinov commented Mar 26, 2024

Uh oh!

ivandika3 commented Mar 26, 2024

Uh oh!

vtutrinov commented Mar 26, 2024

Uh oh!

adoroszlai commented Mar 26, 2024

Uh oh!

ivandika3 commented Mar 27, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

HDDS-10587. Reset the thread-local MessageDigest instance during exception #6435

HDDS-10587. Reset the thread-local MessageDigest instance during exception #6435

Uh oh!

Conversation

ivandika3 commented Mar 25, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What changes were proposed in this pull request?

What is the link to the Apache JIRA

How was this patch tested?

Uh oh!

ivandika3 commented Mar 25, 2024

Uh oh!

vtutrinov commented Mar 26, 2024

Uh oh!

ivandika3 commented Mar 26, 2024

Uh oh!

vtutrinov commented Mar 26, 2024

Uh oh!

adoroszlai commented Mar 26, 2024

Uh oh!

ivandika3 commented Mar 27, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

ivandika3 commented Mar 25, 2024 •

edited

Loading