[Storage] retry on incomplete XML responses by jeremymeng · Pull Request #13076 · Azure/azure-sdk-for-js

jeremymeng · 2021-01-05T23:13:44Z

When service times out (default max 30s) it terminates the connection
but current stable version of node_fetch doesn't report
error. Instead it returns the incomplete response which leads to XML
parse error. It's unlikely that service would send back incomplete
response on purpose so it doesn't hurt to treat this error as a
TIMEOUT error and retry the request.

The deserialization policy factory needs to move below retry policy
factory so parse error from deserialization can be retried.

After changing the order of deserialization policy and retry policy,
error.code is now populated properly by deserialization policy. This
surfaces an issue where an error with code ResourceNotFound will
also be retried because it contains eNotFound and we use
error.code.toString().toUpperCase().includes() to see if the error
is in the list. It passed the check for the network error code
ENOUTFOUND. This change fixes it by using exact match when checking
error code.

When service times out (default max 30s) it terminates the connection but current stable version of `node_fetch` doesn't report error. Instead it returns the incomplete response which leads to XML parse error. It's unlikely that service would send back incomplete response on purpose so it doesn't hurt to treat this error as a `TIMEOUT` error and retry the request. The deserialization policy factory needs to move below retry policy factory so parse error from deserialization can be retried.

jeremymeng · 2021-01-05T23:14:24Z

/azp run js - storage-blob - tests

azure-pipelines · 2021-01-05T23:14:34Z

Azure Pipelines successfully started running 1 pipeline(s).

jeremymeng · 2021-01-06T00:31:40Z

/azp run js - storage-blob - tests

azure-pipelines · 2021-01-06T00:31:51Z

Azure Pipelines successfully started running 1 pipeline(s).

sdk/storage/storage-blob/test/retrypolicy.spec.ts

sdk/storage/storage-blob/src/policies/StorageRetryPolicy.ts

sdk/storage/storage-blob/src/Pipeline.ts

HarshaNalluru

Looks good!

jeremymeng · 2021-01-06T22:51:21Z

@ljian3377 please have a look. If looking good I will change queue/fileshare/etc.

ljian3377 · 2021-01-07T03:27:48Z

The ultimate fix for partial response should be upgrading to node-fetch 3.x? Please leave the original issue open till we do that.

The change looks good but I not sure if it's useful. Have you tested this in a real environment? Does this fix mitigate the issue?

jeremymeng · 2021-01-07T19:14:03Z

The ultimate fix for partial response should be upgrading to node-fetch 3.x?

Possibly although I've not tested it. In v3.x we might get an error from node-fetch which we would retry, instead of incomplete response. v3.x is in beta now. I am not sure about its timeline.

The change looks good but I not sure if it's useful. Have you tested this in a real environment? Does this fix mitigate the issue?

Yes it helps on running the repro code. There's still possibility of getting the same error in each retry if unlucky, but it's better than before.

After changing the order of deserialization policy and retry policy, `error.code` is now populated properly by deserialization policy. This surfaces an issue where an error with code `ResourceNotFound` will also be retried because it contains `eNotFound` and we use `error.code.toString().toUpperCase().includes()` to see if the error is in the list. It passed the check for the network error code `ENOUTFOUND`. This change fixes it by using exact match when checking error code.

jeremymeng · 2021-01-07T23:50:03Z

/azp run js - storage-blob - tests

jeremymeng · 2021-01-07T23:50:12Z

/azp run js - storage-file-share - tests

azure-pipelines · 2021-01-07T23:50:14Z

Azure Pipelines successfully started running 1 pipeline(s).

azure-pipelines · 2021-01-07T23:50:23Z

Azure Pipelines successfully started running 1 pipeline(s).

ljian3377 · 2021-01-08T04:59:43Z

sdk/storage/storage-blob/src/policies/StorageRetryPolicy.ts

-              .toString()
-              .toUpperCase()
-              .includes(retriableError))
+          (err.code && err.code.toString().toUpperCase() === retriableError)


I think this is right but let's also check with @XiaoningLiu

ljian3377 · 2021-01-08T08:30:37Z

Logged #13119

xirzec

Nice! This ended up being a pretty elegant solution.

…omplete-xml

jeremymeng requested review from HarshaNalluru, XiaoningLiu, jiacfan, ljian3377 and vinjiang as code owners January 5, 2021 23:13

ghost added the Storage Storage Service (Queues, Blobs, Files) label Jan 5, 2021

jeremymeng added the Client This issue points to a problem in the data-plane of the library. label Jan 5, 2021

Add browser recording for new test

14eea39

jeremymeng mentioned this pull request Jan 6, 2021

"Unclosed root tag" error when listing blobs or blob containers #12672

Closed

6 tasks

HarshaNalluru reviewed Jan 6, 2021

View reviewed changes

sdk/storage/storage-blob/test/retrypolicy.spec.ts Show resolved Hide resolved

Verify that error has been injected

fe782f8

HarshaNalluru reviewed Jan 6, 2021

View reviewed changes

sdk/storage/storage-blob/src/policies/StorageRetryPolicy.ts Outdated Show resolved Hide resolved

HarshaNalluru reviewed Jan 6, 2021

View reviewed changes

sdk/storage/storage-blob/src/Pipeline.ts Outdated Show resolved Hide resolved

HarshaNalluru approved these changes Jan 6, 2021

View reviewed changes

Tweak comments

f9d435e

jeremymeng added 2 commits January 7, 2021 13:56

Apply same fix to file-share/file-datalake/queue

5003243

ljian3377 reviewed Jan 8, 2021

View reviewed changes

ljian3377 approved these changes Jan 8, 2021

View reviewed changes

XiaoningLiu approved these changes Jan 8, 2021

View reviewed changes

xirzec approved these changes Jan 25, 2021

View reviewed changes

jeremymeng added 3 commits January 26, 2021 17:26

Merge remote-tracking branch 'upstream/master' into storage-retry-inc…

8aa55ff

…omplete-xml

Merge remote-tracking branch 'upstream/master' into storage-retry-inc…

a837e83

…omplete-xml

Add CHANGELOG entries

9161ef0

jeremymeng merged commit ee90255 into Azure:master Jan 26, 2021

jeremymeng deleted the storage-retry-incomplete-xml branch January 26, 2021 21:58

Copilot AI mentioned this pull request Dec 10, 2025

Sync eng/common directory with azure-sdk-tools for PR 13076 #36849

Merged

Conversation

jeremymeng commented Jan 5, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jeremymeng commented Jan 5, 2021

Uh oh!

azure-pipelines bot commented Jan 5, 2021

Uh oh!

jeremymeng commented Jan 6, 2021

Uh oh!

azure-pipelines bot commented Jan 6, 2021

Uh oh!

Uh oh!

Uh oh!

Uh oh!

HarshaNalluru left a comment

Choose a reason for hiding this comment

Uh oh!

jeremymeng commented Jan 6, 2021

Uh oh!

ljian3377 commented Jan 7, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jeremymeng commented Jan 7, 2021

Uh oh!

jeremymeng commented Jan 7, 2021

Uh oh!

jeremymeng commented Jan 7, 2021

Uh oh!

azure-pipelines bot commented Jan 7, 2021

Uh oh!

azure-pipelines bot commented Jan 7, 2021

Uh oh!

ljian3377 Jan 8, 2021

Choose a reason for hiding this comment

Uh oh!

ljian3377 commented Jan 8, 2021

Uh oh!

xirzec left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

jeremymeng commented Jan 5, 2021 •

edited

Loading

ljian3377 commented Jan 7, 2021 •

edited

Loading