Describe the bug
We've been seeing some very occasional file corruption when downloading files from S3 using the SDK. We first observed this on 2nd March, and have had 6 occurrences since (out of, say, 20,000 successful downloads).
The affected files have a repeated chunk in them. For example, if the original file is 5MB, the downloaded file might be 5.5MB, with the first 0.5MB appearing twice.
After downloading, the file at local_path should match the S3 object. However, we're still seeing this silently corrupt the download despite the presence of checksum_mode.
Expected Behavior
Downloaded files exactly match the file on S3. Or, at least, a checksum error is raised if they don't match.
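As a stopgap, a check along these lines could be run after each download. This is only a minimal sketch, assuming the expected size and a SHA-256 hex digest of the object are recorded somewhere else (for example at upload time). `verify_download!` and `DownloadCorruptError` are hypothetical names and not part of aws-sdk-s3:

```ruby
require "digest"

# Hypothetical error class, not part of aws-sdk-s3.
class DownloadCorruptError < StandardError; end

# Hypothetical helper: checks a downloaded file against a size and a SHA-256
# hex digest that were recorded elsewhere (assumption: e.g. at upload time).
def verify_download!(local_path, expected_size:, expected_sha256:)
  # Cheap check first: a duplicated chunk always changes the file size.
  actual_size = File.size(local_path)
  unless actual_size == expected_size
    raise DownloadCorruptError,
          "size mismatch: expected #{expected_size} bytes, got #{actual_size}"
  end

  # Full content check; Digest::SHA256.file streams the file from disk.
  actual_sha256 = Digest::SHA256.file(local_path).hexdigest
  unless actual_sha256 == expected_sha256
    raise DownloadCorruptError, "SHA-256 mismatch for #{local_path}"
  end

  true
end
```

The size comparison alone would have caught every occurrence described here, since each corrupted file was longer than the S3 object.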
Current Behavior
Whenever we've seen this fail, the file takes longer than usual to download, and the SDK logger reports that a retry was used:
Here are the S3 logs from that:
In this example, the S3 file is 35,600,463 bytes long, but the corrupted local file is 36,036,229 bytes long, with a duplicated 435,766-byte section at the start.
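To confirm that a suspect file follows this pattern, something like the following could be used. It is a rough diagnostic sketch, and `duplicated_prefix_length` is a hypothetical helper, not part of aws-sdk-s3. It only checks that the first N bytes appear twice in a row, where N is the number of surplus bytes. It does not verify the rest of the file:

```ruby
# Hypothetical diagnostic helper, not part of aws-sdk-s3.
# Returns the length of the duplicated leading section if the local file is
# longer than expected and its first `extra` bytes are immediately repeated;
# returns nil otherwise.
def duplicated_prefix_length(local_path, expected_size)
  extra = File.size(local_path) - expected_size
  # No surplus bytes, or too many for a single repeated prefix.
  return nil unless extra.positive? && extra <= expected_size

  File.open(local_path, "rb") do |f|
    first = f.read(extra)   # bytes [0, extra)
    second = f.read(extra)  # bytes [extra, 2 * extra)
    first == second ? extra : nil
  end
end
```

For the file above, the surplus is 36,036,229 - 35,600,463 = 435,766 bytes, so calling `duplicated_prefix_length(local_path, 35_600_463)` on the corrupted copy should return 435766 if the corruption matches the pattern described.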
Reproduction Steps
I'm open to better ideas to reproduce this but have tried for a while without success. Sometimes weeks go by without us seeing the issue.
Possible Solution
No response
Additional Information/Context
This is also open as AWS support ticket 9950331191 with unredacted info, if you have access to it.
Gem name ('aws-sdk', 'aws-sdk-resources' or service gems like 'aws-sdk-s3') and its version
aws-sdk-s3 3.130.0
Environment details (Version of Ruby, OS environment)
Ruby 3.0.3, Ubuntu 18