-
Notifications
You must be signed in to change notification settings - Fork 9.2k
HADOOP-15999. S3Guard: Better support for out-of-band operations #624
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
🎊 +1 overall
This message was automatically generated. |
|
This test is working for me standalone, but when run in a parallel set, its failing Assumption: some FS sharing/caching is still going on |
|
and I've managed to do it in a single test run. There's some intermittent issue going on here. It could just be timestamp race conditions. |
|
Given I've seen the failure in a standalone run, it's not FS caching somehow meaning raw has picked up a DDB table. Though I've added a way to query S3A FS instance for its auth status to make sure we know what's going on in inside Possible causes then
See PR #630 for the patch addressing these issues @bgaborg to get the changes I've done, look at the "command line instructions" at the bottom of the list, add my repo, do a git fetch and then cherry pick the commit 5aba1e4738b050fde1b2c45c21ed59320869be53 into this PR |
* Add delays long enough for timestamps to be different * Add delays for S3 to stabilize after writes/deletes, so that listings and HEAD calls will get the new value, not old ones * probes for differences look for file lengths ahead of timestamps, for more tangible failures. * and they validate the raw FS status acquired after the stabiliziation delay * package private (currently) probe for S3A to verify that an FS instances considers its store to be authoritative. Currently we've been checking the config, but to really know what's happening: lets query the internal state of FS. Change-Id: Ib0184a2aacbec1e4b316cb8cad0265bd0b579bcd (cherry picked from commit 5aba1e4738b050fde1b2c45c21ed59320869be53)
|
I'll do another check on your modifications, run some test with local and ddb, and get back with the results. |
|
I never got any issues with running ITestS3GuardOutOfBandOperations (just with local and because of the reference on the cache), but running it with your commit gave me the following error:
Maybe something still inconsistent? |
|
The second run succeeded, no errors. |
|
🎊 +1 overall
This message was automatically generated. |
|
I think there's still a race condition. Can you use LambdaTestUtils.eventually() to give a longer interval for things to stabilise? that will exit fast on success, but retry until there's stability |
|
OK, updated the test to use If you can run it and the tests work, all is well. If they fail, then we may have more insight as to what is wrong |
| final long s3ModTime = s3AFileStatus.getModificationTime(); | ||
|
|
||
| if(s3ModTime > msModTime) { | ||
| LOG.debug("S3Guard metadata for {} is outdated, updating it", |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
might include the 2 mod times in the debug log
|
Looks like this should be closed since #630 was merged? |
|
It's already committed so closing this. |
Remove SimpleConsumer and BrokerProxy from Samza's KafkaSystemConsumer implementation. Instead use KafkaConsumerProxy with high-level kafka consumer. Author: Boris S <[email protected]> Author: Boris Shkolnik <[email protected]> Reviewers: Shanthoosh Venktataraman <[email protected]>, Prateek Maheshwari <[email protected]> Closes apache#624 from sborya/NewConsumer2
No description provided.