-
Notifications
You must be signed in to change notification settings - Fork 3k
Docs: Clarify ObjectStoreLocationProvider #2963
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
Thanks for rewording this and provide more clarification! There is an ongoing PR #2845 to finalize the exact way that |
|
@cobookman the PR referenced is merged, could you update the documentation with the correct path resolution strategy? Thank you! |
szehon-ho
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Nice, thanks for adding some light on this useful but under-documented feature , (struggled a little myself :))
Happy to, just want to understand what the expected write behaviour is for folder-storage & s3. Having the following fail on my end. Omitting the |
|
@cobookman The new resolution strategy is the following:
I think you are testing using an older version. I have added some more tests in Spark SQL, in case you want some examples: https://github.com/apache/iceberg/pull/2966/files |
|
/faceplam yep that was it, wasn't using the lastest on github. Got it to work, and will update the docs. CREATE TABLE my_catalog.my-db.my_table (
id bigint,
data string,
category string)
USING iceberg
OPTIONS (
'write.object-storage.enabled'=true,
'write.folder-storage.path'='s3://my-bucket/some-random-folder/')
PARTITIONED BY (category);Writes to: |
Clarified the description on ObjectStoreLocationProvider on that it generates a deterministic hash based on the filename, and that the hash is placed after `write.object-storage.path`. Added an example s3 path for ObjectStorageProvider Docs aws.md - Updated with suggestions Added - path resolution information - link to YT video on how S3 scales - explained that 2d3905f8 is a hash in the s3 path - changed text to "table properties"
jackye1995
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
thank you, lots of good information added!
Clarified the description on ObjectStoreLocationProvider on that it generates a deterministic hash based on the filename, and that the hash is placed after
write.object-storage.path.