Add FileIO implementation for Google Cloud Storage #3711
@danielcweeks could you provide some insight into the current dev list thread about the bundled runtime? @openinx is starting a vote. I approved this PR assuming we will not include this in the runtime, but if we go with his option 2 we will need to mark the Google dependencies as compile-only.

@jackye1995 Thanks for pointing out the thread; I followed up there. For now, we don't really need to add this to the runtime, so I think what we have here is fine. We can always follow up with bundle changes if the thinking on that changes, but I wouldn't want to push that with this PR.
This PR adds native FileIO support for GCS using the Google Cloud Storage APIs directly.
The read and write paths use streaming transfers, and the storage provider can be fully configured via a custom storage supplier. By default, a number of common configuration properties are provided via GCPProperties and can be used with a dynamically loaded configuration.
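
To illustrate the two configuration paths described above, here is a minimal, self-contained sketch of the supplier pattern. It is not the code in this PR: `BlobStore`, `InMemoryBlobStore`, `SketchFileIO`, and the `sketch.project-id` property key are all hypothetical stand-ins, chosen so the example runs without the Google client libraries. It only assumes that the FileIO either receives a supplier for its storage client up front or builds a default one from a property map after being loaded through a no-arg constructor.

```java
import java.nio.charset.StandardCharsets;
import java.util.HashMap;
import java.util.Map;
import java.util.function.Supplier;

public class StorageSupplierSketch {

  /** Hypothetical stand-in for a storage client; NOT the Google Cloud Storage API. */
  interface BlobStore {
    void write(String uri, byte[] data);

    byte[] read(String uri);
  }

  /** In-memory store so the sketch runs without any cloud dependency. */
  static class InMemoryBlobStore implements BlobStore {
    final String projectId;
    private final Map<String, byte[]> blobs = new HashMap<>();

    InMemoryBlobStore(String projectId) {
      this.projectId = projectId;
    }

    @Override
    public void write(String uri, byte[] data) {
      blobs.put(uri, data.clone());
    }

    @Override
    public byte[] read(String uri) {
      byte[] data = blobs.get(uri);
      if (data == null) {
        throw new IllegalArgumentException("No such object: " + uri);
      }
      return data.clone();
    }
  }

  /** Hypothetical FileIO-like class showing both configuration paths. */
  static class SketchFileIO {
    private Supplier<BlobStore> storeSupplier;
    private BlobStore store; // created lazily on first use (not thread-safe in this sketch)

    /** No-arg constructor for dynamic loading; initialize(...) must be called afterwards. */
    SketchFileIO() {}

    /** Full control: the caller decides how the client is built. */
    SketchFileIO(Supplier<BlobStore> storeSupplier) {
      this.storeSupplier = storeSupplier;
    }

    /** Builds a default supplier from properties unless a custom one was given. */
    void initialize(Map<String, String> properties) {
      if (storeSupplier == null) {
        // "sketch.project-id" is an invented key, used only for this example.
        String projectId = properties.getOrDefault("sketch.project-id", "default-project");
        storeSupplier = () -> new InMemoryBlobStore(projectId);
      }
    }

    private BlobStore client() {
      if (store == null) {
        if (storeSupplier == null) {
          throw new IllegalStateException("Call initialize(...) or pass a supplier first");
        }
        store = storeSupplier.get();
      }
      return store;
    }

    void write(String uri, byte[] data) {
      client().write(uri, data);
    }

    byte[] read(String uri) {
      return client().read(uri);
    }
  }

  public static void main(String[] args) {
    // Path 1: property-driven defaults after no-arg construction.
    SketchFileIO fromProperties = new SketchFileIO();
    fromProperties.initialize(Map.of("sketch.project-id", "my-project"));
    fromProperties.write("gs://bucket/a.txt", "hello".getBytes(StandardCharsets.UTF_8));
    System.out.println(
        new String(fromProperties.read("gs://bucket/a.txt"), StandardCharsets.UTF_8));

    // Path 2: a custom supplier takes precedence over any properties.
    SketchFileIO custom = new SketchFileIO(() -> new InMemoryBlobStore("custom-project"));
    custom.initialize(Map.of("sketch.project-id", "ignored"));
    custom.write("gs://bucket/b.txt", "world".getBytes(StandardCharsets.UTF_8));
    System.out.println(new String(custom.read("gs://bucket/b.txt"), StandardCharsets.UTF_8));
  }
}
```

Deferring client creation to the supplier keeps the FileIO object cheap to construct and lets tests swap in a local or in-memory client, which is the main appeal of this kind of design.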