-
Notifications
You must be signed in to change notification settings - Fork 2.5k
(WIP) Updated all calls to get file system through FSUtils.getFs() #191
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
1bcce25 to
ef5a407
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is just a test, and not a CLI right. so why the parameters?
vinothchandar
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@gekath Have you been able to test this out in production? This is rather big change, so would like to understand how much testing has been done..
63b301d to
d6af8c9
Compare
|
@vinothchandar this PR focused on testing that a read from local, and write to HDFS will result in correct count of records, and this functionality should be similar between any two file systems. Our specific goal is to consider use case of reading from HDFS, writing to GCS. Will look at adding additional testing, (changed to WIP). Key updates are in:
|
|
Sounds good. Will take a closer look sometime this week. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
logger or please remote s.o.pln
vinothchandar
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Changes look safe to me. Need to look at the stuff at HoodieWrapperFileSystem more closely..
Please ping back once you have tested this more for your use-cases.. I ll pull and test on HDFS at Uber as well and we can proceed from there.
|
@gekath any update on this? |
|
Hi @vinothchandar , we're working on integrating Hoodie into our current pipeline and then will proceed to test out performance. |
|
Sounds good. please keep us posted on progress/blockers. |
|
@gekath - Do you have any updates on this PR? Thanks. |
|
@prazanna not much progress on this front yet. |
|
from what @gekath and I discusssed offline.. Seems this is being actively tested :) |
82e2216 to
3949388
Compare
Merge conflicts Merge conflicts Removed print statement Merge conflicts Merge conflicts Merge conflicts Merge conflict Removed default getFs that takes no arguments.
3949388 to
481c4a5
Compare
|
@alunarbeach do you want to drive this? |
|
Closing in favor of #293 |
Co-authored-by: StreamingFlames <[email protected]> Co-authored-by: Nicholas Jiang <[email protected]> Co-authored-by: Y Ethan Guo <[email protected]> Co-authored-by: Sagar Sumit <[email protected]> Co-authored-by: Alexey Kudinkin <[email protected]> Co-authored-by: RexAn <[email protected]>
Updates all consumers of FSUtils.getFs() to accept a path, and infers scheme based on given path and/or configuration, instead of inferring scheme through
fs.defaultFS. Calls to get a filesystem go through FSUtils.getFs(), aimed to support multiple filesystems./fix #96
/cc @vinothchandar @prazanna @zqureshi