Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Parallelize file commit in HdfsWriter close #3

Open
Ishiihara opened this issue Jun 9, 2015 · 0 comments
Open

Parallelize file commit in HdfsWriter close #3

Ishiihara opened this issue Jun 9, 2015 · 0 comments

Comments

@Ishiihara
Copy link
Contributor

When closing the HdfsWriter, it loops through the topic partitions and commit the data file associated with each topic partition. This process is blocking and can take a long time if the number of topic partitions is large. Parallelize this process can reduce latency. One way to parallelize file commit is to use a thread pool.

Usiel added a commit to Usiel/kafka-connect-hdfs that referenced this issue Mar 5, 2024
…t-69-ensure-rotation

Ensure file rotation
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants