Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Feature Request: Add options to control the volume of requests fetched by the source connector #47

Open
yeikel opened this issue Nov 6, 2024 · 3 comments

Comments

@yeikel
Copy link

yeikel commented Nov 6, 2024

As far as I know, the only settings that may be able to control the volume are

  • couchbase.batch.size.max
  • couchbase.persistence.polling.interval
  • couchbase.flow.control.buffer (somewhat)
  • tasks

But as far as I can tell, there is no easy way to say "Only insert up to N updates per N time frequency" to the topic

Would it be possible to implement this?

For context, I am only interested in the latest version of the document and sometimes the connector can publish at a faster pace than what my kafka topic can hold. I'd like to control the backlog from bot the producer and the consumer ends

@dnault
Copy link
Contributor

dnault commented Nov 6, 2024

You're right, there's no rate limiting built into the Couchbase connector.

You're also not the only Kafka user with this problem:
https://cwiki.apache.org/confluence/display/KAFKA/KIP-731%3A+Record+Rate+Limiting+for+Kafka+Connect

That KIP was last updated in 2021, so I'm not sure if/when it will be implemented.

If you'd like to see something built into the Couchbase connector before then, you can help with the prioritization effort by taking advantage of your Couchbase Enterprise Subscription License and opening a support ticket.

We'd also welcome a PR that adds rate-limiting.

@dnault
Copy link
Contributor

dnault commented Nov 6, 2024

In the meantime, since you're only interested in the latest version of a document, maybe you can get the results you want by tuning Kafka's log compaction settings to use the "compact" policy.

@yeikel
Copy link
Author

yeikel commented Nov 6, 2024

In the meantime, since you're only interested in the latest version of a document, maybe you can get the results you want by tuning Kafka's log compaction settings to use the "compact" policy.

Thanks for the suggestion. We are using that already and it helps to an extent

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants