Event Listeners #922

andrew4699 · 2025-01-31T19:47:19Z

Implementation of event listeners discussed here.

I decided to keep this implementation generic and not take a dependency on Jakarta Events nor Vertx busses. It's easy to extend this, either within Polaris or in an external PolarisEventListener, and handle events however one wishes.

Some high level notes:

PolarisEventListener is the main interface with all the event methods such as onBeforeRequestRateLimited
DefaultPolarisEventListener is an empty implementation which allows users to only partially implement event handlers
polaris.events.type is the config that lets you specify your event listener implementation

quarkus/defaults/src/main/resources/application.properties

...java/org/apache/polaris/service/quarkus/events/QuarkusPolarisEventListenerConfiguration.java

service/common/src/main/java/org/apache/polaris/service/events/BeforeTableRefreshEvent.java

eric-maynard · 2025-03-04T17:24:29Z

service/common/src/main/java/org/apache/polaris/service/events/PolarisEventListener.java

I wonder if we really need the Polaris prefix here.

Also, unfortunately, I think we need javadocs on the methods.

Yea we could get rid of the Polaris prefix.

WDYT about @link javadocs that point to the actual event data class? I just didn't wanna duplicate the comment in 2 places. Spark seems to have fairly vague comments on the methods that don't provide much more info than the method name itself, but also Spark doesn't seem to have the detailed comments on the event data classes themselves.

Yeah that's fair, let's just add @links

The "Polaris" prefix would deserve a discuss thread imho. Many components have this prefix, but not all.

Many have it, and many don't -- agreed that we should standardize and agreed that a discussion thread is a good place to do that.

For this individual decision on this PR, I'm not sure a decision either way would require a huge discussion given the current lack of standardization.

quarkus/defaults/src/main/resources/application.properties

service/common/src/main/java/org/apache/polaris/service/events/PolarisEvent.java

adutra · 2025-03-06T18:03:17Z

service/common/src/main/java/org/apache/polaris/service/events/PolarisEventListener.java

The "Polaris" prefix would deserve a discuss thread imho. Many components have this prefix, but not all.

adutra · 2025-03-06T18:04:13Z

service/common/src/main/java/org/apache/polaris/service/events/PolarisEventListener.java

Generally speaking, what are the guarantees about "after" commits? Are they triggered if the operation failed, or just when they succeeded? But then, do we need events for failed operations as well?

I see that AfterAttemptTaskEvent has a boolean success flag, but other "after" events do not. Also, maybe the full throwable is preferable than just a boolean flag.

I agree that we should be as explicit as possible about this, although I think it might be ambitious to guarantee the exact semantics for each event out of the gate. At the very least, we should try our best to define the behavior with tests.

I think we just need to rename to make it clear. I suggested to use a name like TableCommitted to indicate this event happens only if the table committed successfully. #922 (comment)

The Javadoc should describe the semantics to a reasonable extent. It does describe the table commit event as not being emitted during failure, and that each attempt of a task regardless of success is emitted.

Docs aside, there's a question of which semantics we choose to actually implement, and since this is meant to be a broad interface there will be differing opinions on which make the most sense. Rather than trying to pick the perfect one for everyone, folks can always add new events and/or fields to existing events that represent different semantics.

adutra · 2025-03-06T18:09:18Z

service/common/src/main/java/org/apache/polaris/service/catalog/BasePolarisCatalog.java

This approach of manually triggering events is fine as a starting point, but I think a more elegant approach would be to create a decorator catalog that would decorate calls to each method with before/after events.

Yea that would be interesting. It might not be ideal to couple events to methods though, as that could force you to unnecessarily extract things into a separate method.

flyrain

I think adding a new event in Polaris shouldn't break existing listeners. We will need to consider a different design.

...service/src/test/java/org/apache/polaris/service/quarkus/catalog/BasePolarisCatalogTest.java

service/common/src/main/java/org/apache/polaris/service/events/AfterTableCommitEvent.java

flyrain · 2025-03-07T01:15:08Z

service/common/src/main/java/org/apache/polaris/service/events/PolarisEventListener.java

Adding a new event shouldn't require recompilation or redeployment of listeners. How do we archive that with the current design? One of solutions is to remove the specific type in the listener interface, only keep the a generic event type, like the following:

public void handleEvent(Event event) { switch(event.getEventType()) { case "BeforeTableCommit": handleBeforeTableCommit(event.getPayload()); break; case "TableCommitted": handleTableCommitted(event.getPayload()); break; default: // Ignore or log unknown event } }

I disagree -- if I was maintaining a listener implementation and a new event type was added I would prefer that things fail at build time rather than have my listener silently ignore the new event

I think it's a common use case that one implementation only cares about a subset of Polaris events, instead of all of them. For example, user A may only care about the table commit event, so it's fine to ignore other events for user A's listener, including new added events.

To Andrew's point, if you want that behavior then you just have your implementation extend DefaultPolarisEventListener.

My bad. I thought we need to recompile to allow the subclass to get the new methods from the parent class. Java did a good job to keep the binary compatibility, so that the subclass doesn't have to recompile, the caller can still resolve the methods from its parent, https://docs.oracle.com/javase/specs/jls/se8/html/jls-13.html. My mindset for this part is still with c++. Sorry for the confusion, @eric-maynard and @andrew4699.

I think there's use cases for both wanting to know about new events (ie auditing) and wanting to ignore them (if you only care about a very specific event). Given that, I'm not sure we should sell a single extension point but rather give the option.

I don't think I understand quite enough about how people use dependencies in Java nor the plugin pattern to fully understand the implications of that though. If something is to break, it would ideally happen at compile time and not at runtime.

Also I agree that DefaultPolarisEventListener isn't a great name for an extension point.

The current implementation gives you both options

Does it? If the interface PolarisEventListener is public, it's part of the API. If it's part of the API, you cannot add regular methods to it, even if you implement them in DefaultPolarisEventListener.

@andrew4699 , two use cases are both valid. However, it shouldn't notify users in a way that breaks existing listeners. If a listener need to include all new events, it has to recompile and redeployment anyways. At the same time, Polaris couldn't just break other listeners who may only care a subset of events.

For example, Polaris will introduce events that covers a wide range of activities like the followings:

Table/namespace create/update events

Catalog create/update events

Principal/role create/update events.

It's common that some listeners only care about table/namespace events, as they are essential for catalog federation, while other listeners only care about principal/role events for identity federation. Adding a new table related event shouldn't break the identity federation listeners.

I'm happy to disagree & commit to one option here and do this iteratively, but let me make 1 more case for the current approach:

Spark does almost exactly what this does. SparkListenerInterface is public and they just recommend against using it. The only difference in Spark is that SparkListener (our DefaultPolarisEventListener) is abstract. However, I'm not sure if we have an elegant way to specify "no event listener" with our config singleton pattern, so a non-abstract default listener may actually be needed.

If the interface PolarisEventListener is public, it's part of the API. If it's part of the API, you cannot add regular methods to it, even if you implement them in DefaultPolarisEventListener.

I think as a general rule it's good to apply this principle, but users are getting an explicit warning here. We could make the warning even louder.

SparkListenerInterface is actually private[spark] in Scala, but Scala's private is pretty fake anyway. Java or Scala callers can easily access it.

Does it? If the interface PolarisEventListener is public, it's part of the API. If it's part of the API, you cannot add regular methods to it, even if you implement them in DefaultPolarisEventListener.

The current implementation gives you both the options to:

Implement a listener that extends PolarisEventListener and must implement exactly the methods that appear there or fail at build time (or runtime to some ClassNotFoundException if you don't build against the dependency you actually run against).

Implement a listener that extends DefaultPolarisEventListener and can optionally override various methods of that class. If new methods appear, your already-built artifact may continue working against a different version of polaris than the one you built against.

In both cases, you also have the option to go into the code, add new events, and handle those events in your listener(s).

It seems like Yufei thinks that people running against a different version than the one they built against will be a common pattern we should support whereas I considered this something to be avoided and discouraged through our design here.

In the end I think @flyrain's suggestion that we "hide" the interface like Spark does is fine; expert users who want it can break glass via fork or something and use it anyway. If the DefaultPolarisEventListener being the more accessible extension point turns out to be problematic in the future (e.g. users are surprised by new events being ignored) then we can always expose the interface and update our guidance. In general, I expect anyone deploying a custom listener to be a power-user.

flyrain · 2025-03-07T01:25:26Z

service/common/src/main/java/org/apache/polaris/service/events/AfterTableRefreshEvent.java

Curious the use case of this event. I'd assume it's not for the performance instrument.

Same naming suggestion here, the name would be TableRefreshed

Broadly speaking I think the service doesn't need to be too opinionated on how events might be consumed; the service just emits them. You could imagine something as simple as wanting to keep a counter of how many tables were refreshed in the last 24h.

service/common/src/main/java/org/apache/polaris/service/events/AfterViewCommitEvent.java

andrew4699 · 2025-03-07T02:06:55Z

I think adding a new event in Polaris shouldn't break existing listeners. We will need to consider a different design.

That's what DefaultPolarisEventListener is for. Users who are concerned with that issue should extend DefaultPolarisEventListener instead of PolarisEventListener.

flyrain

LGTM. Thanks @andrew4699 !

snazy

It seems that the event types are missing some important attributes and some event types are exposing implementation specifics.

I'll comment on the dev-ML as well, but overall I don't understand how this is intended to be used ("what are the concrete use cases for this?").

snazy · 2025-03-18T09:47:08Z

service/common/src/main/java/org/apache/polaris/service/events/TestPolarisEventListener.java

This is test-only code and shouldn't be reachable in production code at all.

I agree in principle, but I'm following the same pattern that already exists:

https://github.com/apache/polaris/blob/main/quarkus/service/src/main/java/org/apache/polaris/service/quarkus/config/ProductionReadinessChecks.java#L166

https://github.com/apache/polaris/blob/main/service/common/src/main/java/org/apache/polaris/service/context/TestRealmContextResolver.java

Some beans in test folders don't get resolved.

Sorry, but I don't think that the argument that there's already a bad pattern is good enough to justify this.

I'm confused why you approved this earlier then #594

It's not that I agree with the pattern (I don't and have expressed this), it's that maintaining consistency is less confusing to all developers, and this PR doesn't make it materially harder to change the pattern in the future. A separate discussion should happen about changing this but that's out of scope of this PR.

service/common/src/main/java/org/apache/polaris/service/catalog/IcebergCatalogAdapter.java

...ce/common/src/main/java/org/apache/polaris/service/catalog/PolarisCatalogHandlerWrapper.java

service/common/src/main/java/org/apache/polaris/service/events/DefaultPolarisEventListener.java

snazy · 2025-03-18T09:56:46Z

service/common/src/main/java/org/apache/polaris/service/events/PolarisEventListener.java

What's the plan/approach for this type when new events are added?
For example: generic tables - are those handled via on*Table* or do those get separate events?

Style: this is rather an interface.

I don't think there's anything special about event listeners for the question of "if we implement X, how will that affect Y?" The same principle of maintaining reasonable behavior for existing users should apply.

See #922 (comment) for why it's an ABC instead of an interface.

snazy · 2025-03-18T09:58:25Z

service/common/src/main/java/org/apache/polaris/service/events/AfterTaskAttemptedEvent.java

taskEntityId is a persistence internal property, attempt is actually wrong - see how loadTasks is implemented.
I'm also not sure this event makes sense, given that task handling is incomplete.

Can you elaborate on the use case of this one?

Can you elaborate on the attempt being wrong? I don't see what you're referring to.

Here are some threads talking about use cases:

Event Listeners #922 (comment)

Event Listeners #922 (comment)

Also the design doc, mailing list thread, and this PR talk about events specifically being intended to be super flexible because different Polaris users will have different needs. Not all events will be useful for everyone.

"Flexible" is not the same as "implementation specific".
Also, there's nothing in the code that could emit this one.
So this one should actually be removed.

If there's a specific use case, let's discuss the use case specifically and please not add stuff that's not usable.

Also, there's nothing in the code that could emit this one.

https://github.com/apache/polaris/blob/ab2560d27fb9df225f1a7ee1cdca27b20bfc8676/service/common/src/main/java/org/apache/polaris/service/task/TaskExecutorImpl.java#L165

specific use case

Logging.

Same as below - logging isn't a good use case.
Logging services already allow that - just implement the necessary logging in the code.

#922 (comment)

service/common/src/main/java/org/apache/polaris/service/events/AfterViewCommitedEvent.java

snazy · 2025-03-18T10:00:20Z

...ce/common/src/main/java/org/apache/polaris/service/events/BeforeRequestRateLimitedEvent.java

What's the use case of this one and how would an implementation behave in case of a DoS attack attempt?

See #922 (comment)

Can you elaborate on your DoS concerns? If you're using the no-op listener, nothing changes. It's up to you to respond to events in your preferred way.

I'd like to counter the question: What's the concrete use case for this one?

I don't think that "logging" is a good enough reason for an event.
Logging can happen from the limiter, right?

People have different logging requirements. Logs are a controversial change, which you explained elegantly here.

Sorry, I'm strongly -1 on this event. DoS w/ millions or billions of requests are already an issue - this event just makes such a situation worse.

Can you elaborate on your DoS concerns? If you're using the no-op listener, nothing changes. It's up to you to respond to events in your preferred way.

That's the only one that's there - "noop". The code yields exactly nothing. The events can't be consumed by anybody. I would really like to see concrete use cases for Apache Polaris users, how the architecture would look like and how things are integrated.

The missing attributes that I noticed let me think that the consuming side (aka use cases) hasn't been thought through.

The code yields exactly nothing. The events can't be consumed by anybody.

Isn't exactly the opposite true? After this PR, events can be consumed by everybody!

I've listed demonstrable use cases for each event in the design doc.

service/common/src/main/java/org/apache/polaris/service/events/AfterTableCommitedEvent.java

snazy · 2025-03-18T10:03:34Z

service/common/src/main/java/org/apache/polaris/service/catalog/BasePolarisCatalog.java

Don't failures deserve to be emitted?

Maybe. Events are supposed to be flexible and cover a variety of needs, so there isn't a clear minimum nor maximum for what should be implemented. This PR aims to introduce the concept and cover a large surface area, but by design it's easy to add more features.

What's the point of having an "onBefore*" but not always an "onAfter*" then?

The point is that there's no common list of events that cover what everyone needs, so lets implement a big chunk of them. The beauty of open source is that if someone wants more events, it's easy for them after this PR to add them :)

Sorry, strong -1 on "lets implement a big chunk just because it's possible".

but it's impossible to know all concrete use cases

Exactly that! But to me the justification "let's implement everything we can think of because it's possible" does not work. Having code just because we can have code - sorry, no.

Again, "because it's possible" is not the justification. There are use cases for each event implemented here, and there should be a demonstrable one for events added in the future. The design doc says this:

The intent is for the list of events to grow with Polaris and for adding an event to feel like a relatively lightweight change, compared to refactoring an entire component into one/many swappable interfaces. Not everything should be an event though; there should be at least 3 criteria for adding a new event:

Difficult to achieve through other means (configuration/dependency injection/etc)

A use case can be demonstrated, although it may not be obviously useful to all users of the OSS project

Cannot be folded into existing event listeners

All 3 criteria are true for events in this PR.

Again, "because it's possible" is not the justification.

You mentioned above: there's no common list of events that cover what everyone needs, so lets implement a big chunk of them

I'm asking for concrete requirements.

so lets implement a big chunk of them

Replace "them" with "events with demonstrable use cases".

I'm asking for concrete requirements.

Ok, I've added a new column to the design doc with example use cases for each event.

eric-maynard

With the updated examples in the doc, this LGTM!

eric-maynard · 2025-04-18T03:24:37Z

To quickly capture my understanding wrt. how this relates to, and differs from, the events APIs proposed in Iceberg and the observability APIs proposed for Polaris, I mocked up an example of what an architecture might look like:

The tl;dr for readers confused about how these events accomplish observability is that they don't... they allow Polaris to emit events to some external source which might later be used to power observability. They also allow you to do other handy things -- like file a Jira if more than 100 tables are created in a given namespace -- if you're willing to write a custom event listener.

andrew4699 · 2025-04-18T16:26:55Z

Yes exactly. Also solving atomicity (ie of table commit + event) is out of scope of this PR. The model for these event listeners is that they are as if you are adding new specific code to Polaris. Like Polaris, the code running in these event listeners can be interrupted at any time, due to ie a process crash. The external-facing events implementation can choose to build on top of this or not.

eric-maynard · 2025-04-24T18:06:42Z

@snazy given the examples added to the doc in March are there still specific events you have concerns about emitting?

@snazy

@snazy can you take a second look to confirm whether you still have concerns about certain events? I looked through the linked doc and the example there make sense to me. In the meantime, we have added a lot of new actions where we should consider putting an event listener hook e.g. policy creation.

andrew4699 force-pushed the aguterman-event-listeners branch 3 times, most recently from 74b27ce to e65726f Compare February 26, 2025 22:44

andrew4699 force-pushed the aguterman-event-listeners branch from e65726f to f7c8391 Compare February 26, 2025 22:55

andrew4699 changed the title ~~[PROTOTYPE] Event listeners using a custom interface (not Jakarta Events)~~ Event Listeners Feb 26, 2025

andrew4699 marked this pull request as ready for review February 26, 2025 23:08

andrew4699 requested review from MonkeyCanCode, RussellSpitzer, adutra, ashvina, collado-mike, dennishuo, dimas-b, ebyhr, eric-maynard, flyrain, jackye1995, jbonofre, snazy, takidau and vvcephei as code owners February 26, 2025 23:08

andrew4699 mentioned this pull request Feb 26, 2025

[PROTOTYPE] Event listeners using Jakarta Events #921

Closed

eric-maynard reviewed Mar 3, 2025

View reviewed changes

andrew4699 requested a review from eric-maynard March 4, 2025 17:05

eric-maynard reviewed Mar 4, 2025

View reviewed changes

eric-maynard previously approved these changes Mar 6, 2025

View reviewed changes

github-project-automation bot moved this from PRs In Progress to Ready to merge in Basic Kanban Board Mar 6, 2025

adutra reviewed Mar 6, 2025

View reviewed changes

flyrain reviewed Mar 7, 2025

View reviewed changes

andrew4699 requested review from eric-maynard and flyrain March 17, 2025 23:00

flyrain previously approved these changes Mar 18, 2025

View reviewed changes

snazy previously requested changes Mar 18, 2025

View reviewed changes

github-project-automation bot moved this from Ready to merge to PRs In Progress in Basic Kanban Board Mar 18, 2025

andrew4699 requested a review from snazy March 21, 2025 00:43

andrew4699 dismissed stale reviews from flyrain and eric-maynard via ab2560d March 30, 2025 05:38

andrew4699 force-pushed the aguterman-event-listeners branch 4 times, most recently from 4cd0a4d to 95b4db7 Compare April 17, 2025 22:21

eric-maynard previously approved these changes Apr 18, 2025

View reviewed changes

andrew4699 dismissed eric-maynard’s stale review via 0189d55 April 18, 2025 18:36

andrew4699 force-pushed the aguterman-event-listeners branch from 95b4db7 to 0189d55 Compare April 18, 2025 18:36

Event listeners

b891810

andrew4699 force-pushed the aguterman-event-listeners branch from 0189d55 to b891810 Compare May 2, 2025 18:19

Fix test

f3e7d0c

andrew4699 requested a review from HonahX as a code owner May 5, 2025 16:43

eric-maynard approved these changes May 6, 2025

View reviewed changes

github-project-automation bot moved this from PRs In Progress to Ready to merge in Basic Kanban Board May 6, 2025

eric-maynard merged commit e5e0c0d into apache:main May 7, 2025
6 checks passed

github-project-automation bot moved this from Ready to merge to Done in Basic Kanban Board May 7, 2025

eric-maynard mentioned this pull request Sep 19, 2025

Fix & enhancements to the Events API hierarchy #2629

Merged

Event Listeners #922

Event Listeners #922

Uh oh!

Conversation

andrew4699 commented Jan 31, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

flyrain left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

andrew4699 Mar 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

eric-maynard Mar 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

andrew4699 commented Mar 7, 2025

Uh oh!

flyrain left a comment

Choose a reason for hiding this comment

Uh oh!

snazy left a comment

Choose a reason for hiding this comment

Uh oh!

andrew4699 commented Jan 31, 2025 •

edited

Loading

andrew4699 Mar 7, 2025 •

edited

Loading

eric-maynard Mar 11, 2025 •

edited

Loading

andrew4699 Mar 19, 2025 •

edited

Loading

andrew4699 Mar 18, 2025 •

edited

Loading