You need a fahrschein to use the nakadi event bus.
Image Source: Deutsches Bundesarchiv, Photographer: Ralph Hirschberger, Image License: CC BY-SA 3.0 DE
- Consistent error handling
- Stream-based parsing
- Optimized utf-8 decoding by Jackson
- No unnecessary buffering or line-based processing, causing less garbage
- Less garbage and higher performance
- No required base classes for events
- Support for both high-level (subscription) and low-level APIs
- Pluggable HTTP client implementations
- Gzip encoding support for publishing and consuming events
- ZStandard compression support for publishing events
- Integrated Spring-Boot-Starter
Fahrschein is available in maven central, so you only have to add the following dependency to your project:
<dependency>
<groupId>org.zalando</groupId>
<artifactId>fahrschein</artifactId>
<version>${fahrschein.version}</version>
</dependency>
The following code example creates an event listener based on Nakadi's subscription API, which gets a call-back for each batch.
String eventName = "sales-order-placed";
// Create a Listener for our event
Listener<SalesOrderPlaced> listener = events -> {
for (SalesOrderPlaced salesOrderPlaced : events) {
LOG.info("Received sales order [{}]", salesOrderPlaced.getSalesOrder().getOrderNumber());
}
};
// Configure client. It is mandatory to choose the RequestFactory implementation (we recommend the JavaNetRequestFactory) and request encoding (we recommend compression via GZIP or ZSTD)
// PlatformAccessTokenProvider provides a token by reading from the filesystem's CREDENTIALS_DIR
NakadiClient nakadiClient = NakadiClient.builder(NAKADI_URI, new JavaNetRequestFactory(ContentEncoding.GZIP))
.withAccessTokenProvider(new PlatformAccessTokenProvider())
.build();
// Create subscription using the Subscription API.
// The authorization section is mandatory. We recommend setting your team as admin
// and your application as reader.
Subscription subscription = nakadiClient.subscription(applicationName, eventName)
.withAuthorization(authorization()
.addAdmin("team", "your_team")
.addReader("service", "stups_your_application").build())
.subscribe();
// Define stream parameters, like batch size. For more details and defaults see Nakadi's API definition
StreamParameters streamParameters = new StreamParameters()
.withBatchLimit(25);
// Start streaming. The listen call will block and automatically reconnect on IOException
nakadiClient.stream(subscription)
.withStreamParameters(streamParameters)
.listen(SalesOrderPlaced.class, listener);
See ConsumerExample.java
and ProducerExample.java
for executable versions of the above code.
One could also use a pre-existing subscription ID to stream events. This will force fahrschein not to create a subscription for the given event and will use the provided subscription ID.
// Create subscription using an existing subscription ID
Subscription subscription = nakadiClient
.subscription("application-name",eventName)
.subscribe("subscription-id");
// Start streaming, the listen call will block and automatically reconnect on IOException
nakadiClient.stream(subscription)
.listen(SalesOrderPlaced.class, listener);
By default nakadi will start streaming from the most recent offset. The initial offsets can be changed by requesting data about partitions from Nakadi and using this data to configure the CursorManager
.
List<Partition> partitions = nakadiClient.getPartitions(eventName);
// NakadiClient can be configured to start reading from the oldest available offset in each partition
nakadiClient.stream(eventName).readFromBegin(partitions);
// Or from the newest available offset, but this is the same as the default
nakadiClient.stream(eventName).readFromNewestAvailableOffset(partitions);
// Or (for a persistent cursor manager) we can start reading from the last offset that we processed if it's still available, and from the oldest available offset otherwise
nakadiClient.stream(eventName).skipUnavailableOffsets(partitions);
// event types
String ORDER_CREATED = "order-created";
// Configure client. It is mandatory to choose the RequestFactory implementation (we recommend the JavaNetRequestFactory) and request encoding (we recommend compression via GZIP or ZSTD)
// PlatformAccessTokenProvider provides a token by reading from the filesystem's CREDENTIALS_DIR
NakadiClient nakadiClient = NakadiClient.builder(NAKADI_URI, new JavaNetRequestFactory(ContentEncoding.GZIP))
.withAccessTokenProvider(new PlatformAccessTokenProvider())
.build();
// Produce events in batches
while (true) {
List<OrderCreatedEvent> events = new ArrayList<>();
for (int i = 0; i < 10; i++) {
events.add(new OrderCreatedEvent(metadata, "123", Money.of(123, "EUR"), "1234", "paypal"));
}
LOG.info("publishing {} events", events.size());
// Publish
nakadiClient.publish(ORDER_CREATED, events);
}
Nakadi publishing API accepts events in batches. It can fail to publish some events from the batch to underlying storage.
In that case Nakadi publishing API will return error that batch was partially successful, indicated in Fahrschein by an EventPersistenceException
.
For this situation, Fahrschein offers auto-retry mechanisms, allowing clients to make a choice between retrying the entire batch or opting for a partial retry as needed.
For added flexibility, various backoff strategies for delaying the retries exist.
Fahrschein has the capability to retry the entire batch or only the events within a batch that have failed
(see available publishing retry strategies below). It will not
retry aborted events, as they failed the event schema validation, and retrying without either changing the event or the
schema would be futile. This is indicated by an EventValidationException
. For generic IOExceptions
like temporary network issues,
custom retry code can be wrapped around the NakadiClient.publish
call.
Fahrschein does not enable auto-retry on publishing by default.
To enable automatic retry on publishing:
NakadiClient nakadiClient = NakadiClient.builder(NAKADI_URI, new JavaNetRequestFactory(ContentEncoding.GZIP))
.withPublishingRetryAndBackoffStrategy(PublishingRetryStrategies.FAILED_ONLY, new ExponentialBackoffStrategy()) // recommended retry configuration
.build();
nakadiClient.publish("foobar", List.of(
new SomeEvent("eid1", new Metadata("eid1",OffsetDateTime.now())),
new SomeEvent("eid2", new Metadata("eid2",OffsetDateTime.now())),
new SomeEvent("eid3", new Metadata("eid3",OffsetDateTime.now())))
);
-
PublishingRetryStrategies.ALL
to retry the entire batch regardless of which events in the batch have failed. This strategy will lead to high write amplification on Nakadi's side in case of partial outages, and is discouraged. -
PublishingRetryStrategies.FAILED_ONLY
(default) to retry only the events within a batch that have failed. This is
the recommended default for Nakadi clients, but there is a potential edge case, where within publishing of one batch Nakadi recovers. If, for example in a batch of three events[1,2,3]
, all belonging to the same partition,[1]
fails, but[2,3]
succeed, the ordering guarantee for one partition may be potentially be broken, leading to[2,3,1]
after retry. For most API clients this is not an issue. -
PublishingRetryStrategies.ALL_FROM_FIRST_FAILURE
to retry the events within a batch, starting with the first failure. This strategy trades off write amplification with better ordering guarantees for the aforementioned edge case. Taking the same example in a batch of three events[1,2,3]
, all belonging to the same partition,[1]
fails, but[2,3]
succeed, this retry strategy will retry the full batch of[1,2,3]
. -
PublishingRetryStrategies.NONE
to not retry.
You may also provide your own PublishingRetryStrategy
implementation.
By default fahrschein supports implementation of Zalando platofrm IAM (OAuth 2.0) through PlatformAccessTokenProvider
. The implementation
expects a mounted directory with the below structure.
meta
└── credentials
├── example-token-secret
└── example-token-type
The resulting token would be "Bearer your-secret-token"
One can override AuthorizationProvider
interface to support custom authorization flow.
Example:
//Create custom authorization
class CustomAuthorization implements AuthorizationProvider{
@Override
public String getAuthorizationHeader() throws IOException {
return "your-secret-token";
}
}
NakadiClient nakadiClient = NakadiClient.builder(NAKADI_URI, new SimpleRequestFactory(ContentEncoding.IDENTITY))
.withAccessTokenProvider(new CustomAuthorization())
.build();
Exception handling while streaming events follows some simple rules
IOException
and its subclasses are treated as temporary failures.If anIOException
occurs while opening the initial connection, it is considered a temporary failure. The library will automatically recover from this type of failure by retrying the processing attempt after time interval specified by theBackoffStrategy
.- If
listener
throwsRuntimeException
streaming of events will be aborted. User is responsible for handling these exceptions. Library code itself will not throwRuntimeException
s. - Exceptions in other client methods are not automatically retried
Fahrschein supports different exponential backoff strategies. All backoff values are configurable when passing your own instance in StreamBuilder#withBackoffStrategy(BackoffStrategy)
and NakadiClientBuilder#withPublishingRetryAndBackoffStrategy(PublishingRetryStrategy, BackoffStrategy)
.
ExponentialBackoffStrategy
- Base implementation for exponential backoff without jitter. Initial delay is 500ms, backoff factor 1.5, maximum delay 10min, with no limit on the maximum number of retries.EqualJitterBackoffStrategy
(default) - extendsExponentialBackoffStrategy
with the same defaults. For each delay it takes half of the delay value and adds the other half multiplied by a random factor [0..1).FullJitterBackoffStrategy
- extendsExponentialBackoffStrategy
with the same defaults and multiplies each delay by a random factor [0..1).
The stream implementation gracefully handles thread interruption, so it is possible to stop a running thread and resume consuming events by re-submitting the Runnable
:
ExecutorService executorService = Executors.newSingleThreadExecutor();
Runnable runnable = nakadiClient.stream(SALES_ORDER_SERVICE_ORDER_PLACED)
.runnable(SalesOrderPlaced.class, listener)
.unchecked();
// start consuming events
Future<?> future = executorService.submit(runnable);
// stop consuming events
future.cancel(true);
// resume consuming events
Future<?> future2 = executorService.submit(runnable);
Fahrschein uses it's own http abstraction which is very similar to spring framework's ClientHttpRequestFactory
interface. By default it uses the SimpleRequestFactory
which uses a HttpURLConnection
internally and has no further dependencies.
There is a version using java.net HttpClient (JDK11+) named JavaNetRequestFactory
in the fahrschein-http-jdk11
artifact.
<dependency>
<groupId>org.zalando</groupId>
<artifactId>fahrschein-http-jdk11</artifactId>
<version>${fahrschein.version}</version>
</dependency>
There is also a version using apache http components named HttpComponentsRequestFactory
in the fahrschein-http-apache
artifact.
<dependency>
<groupId>org.zalando</groupId>
<artifactId>fahrschein-http-apache</artifactId>
<version>${fahrschein.version}</version>
</dependency>
The apache version is useful when you want more control about the number of parallel connections in total or per host. The following example shows how to use a customized HttpClient
:
RequestConfig config = RequestConfig.custom().setSocketTimeout(readTimeout)
.setConnectTimeout(connectTimeout)
.setConnectionRequestTimeout(connectTimeout)
.build();
CloseableHttpClient httpClient = HttpClients.custom()
.setConnectionTimeToLive(readTimeout, TimeUnit.MILLISECONDS)
.disableAutomaticRetries()
.setDefaultRequestConfig(config)
.disableRedirectHandling()
.setMaxConnTotal(20)
.setMaxConnPerRoute(20)
.build();
RequestFactory requestFactory = new HttpComponentsRequestFactory(httpClient, ContentEncoding.GZIP);
NakadiClient nakadiClient = NakadiClient.builder(NAKADI_URI, requestFactory)
.withAccessTokenProvider(new PlatformAccessTokenProvider())
.build();
It is also possible to adapt other client implementations (e.g. OkHttp3) by using the Spring framework abstraction and by wrapping them into SpringRequestFactory
, contained in the fahrschein-http-spring
artifact. The current version assumes Spring Framework to be a provided dependency. See table below for the range of compatible Spring versions.
<dependency>
<groupId>org.zalando</groupId>
<artifactId>fahrschein-http-spring</artifactId>
<version>${fahrschein.version}</version>
</dependency>
Example using OkHttp 3.x:
ClientHttpRequestFactory clientHttpRequestFactory = new OkHttp3ClientHttpRequestFactory();
RequestFactory requestFactory = new SpringRequestFactory(clientHttpRequestFactory, ContentEncoding.GZIP);
NakadiClient nakadiClient = NakadiClient.builder(NAKADI_URI, requestFactory)
.withAccessTokenProvider(new PlatformAccessTokenProvider())
.build();
Note: The implementations from the Spring framework don't handle closing of streams as expected. They will try to consume remaining data, which will usually time out when nakadi does not receive a commit.
Note: Regarding sizing and reuse of HTTP client connection pools, make sure to have a connection pool size bigger than the number of subscriptions you're making, because subscriptions use long-polling to retrieve events, each effectively blocking one connection.
Fahrschein has support for metrics using Dropwizard
and Micrometer
, at the moment for consuming events only.
Each integration has two metrics collector implementations, one for counting, one for last activity timestamps.
Multiple metrics integrations can be chained by using the MultiplexingMetricsCollector
.
In dropwizard, the default metrics prefix is org.zalando.fahrschein
, for Micrometer it is fahrschein.listener
.
Example using Micrometer:
MeterRegistry registry = new SimpleMeterRegistry();
nakadiClient.stream(subscription)
.withMetricsCollector(new MicrometerMetricsCollector(registry))
Please do not use the Low-level API, as it is deprecated by Nakadi.
The Low-level API requires local persistence of partition offsets.
There is one CursorManager
implementation left: InMemory.
Postgres and Redis cursor managers have been DEPRECATED and removed in version 0.22.0 of Fahrschein.
CursorManager cursorManager = new InMemoryCursorManager();
NakadiClient nakadiClient = NakadiClient.builder(NAKADI_URI, new SimpleRequestFactory(ContentEncoding.IDENTITY))
.withAccessTokenProvider(new PlatformAccessTokenProvider())
.withCursorManager(cursorManager)
.build();
nakadiClient.stream(eventName)
.listen(SalesOrderPlaced.class, listener);
With the PartitionManager
api it is possible to coordinate between multiple nodes of one application, so that only one node is consuming events from a partition at the same time.
Partitions are locked by one node for a certain time. This requires that every node has a unique name or other identifier.
@Scheduled(fixedDelay = 60*1000L)
public void readSalesOrderPlacedEvents() throws IOException {
String lockedBy = ... // host name or another unique identifier for this node
List<Partition> partitions = nakadiClient.getPartitions(eventName);
Optional<Lock> optionalLock = partitionManager.lockPartitions(eventName, partitions, lockedBy);
if (optionalLock.isPresent()) {
Lock lock = optionalLock.get();
try {
nakadiClient.stream(eventName)
.withLock(lock))
.listen(SalesOrderPlaced.class, listener);
} finally {
partitionManager.unlockPartitions(lock);
}
}
}
Although Fahrschein is using fixed dependency versions, it is integration-tested against the following dependency matrix. We will inform in the release notes in case we bump the compatibility baseline.
Dependency | Baseline | Latest |
---|---|---|
Jackson | 2.8.0 | 2.+ |
Spring Core | 6.0.0 | 6.+ |
Spring Boot | 3.0.0 | 3.+ |
okHttp | 3.3.0 | 4.+ |
Apache HttpClient | 5.0 | 5.+ |
Fahrschein handles content compression transparently to the API consumer, and mostly independently of the actual HTTP
client implementation. Since version 0.20.0
it can be enabled to both compress HTTP POST bodies when event
publishing, and requesting payload compression from Nakadi when consuming events.
For event consumption the underlying HTTP client implementations send Accept-Encoding
headers, indicating their supported compression algorithms.
At the time of writing, all tested client implementations default to gzip
compression. If this is undesired, wrap your
RequestFactory into a IdentityAcceptEncodingRequestFactory
, which sets the Accept-Encoding
header to identity
.
For event publishing, the Request
body can also get compressed by Fahrschein, if enabled when building the RequestFactory.
For this, you need to pass either ContentEncoding.GZIP
, ContentEncoding.ZSTD
, or if compression is undesired, pass ContentEncoding.IDENTITY
.
Zstandard compression was added in version 0.21.0
.
Fahrschein | nakadi-java | |
---|---|---|
Dependencies | Jackson | gson, okhttp3, RxJava |
Low-level API streaming | yes | yes |
Subscription API streaming | yes | yes |
Compression: consuming | gzip (enabled by default) | gzip (enabled by default) |
Compression: publishing | gzip, zstd | gzip |
Error Handling | Automatic retry with exponential backoff | Automatic retry with exponential backoff |
OpenTracing | yes | yes |
Metrics Collection | yes (dropwizard,micrometer) | yes (dropwizard,micrometer) |
Access to Event Metadata | no | yes |
Event Type manipulation | no | yes |
If you have questions, concerns, bug reports, etc, please file an issue in this repository's issue tracker.
Fahrschein is a gradle-based project. For local development, Fahrschein requires:
- A local installation of JDK11
- A local Docker installation for running integration tests
When developing, make sure to run unit and integration tests with ./gradlew check
.
We use Gradle convention plugins to share common build logic across multiple subprojects. These are fahrschein.java-conventions.gradle for common java properties, and fahrschein.maven-publishing-conventions.gradle for subprojects that are released as maven artefacts.
Most dependencies are defined on a per-subproject level, only the versions for the most-used shared dependencies are controlled centrally, in the gradle.properties file. This also allows you testing your build with a different version by specifying the property on the command-line.
./gradlew check -Pjackson.version=2.9.0
The integration tests include running the build with our supported baseline dependency as well as the latest micro release of Jackson, Apache HttpClient and Spring. Please update the section in the README when bumping dependency baselines, and add this to the release notes.
Fahrschein automatically generates test coverage reports via its JaCoCo
build integration. For example, you can check code coverage for a single subproject:
./gradlew fahrschein-http-api:check
open fahrschein-http-api/build/reports/jacoco/test/html/index.html
Alternatively, run the full build and check the aggregated coverage report:
./gradlew check -Pe2e.skip
open build/reports/jacoco/jacocoAggregateReport/html/index.html
The fahrschein-e2e-test
module has end-to-end tests using docker-compose
. If you wish to leave the Docker containers running between tests, you can also bring up docker-compose manually using its docker-compose.yaml, before running the tests. In either case, you will need the port 8080
to be available for Nakadi to run.
docker-compose -f fahrschein-e2e-test/src/test/resources/docker-compose.yaml up -d
./gradlew :fahrschein-e2e:check -Pe2e.composeProvided
If you want to skip end-to-end tests completely, run
./gradlew check -Pe2e.skip
The project integrates CVE scanning to check for vulnerable dependencies. In case of build failure, this can be caused by a high-risk vulnerability in a dependency being identified. You can run the reporting locally:
./gradlew :dependencyCheckAggregate
open build/reports/dependency-check-report.html
Fahrschein uses Github Workflows to build and publish releases. This happens automatically whenever a new release is created in Github. After creating a release, please bump the project.version
property in gradle.properties.
If needed, you can preview (unsigned) release artifacts in your local maven repository.
./gradlew publishToMavenLocal -xsignMavenPublication
Check the contribution guidelines if you want to get involved in Fahrschein development.