
[server][common][vpj] Introduce ComplexVenicePartitioner to materialized view #1509

Open · wants to merge 3 commits into base: main from value-based-partitioner

Conversation

@xunyin8 (Contributor) commented on Feb 7, 2025

[server][common][vpj] Introduce ComplexVenicePartitioner to materialized view

The change will not work if the record is actually large and chunked. Proper chunking support is needed and will be addressed in a separate PR.

  1. Introduced ComplexVenicePartitioner, which extends VenicePartitioner and offers a new API to partition by value, allowing a possible one-to-many partition mapping.

  2. Added a value provider of type Lazy<GenericRecord> to VeniceViewWriter's processRecord API to access the deserialized value when needed, e.g. when a ComplexVenicePartitioner is involved.

  3. MergeConflictResultWrapper and WriteComputeResultWrapper will now provide the deserialized value on a best-effort basis. This is useful when we have already deserialized the value for a partial update operation, so the deserialized value can be passed directly to the materialized view writer.

  4. Refactored VeniceWriter to expose some APIs to child classes, and introduced ComplexVeniceWriter, which extends VeniceWriter. The reasoning is that ComplexVeniceWriter will have different APIs, used in MaterializedViewWriter and CompositeVeniceWriter, to write to materialized view partition(s), potentially involving a ComplexVenicePartitioner. Alternatively, we could push common logic from VeniceWriter down to AbstractVeniceWriter. However, ComplexVeniceWriter shares so much common logic with VeniceWriter (chunking, DIV support, pubSubAdapter, etc.) that this would make AbstractVeniceWriter too specialized and unable to offer the flexibility needed to support something like CompositeVeniceWriter.

  5. Overrode putLargeValue in ComplexVeniceWriter to skip chunking and writing large messages. Once we have proper chunking support, we need to be careful not to re-chunk when writing the same value to different partitions in ComplexVeniceWriter.
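The core idea in item 1, a partitioner that maps one record to many partitions based on its value, can be pictured with a minimal sketch. All class shapes below are assumptions for illustration only (the real VenicePartitioner API and the GenericRecord value type differ); only the value-based fan-out concept comes from the PR description.

```java
import java.util.Arrays;

// Assumed stand-in for the classic key-based partitioner: one key, one partition.
abstract class VenicePartitioner {
  public abstract int getPartitionId(byte[] keyBytes, int numPartitions);
}

// Sketch of the new concept: an additional API that partitions by value and
// may return multiple target partitions for a single record.
abstract class ComplexVenicePartitioner extends VenicePartitioner {
  // The real API works with a deserialized GenericRecord; Object is used here
  // to keep the sketch self-contained.
  public abstract int[] getPartitionId(byte[] keyBytes, Object value, int numPartitions);
}

// Hypothetical example: the value carries a list of region ids, and the record
// is fanned out to one partition per region.
class RegionFanOutPartitioner extends ComplexVenicePartitioner {
  @Override
  public int getPartitionId(byte[] keyBytes, int numPartitions) {
    return Math.floorMod(Arrays.hashCode(keyBytes), numPartitions);
  }

  @Override
  public int[] getPartitionId(byte[] keyBytes, Object value, int numPartitions) {
    int[] regions = (int[]) value;  // illustrative value shape, not Venice's
    int[] partitions = new int[regions.length];
    for (int i = 0; i < regions.length; i++) {
      partitions[i] = Math.floorMod(regions[i], numPartitions);
    }
    return partitions;
  }
}
```

The key contrast with the base class is the return type: one record can land in several materialized view partitions, which is why the view writer needs access to the deserialized value (item 2).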

How was this PR tested?

Added a new integration test with A/A, W/C, and a new test value-based partitioner.
Will add new unit tests once we have consensus on the API changes.

Does this PR introduce any user-facing changes?

  • No. You can skip the rest of this section.
  • Yes. Make sure to explain your proposed changes and call out the behavior change.

@FelixGV (Contributor) left a comment:


Just some early thoughts... I did not read the whole PR yet, but hopefully this is useful for discussing the API changes.

@xunyin8 force-pushed the value-based-partitioner branch from 0d80a11 to fa6c001 on February 13, 2025 07:18
@xunyin8 changed the title from [server][common][vpj] Introduce VeniceComplexPartitioner to materialized view to [server][common][vpj] Introduce ComplexVenicePartitioner to materialized view on Feb 13, 2025
@xunyin8 force-pushed the value-based-partitioner branch from fa6c001 to fee9bc7 on February 13, 2025 07:30
…zed view

The change will not work if record is actually large and chunked. Proper chunking
support is needed and will be addressed in a separate PR.

1. Introduced VeniceComplexPartitioner which extends VenicePartitioner and offers
a new API to partition by value, allowing a possible one-to-many partition mapping.

2. Added a value provider of type Lazy<GenericRecord> to VeniceViewWriter's processRecord
API to access the deserialized value when needed, e.g. when a VeniceComplexPartitioner is
involved.

3. MergeConflictResult will now provide the deserialized value on a best-effort basis.
This is useful when we have already deserialized the value for a partial update operation,
so the deserialized value can be passed directly to the materialized view writer.

4. Refactored VeniceWriter to expose an API to write to a desired partition with new
DIV. This is only used by the new method writeWithComplexPartitioner for now to handle
the partitioning and writes of the same value to multiple partitions. However, this newly
exposed API should also come in handy when we build proper chunking support to forward chunks
to predetermined view topic partitions.

5. writeWithComplexPartitioner in VeniceWriter will re-chunk when writing to each partition.
This should be optimized when we build proper chunking support.
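Items 4 and 5 describe a fan-out write path: partition by value, then write the same value once per target partition. The following is a minimal sketch of that flow with assumed names (the real VeniceWriter also manages chunking, DIV state, and callbacks, none of which is modeled here).

```java
import java.util.ArrayList;
import java.util.List;

// Sketch of the fan-out write path; all names are illustrative stand-ins.
class FanOutWriterSketch {
  // Assumed shape of the value-based partitioning callback.
  interface ValuePartitioner {
    int[] getPartitionIds(byte[] key, Object value, int numPartitions);
  }

  final List<String> log = new ArrayList<>();
  final int numPartitions;

  FanOutWriterSketch(int numPartitions) {
    this.numPartitions = numPartitions;
  }

  // Stand-in for the newly exposed per-partition write API (item 4).
  void writeToPartition(byte[] key, byte[] serializedValue, int partition) {
    log.add("partition-" + partition + ":" + serializedValue.length + "B");
  }

  // Sketch of writeWithComplexPartitioner: partition by value, then write the
  // same serialized value to each target partition. This loop is where the PR
  // currently re-chunks per partition (item 5), flagged as a future optimization.
  void writeWithComplexPartitioner(
      byte[] key, Object value, byte[] serialized, ValuePartitioner partitioner) {
    for (int p : partitioner.getPartitionIds(key, value, numPartitions)) {
      writeToPartition(key, serialized, p);
    }
  }
}
```

With proper chunking support, the chunks could be produced once and forwarded to each predetermined partition instead of being regenerated inside the loop.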
@xunyin8 force-pushed the value-based-partitioner branch from fee9bc7 to ca52bc5 on February 18, 2025 07:23
@gaojieliu (Contributor) left a comment:


The code change looks good overall.
I do think we need to take care of the comment I just left; it is very tricky because it is a race condition.

Lazy<GenericRecord> oldValueProvider = Lazy.of(() -> {
  ChunkedValueManifestContainer oldValueManifestContainer = new ChunkedValueManifestContainer();
  int oldValueReaderSchemaId = schemaRepository.getSupersetSchema(storeName).getId();
  return readStoredValueRecord(

The readStoredValueRecord method reads the most-recent data, meaning it tries the transient record cache first and then RocksDB.
For a WC-enabled store, a delete will update the transient record to null for the key, so I think this method will always return null.

Even if we perform the lookup before updating the transient record cache, it would still be wrong because it is a lazy function: by the time the ViewWriter tries to produce to view topics, it will read the most recent value, which is null. The situation becomes worse when parallel compute for AA/WC workloads is enabled, because all the updates to the same key in the same batch will be executed (updating the transient cache) before producing to version/view topics. That means for a delete operation, the lazy function can read the most-recent value, which may have been populated by a later put in the same batch.

Can we always do a non-lazy lookup until we find a more optimized solution?
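The race described above boils down to when the lookup is evaluated: a lazy provider captures the lookup, not the value, so it observes the cache state at produce time rather than at processing time. A minimal, self-contained demonstration (all names here, including readStoredValueRecord and the cache shape, are simplified stand-ins for illustration):

```java
import java.util.HashMap;
import java.util.Map;
import java.util.function.Supplier;

// Demonstrates why a lazy old-value lookup observes the post-delete cache
// state, while an eager lookup captures the value before the mutation.
class TransientCacheRaceDemo {
  static final Map<String, String> transientCache = new HashMap<>();

  // Simplified stand-in: reads most-recent data, transient cache first
  // (the real method falls back to RocksDB, omitted here).
  static String readStoredValueRecord(String key) {
    return transientCache.get(key);
  }

  // Returns {lazyResult, eagerResult} for the scenario in the comment.
  static String[] run() {
    transientCache.put("k", "v1");

    // Lazy provider: defers the lookup until .get() is called.
    Supplier<String> lazyOldValue = () -> readStoredValueRecord("k");
    // Eager (non-lazy) lookup: captures the value immediately.
    String eagerOldValue = readStoredValueRecord("k");

    // The delete updates the transient record cache to null for the key...
    transientCache.put("k", null);

    // ...so when the view writer finally produces, the lazy read sees null.
    return new String[] { lazyOldValue.get(), eagerOldValue };
  }
}
```

This is why the reviewer suggests an always-eager lookup as a stopgap: the eager read is pinned to the pre-mutation state, at the cost of deserializing even when the view writer never uses the value.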
