Conversation

@SreeramGarlapati commented May 19, 2021

This work is an extension of the idea in issue #179 and of the Spark2 work done in PR #2272; the difference is that this one targets Spark3.

In the current implementation:

  • An Iceberg Snapshot is the upper bound for a MicroBatch: a given MicroBatch only spans files within a single Snapshot and is never composed of multiple Snapshots. BatchSize limits the number of files read within a given Snapshot. (A usage sketch follows this list.)
  • By default, the streaming reader ignores Snapshots of type DELETE or REPLACE. Rationale: DELETEs are common for GDPR compliance, and REPLACEs are common during table maintenance / compaction rewrites.
  • OVERWRITEs are not handled; left for future work.
  • Columnar reads are not enabled; left for future work.
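
For illustration, a minimal sketch of how a consumer might start such a micro-batch read, assuming the usual format("iceberg") streaming source; the option name streaming-max-files-per-micro-batch stands in for the BatchSize knob and is an assumption, not necessarily what this PR exposes:

import org.apache.spark.sql.Dataset;
import org.apache.spark.sql.Row;
import org.apache.spark.sql.SparkSession;

public class IcebergStreamReadSketch {
  public static void main(String[] args) throws Exception {
    SparkSession spark = SparkSession.builder()
        .appName("iceberg-micro-batch-read")
        .getOrCreate();

    // Each micro-batch is bounded by a single snapshot; the (hypothetical)
    // option below caps the number of files read from that snapshot.
    Dataset<Row> stream = spark.readStream()
        .format("iceberg")
        .option("streaming-max-files-per-micro-batch", "100")
        .load("db.table");

    stream.writeStream()
        .format("console")
        .option("checkpointLocation", "/tmp/checkpoints/db.table")
        .start()
        .awaitTermination();
  }
}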

cc: @aokolnychyi & @RussellSpitzer & @holdenk @rdblue @rdsr

github-actions bot added the spark label May 19, 2021
return new SparkBatchQueryScan(
    spark, table, caseSensitive, schemaWithMetadataColumns(), filterExpressions, options);
// TODO: understand how to differentiate that this is a spark streaming microbatch scan.
if (false) {

@SreeramGarlapati commented May 19, 2021


@aokolnychyi / @RussellSpitzer / @holdenk
Spark3 provides the ScanBuilder abstraction to define all types of Scans (Batch, MicroBatch, and Continuous), but the current implementation / class modelling has SparkBatchScan as the sole Scan implementation.
Looking at some of the concerns of BatchScan (state maintenance of the single SnapshotId to read from, asOfTimestamp, and features like vectorized reads), none of these seem relevant to streaming Scans.
So I feel we need to split streaming Scans out into a different class.
Does this thought process make sense?
If we go this route, do you folks know how to pass different Scan objects to Spark based on Batch vs Streaming? (One possibility is sketched below.)
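
On the last question: in Spark 3's DataSource V2 API, ScanBuilder.build() returns a single Scan, and Spark itself picks the entry point (toBatch() for batch queries, toMicroBatchStream() for streaming), so batch and streaming logic can live behind separate implementations. A minimal sketch against the real Spark interfaces; everything besides those interfaces is hypothetical:

import org.apache.spark.sql.connector.read.Batch;
import org.apache.spark.sql.connector.read.Scan;
import org.apache.spark.sql.connector.read.streaming.MicroBatchStream;
import org.apache.spark.sql.types.StructType;

class SparkScanSketch implements Scan {
  private final StructType schema;

  SparkScanSketch(StructType schema) {
    this.schema = schema;
  }

  @Override
  public StructType readSchema() {
    return schema;
  }

  @Override
  public Batch toBatch() {
    // Batch-only concerns (single snapshot-id state, as-of-timestamp,
    // vectorized reads) would live in the implementation returned here.
    throw new UnsupportedOperationException("sketch only");
  }

  @Override
  public MicroBatchStream toMicroBatchStream(String checkpointLocation) {
    // Streaming-only concerns (offset tracking, snapshot iteration)
    // would live in the implementation returned here.
    throw new UnsupportedOperationException("sketch only");
  }
}

Spark calls toMicroBatchStream() only for readStream() queries, so no extra plumbing is needed to route the two cases.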

@SreeramGarlapati changed the title from "[WorkInProgress] Spark3 structured streaming micro_batch read support" to "Spark3 structured streaming micro_batch read support" on May 29, 2021
public void stop() {
}

private String getOffsetLogLocation(String checkpointLocation) {

@SreeramGarlapati commented May 29, 2021


@aokolnychyi @RussellSpitzer @rdblue @holdenk - do you folks know of a better way to read the last checkpointed offset?
I need this hack for cases where the Spark cluster goes down and the stream has to restart from an old checkpoint. I definitely do NOT plan to keep this hack; I am trying to understand a better way to do this. I would truly appreciate any help here.
I started a conversation in the Iceberg Slack channel.
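
For context, Spark's structured streaming checkpoint stores each micro-batch's offsets as a file named by batch id under <checkpointLocation>/offsets, so one (hacky) way to locate the last checkpointed offset is to scan that directory. A minimal sketch of that approach; the class and method names are mine, and parsing the offset file's JSON body is left out:

import java.io.IOException;
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FileStatus;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;

class OffsetLogPeek {
  // Returns the path of the highest-numbered offset file, or null if none exist.
  static Path lastOffsetFile(String checkpointLocation, Configuration conf) throws IOException {
    Path offsetsDir = new Path(checkpointLocation, "offsets");
    FileSystem fs = offsetsDir.getFileSystem(conf);
    Path latest = null;
    long maxBatchId = -1L;
    for (FileStatus status : fs.listStatus(offsetsDir)) {
      try {
        long batchId = Long.parseLong(status.getPath().getName());
        if (batchId > maxBatchId) {
          maxBatchId = batchId;
          latest = status.getPath();
        }
      } catch (NumberFormatException ignored) {
        // skip non-offset entries such as .crc checksum files
      }
    }
    return latest;
  }
}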

// Walk the snapshot lineage backwards from the table's current snapshot until
// reaching the direct child of the snapshot the micro-batch start offset points
// at, verifying that every snapshot along the way is an APPEND.
Snapshot previousSnapshot = table.snapshot(microBatchStartOffset.snapshotId());
Snapshot pointer = table.currentSnapshot();
while (pointer != null && previousSnapshot.snapshotId() != pointer.parentId()) {
  Preconditions.checkState(pointer.operation().equals(DataOperations.APPEND),

TODO: Add unit test coverage for the overwrite operation.


@holdenk commented Jun 1, 2021

Ok, this is great work; I'm going to have to get back up to speed on the streaming stuff. Perhaps @viirya has some thoughts here as well.
From the PR description, though, I'm not sure I understand the logic behind skipping deletes, since GDPR compliance is rather important.

@SreeramGarlapati

@holdenk - thanks for your first take on the PR. I would be happy to hear @viirya's thoughts as well. I am unable to tag @viirya; please see if you can...

The PR description's note about GDPR is there to decide between
a) ignoring deletes by default, or
b) taking a special flag in order to ignore deletes.

The reasoning is as follows:

  • In Spark structured streaming, we stream the Iceberg table row by row, so there is no way to stream deletes out of an Iceberg table.
  • This implies that when we encounter deletes, we are left with two choices:
    1. fail with an UnsupportedOperation error for the DELETE snapshot operation, or
    2. ignore deletes and move on.
  • Almost all users of Iceberg tables want to be GDPR compliant.
  • Which implies they will delete some rows out of their Iceberg tables and still want to stream reads out of those tables.
  • So, if we throw UnsupportedOperation whenever we encounter a Snapshot of type DELETE while reading off of an Iceberg table, potentially all tables out there will need to handle this!
  • My proposal is therefore to accept that Iceberg tables will have GDPR deletes; i.e., if the Iceberg table has Snapshots marked as DELETE, we ignore those Snapshots for streaming-read purposes. In a later PR I will expose a Spark option that gives the ability to fail the streaming read if a DELETE is encountered (sketched after this list).
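
For concreteness, a minimal sketch of the proposed skip-on-DELETE policy; the flag and class are hypothetical, not code from this PR:

import org.apache.iceberg.DataOperations;
import org.apache.iceberg.Snapshot;

class DeleteSnapshotPolicy {
  private final boolean ignoreDeleteSnapshots;  // assumed to default to true

  DeleteSnapshotPolicy(boolean ignoreDeleteSnapshots) {
    this.ignoreDeleteSnapshots = ignoreDeleteSnapshots;
  }

  // Returns true if the snapshot should be streamed, false if it should be skipped.
  boolean shouldStream(Snapshot snapshot) {
    if (DataOperations.DELETE.equals(snapshot.operation())) {
      if (ignoreDeleteSnapshots) {
        return false;  // GDPR deletes happen; skip the snapshot and move on
      }
      throw new UnsupportedOperationException(
          "Encountered DELETE snapshot " + snapshot.snapshotId() + " during streaming read");
    }
    return true;
  }
}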

Did this make sense!? Happy to discuss.

@SreeramGarlapati deleted the spark3.stream.read branch June 2, 2021 06:31
@SreeramGarlapati

From the PR description though, I'm not sure I understand the logic behind skipping deletes since GDPR compliance is rather important.

Hi @holdenk - after giving it a bit more thought, I actually think otherwise now; overall, I agree with you. Ignoring deletes from tables might spook some of the consumers of this data. For now, I removed the IgnoreDeletes part from the PR and updated the description to reflect the same here: #2660

PS: I was playing around with this branch to squash-merge my 28 commits into one (the history was very chatty), and in the process GitHub closed this PR and is not letting me reopen it. So I had to create a brand new PR.

