Docs: update spark doc about incremental scan #3796

ajantha-bhat · 2021-12-23T05:51:29Z

Some users in the slack are exploring incremental read in spark and we don't have document for the same. Hence this PR.

ajantha-bhat · 2021-12-31T02:13:18Z

rdblue · 2022-01-03T18:01:19Z

site/docs/spark-queries.md

+Currently gets only the data from `append` operation. Cannot support `replace`, `overwrite`, `delete` operations yet.
+Works with both V1 and V2 format-version.
+
+Incremental read is not yet supported by Spark's SQL syntax.


Let's remove "yet" because it is unclear whether it will be supported.

rdblue · 2022-01-03T18:01:39Z

site/docs/spark-queries.md

+```
+
+!!! Note
+Currently gets only the data from `append` operation. Cannot support `replace`, `overwrite`, `delete` operations yet.


If you want this to be in a note box, it needs to be indented with 4 spaces.

rdblue · 2022-01-03T18:02:11Z

site/docs/spark-queries.md

+
+!!! Note
+Currently gets only the data from `append` operation. Cannot support `replace`, `overwrite`, `delete` operations yet.
+Works with both V1 and V2 format-version.


Is this part of the note or a separate paragraph? Also, could you expand this to be a complete sentence?

rdblue · 2022-01-03T18:02:29Z

site/docs/spark-queries.md

+* `end-snapshot-id` End snapshot ID used in incremental scans (inclusive)
+
+```scala
+// get the data added after start-snapshot-id (10963874102873L) till end-snapshot-id (63874143573109L)


Typo: "till" should be "until"

rdblue · 2022-01-03T18:03:00Z

site/docs/spark-queries.md


+### Incremental read
+
+To read incremental data between the snapshots, Configure below Spark read options:


How about "To read appended data incrementally, use:"

rdblue · 2022-01-03T18:03:27Z

site/docs/spark-queries.md

+To read incremental data between the snapshots, Configure below Spark read options:
+
+* `start-snapshot-id` Start snapshot ID used in incremental scans (exclusive)
+* `end-snapshot-id` End snapshot ID used in incremental scans (inclusive)


This is optional. Omitting it will default to the current snapshot.

ajantha-bhat · 2022-01-04T04:57:12Z

@rdblue : Thanks for the review. I have handled the comments.

rdblue · 2022-01-04T17:53:45Z

Thanks, @ajantha-bhat!

* apache/iceberg#3723 * apache/iceberg#3732 * apache/iceberg#3749 * apache/iceberg#3766 * apache/iceberg#3787 * apache/iceberg#3796 * apache/iceberg#3809 * apache/iceberg#3820 * apache/iceberg#3878 * apache/iceberg#3890 * apache/iceberg#3892 * apache/iceberg#3944 * apache/iceberg#3976 * apache/iceberg#3993 * apache/iceberg#3996 * apache/iceberg#4008 * apache/iceberg#3758 and 3856 * apache/iceberg#3761 * apache/iceberg#2062 * apache/iceberg#3422 * remove restriction related to legacy parquet file list

github-actions bot added the docs label Dec 23, 2021

rdblue reviewed Jan 3, 2022

View reviewed changes

Docs: update spark doc about incremental scan

e66d584

ajantha-bhat force-pushed the doc branch from ae83e4c to e66d584 Compare January 4, 2022 04:56

rdblue approved these changes Jan 4, 2022

View reviewed changes

rdblue merged commit a4afaab into apache:master Jan 4, 2022

jackye1995 pushed a commit to jackye1995/iceberg-docs that referenced this pull request Feb 8, 2022

https://github.com/apache/iceberg/pull/3796

5eb37be

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Docs: update spark doc about incremental scan #3796

Docs: update spark doc about incremental scan #3796

Uh oh!

ajantha-bhat commented Dec 23, 2021

Uh oh!

ajantha-bhat commented Dec 31, 2021

Uh oh!

rdblue Jan 3, 2022

Uh oh!

rdblue Jan 3, 2022

Uh oh!

rdblue Jan 3, 2022

Uh oh!

rdblue Jan 3, 2022

Uh oh!

rdblue Jan 3, 2022

Uh oh!

rdblue Jan 3, 2022

Uh oh!

ajantha-bhat commented Jan 4, 2022

Uh oh!

rdblue commented Jan 4, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants


		### Incremental read

		To read incremental data between the snapshots, Configure below Spark read options:

Docs: update spark doc about incremental scan #3796

Docs: update spark doc about incremental scan #3796

Uh oh!

Conversation

ajantha-bhat commented Dec 23, 2021

Uh oh!

ajantha-bhat commented Dec 31, 2021

Uh oh!

rdblue Jan 3, 2022

Choose a reason for hiding this comment

Uh oh!

rdblue Jan 3, 2022

Choose a reason for hiding this comment

Uh oh!

rdblue Jan 3, 2022

Choose a reason for hiding this comment

Uh oh!

rdblue Jan 3, 2022

Choose a reason for hiding this comment

Uh oh!

rdblue Jan 3, 2022

Choose a reason for hiding this comment

Uh oh!

rdblue Jan 3, 2022

Choose a reason for hiding this comment

Uh oh!

ajantha-bhat commented Jan 4, 2022

Uh oh!

rdblue commented Jan 4, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants