Docs: Add information on how to read from branches and tags in Spark docs #6573

amogh-jahagirdar · 2023-01-12T21:03:35Z

https://github.com/apache/iceberg/pull/5150/files introduced the ability to read from branches and tags in Spark, but the docs haven't been updated. This change updates the docs and examples for reading from branches and tags.

docs/spark-queries.md

…query docs

jackye1995

looks good to me!

singhpk234

Looks good to me as well, Thanks @amogh-jahagirdar !

ajantha-bhat · 2023-01-13T07:19:37Z

docs/spark-queries.md

 * `snapshot-id` selects a specific table snapshot
 * `as-of-timestamp` selects the current snapshot at a timestamp, in milliseconds
+* `branch` selects the head snapshot of the specified branch. Note that currently branch cannot be combined with as-of-timestamp.
+* `tag` selects the snapshot associated with the specified tag


do we need to mention that tag also cannot be combined with as-of-timestamp.

or we can wait till #6575 gets merged. So that we don't have to mention it for both branch and tag. But we need to add an example in ##SQL also.

Definitely agree on having a SQL example once #6575 gets merged. For combining as-of-timestamp with tag I felt that was apparent since a tag can only map to a single snapshot which conflicts with passing in a timestamp, where as branch + time travel is a different case.

Given that is a syntax change, I am waiting for more time for others to take a look. I think we can first merge this one and add that later.

ajantha-bhat · 2023-01-13T07:21:49Z

docs/spark-queries.md

 #### DataFrame

 To select a specific table snapshot or the snapshot at some time in the DataFrame API, Iceberg supports two Spark read options:



we need to change two to four.

jackye1995 · 2023-01-14T01:33:05Z

Thanks everyone for the review, as I said in the thread for the SQL related changes, I will wait for some more time in case there are disagreements. I will merge this in first and we can add follow up PRs at this front.

one of my comments was not addressed in apache#6573. Hence, a follow-up PR. apache#6573 adds two more spark read options in the data frame time travel syntax.

github-actions bot added the docs label Jan 12, 2023

jackye1995 reviewed Jan 12, 2023

View reviewed changes

docs/spark-queries.md Outdated Show resolved Hide resolved

Docs: Add information on how to read from branches and tags in Spark …

1fbf385

…query docs

amogh-jahagirdar force-pushed the update-spark-branch-docs branch from b1ac360 to 1fbf385 Compare January 12, 2023 21:57

jackye1995 approved these changes Jan 12, 2023

View reviewed changes

singhpk234 approved these changes Jan 12, 2023

View reviewed changes

ajantha-bhat reviewed Jan 13, 2023

View reviewed changes

ajantha-bhat mentioned this pull request Jan 13, 2023

Spark 3.3: support version travel by reference name #6575

Merged

jackye1995 merged commit 7943e94 into apache:master Jan 14, 2023

ajantha-bhat mentioned this pull request Jan 16, 2023

Docs: Fix minor typo in time travel dataframe section #6601

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Docs: Add information on how to read from branches and tags in Spark docs #6573

Docs: Add information on how to read from branches and tags in Spark docs #6573

Uh oh!

amogh-jahagirdar commented Jan 12, 2023 •

edited

Loading

Uh oh!

Uh oh!

jackye1995 left a comment

Uh oh!

singhpk234 left a comment

Uh oh!

ajantha-bhat Jan 13, 2023

Uh oh!

ajantha-bhat Jan 13, 2023

Uh oh!

amogh-jahagirdar Jan 13, 2023

Uh oh!

jackye1995 Jan 14, 2023

Uh oh!

ajantha-bhat Jan 13, 2023

Uh oh!

jackye1995 commented Jan 14, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

		#### DataFrame

		To select a specific table snapshot or the snapshot at some time in the DataFrame API, Iceberg supports two Spark read options:

Docs: Add information on how to read from branches and tags in Spark docs #6573

Docs: Add information on how to read from branches and tags in Spark docs #6573

Uh oh!

Conversation

amogh-jahagirdar commented Jan 12, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

jackye1995 left a comment

Choose a reason for hiding this comment

Uh oh!

singhpk234 left a comment

Choose a reason for hiding this comment

Uh oh!

ajantha-bhat Jan 13, 2023

Choose a reason for hiding this comment

Uh oh!

ajantha-bhat Jan 13, 2023

Choose a reason for hiding this comment

Uh oh!

amogh-jahagirdar Jan 13, 2023

Choose a reason for hiding this comment

Uh oh!

jackye1995 Jan 14, 2023

Choose a reason for hiding this comment

Uh oh!

ajantha-bhat Jan 13, 2023

Choose a reason for hiding this comment

Uh oh!

jackye1995 commented Jan 14, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

amogh-jahagirdar commented Jan 12, 2023 •

edited

Loading