Skip to content

Conversation

@jackylee-ch
Copy link
Contributor

@jackylee-ch jackylee-ch commented Oct 17, 2025

Closed #14340.

@github-actions github-actions bot added the docs label Oct 17, 2025

```sh
spark-shell --packages org.apache.iceberg:iceberg-spark-runtime-3.5_2.12:{{ icebergVersion }}
spark-shell --packages org.apache.iceberg:iceberg-spark-runtime-4.0_2.13:{{ icebergVersion }}
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

!!! info
<!-- markdown-link-check-disable-next-line -->
If you want to include Iceberg in your Spark installation, add the [`iceberg-spark-runtime-3.5_2.12` Jar](https://search.maven.org/remotecontent?filepath=org/apache/iceberg/iceberg-spark-runtime-3.5_2.12/{{ icebergVersion }}/iceberg-spark-runtime-3.5_2.12-{{ icebergVersion }}.jar) to Spark's `jars` folder.
If you want to include Iceberg in your Spark installation, add the [`iceberg-spark-runtime-{{ sparkVersion }}_{{ scalaVersion }}` Jar](https://search.maven.org/remotecontent?filepath=org/apache/iceberg/iceberg-spark-runtime-{{ sparkVersion }}_{{ scalaVersion }}/{{ icebergVersion }}/iceberg-spark-runtime-{{ sparkVersion }}_{{ scalaVersion }}-{{ icebergVersion }}.jar) to Spark's `jars` folder.
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

If sparkVersion always comes together with scalaVersion, I think we can just use one variable.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

done

# Spark Procedures

To use Iceberg in Spark, first configure [Spark catalogs](spark-configuration.md). Stored procedures are only available when using [Iceberg SQL extensions](spark-configuration.md#sql-extensions) in Spark 3.
To use Iceberg in Spark, first configure [Spark catalogs](spark-configuration.md). Stored procedures are only available when using [Iceberg SQL extensions](spark-configuration.md#sql-extensions) in Spark.
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

configuring the extension is not necessary for Spark 4.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Oh, yes, we have used Spark Call syntax from Spark 4.0, thanks for point out.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

also better to document behavior change for Spark 4 in CALL syntax resolving, see #13106 and SPARK-53523

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

For SPARK-53523, we may ignore it until Spark 4.1.0 is released and supported in Iceberg.

site/mkdocs.yml Outdated
flinkVersion: '2.0.0'
flinkVersionMajor: '2.0'
sparkVersion: '4.0'
scalaVersion: '2.13'
Copy link
Member

@pan3793 pan3793 Oct 17, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

sparkBinaryVersion and scalaBinaryVersion are more accurate


To use Iceberg in Spark, first configure [Spark catalogs](spark-configuration.md). Stored procedures are only available when using [Iceberg SQL extensions](spark-configuration.md#sql-extensions) in Spark 3.
To use Iceberg in Spark, first configure [Spark catalogs](spark-configuration.md).
For Spark 3.x, stored procedures are only available when using [Iceberg SQL extensions](spark-configuration.md#sql-extensions) in Spark.
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Please share a snapshot of this page after the PR.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

image

@jackylee-ch jackylee-ch force-pushed the refine_Spark_3_words_from_docs branch from 6003251 to 5825e88 Compare October 20, 2025 02:06
@huaxingao
Copy link
Contributor

@jackylee-ch Could you please also share a screenshot of spark-getting-started page? Thanks

@jackylee-ch
Copy link
Contributor Author

@jackylee-ch Could you please also share a screenshot of spark-getting-started page? Thanks

Sure, would done this later~

@jackylee-ch jackylee-ch force-pushed the refine_Spark_3_words_from_docs branch from 1320d80 to 6efed89 Compare October 30, 2025 12:33
Copy link
Contributor

@kevinjqliu kevinjqliu left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.


<!-- markdown-link-check-disable-next-line -->
[spark-runtime-jar]: https://search.maven.org/remotecontent?filepath=org/apache/iceberg/iceberg-spark-runtime-3.5_2.12/{{ icebergVersion }}/iceberg-spark-runtime-3.5_2.12-{{ icebergVersion }}.jar
[spark-runtime-jar]: https://search.maven.org/remotecontent?filepath=org/apache/iceberg/iceberg-spark-runtime-{{ sparkVersionMajor }}/{{ icebergVersion }}/iceberg-spark-runtime-{{ sparkVersionMajor }}-{{ icebergVersion }}.jar
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

nit: this isnt rendered
same problem on https://iceberg.apache.org/spark-quickstart/#learn-more right now

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It seems more reasonable to have the note point to the release at this stage. Exposing the spark-runtime-jar directly might appear a bit abrupt. Can we just remote this line?
Google Chrome 2025-11-10 10 31 50

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

agreed, we dont need to expose the jar directly

@kevinjqliu
Copy link
Contributor

btw if you merge main (to pull in #14267), you can use make serve-dev and render the site

@jackylee-ch
Copy link
Contributor Author

@jackylee-ch Could you please also share a screenshot of spark-getting-started page? Thanks

@huaxingao Sorry for the late response. I just built the doc locally, and here’s the screenshot of the Spark Getting Started page.

image image image image image image image image image

@jackylee-ch
Copy link
Contributor Author

LGTM, added a few nit comments on not removing the warning about spark 3.0.

@kevinjqliu Since we only support Spark 3.4 or later, users may not be able to run it with Spark 3.0. Therefore, the warning seems unnecessary to me.
cc @huaxingao @manuzhang

@kevinjqliu
Copy link
Contributor

Since we only support Spark 3.4 or later, users may not be able to run it with Spark 3.0. Therefore, the warning seems unnecessary to me.

makes sense, we have versioned docs so this warning still exists in the previous versions

Comment on lines 330 to +332

<!-- markdown-link-check-disable-next-line -->
[spark-runtime-jar]: https://search.maven.org/remotecontent?filepath=org/apache/iceberg/iceberg-spark-runtime-3.5_2.12/{{ icebergVersion }}/iceberg-spark-runtime-3.5_2.12-{{ icebergVersion }}.jar
[spark-runtime-jar]: https://search.maven.org/remotecontent?filepath=org/apache/iceberg/iceberg-spark-runtime-{{ sparkVersionMajor }}/{{ icebergVersion }}/iceberg-spark-runtime-{{ sparkVersionMajor }}-{{ icebergVersion }}.jar
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
<!-- markdown-link-check-disable-next-line -->
[spark-runtime-jar]: https://search.maven.org/remotecontent?filepath=org/apache/iceberg/iceberg-spark-runtime-3.5_2.12/{{ icebergVersion }}/iceberg-spark-runtime-3.5_2.12-{{ icebergVersion }}.jar
[spark-runtime-jar]: https://search.maven.org/remotecontent?filepath=org/apache/iceberg/iceberg-spark-runtime-{{ sparkVersionMajor }}/{{ icebergVersion }}/iceberg-spark-runtime-{{ sparkVersionMajor }}-{{ icebergVersion }}.jar

lets remove this then

@kevinjqliu kevinjqliu merged commit 5481507 into apache:main Nov 10, 2025
3 checks passed
@kevinjqliu
Copy link
Contributor

Thanks for the PR @jackylee-ch and thanks everyone for the review!

@kevinjqliu
Copy link
Contributor

Follow up to remove spark-runtime-jar #14555
I wasn't able to make the change on this PR

@jackylee-ch
Copy link
Contributor Author

Thanks for your help @kevinjqliu. And thanks for your review. @huaxingao @kevinjqliu @manuzhang @pan3793

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Remove Spark 3 specific wordings in docs

5 participants