This repository has been archived by the owner on Jan 9, 2020. It is now read-only.

Releases: apache-spark-on-k8s/spark

v2.2.0-kubernetes-0.5.0

30 Oct 19:49
b008be3
Pre-release

This release is based on Spark 2.2 and requires Kubernetes 1.6 or later.

Major features and changes in this release include:

  • Support Kubernetes 1.8 by changing init-container annotation (#528).
  • Spark R support (#507).
  • Mount emptyDir volumes for temporary directories on executors in static allocation mode (#522).
  • Use the driver pod IP address for spark.driver.bindAddress (#533).

v2.2.0-kubernetes-0.4.0

25 Sep 22:40
Pre-release

Major features and fixes in this release include:

  • Support for HDFS locality.
  • Added an option to use a secret to mount small files in driver and executors.
  • Support for custom Kubernetes service account for the driver pod.
  • Support for executor java options.
  • Fixed conversion from MB to MiB in driver and executor memory specification.
  • Added configuration properties for injecting arbitrary Kubernetes secrets into the driver and executors.
  • Use a headless service to give a hostname to the driver (requiring Kubernetes DNS in the cluster).
  • Improved docker image build/push flow.
  • Added the ability to fail submission if submitter-local files are provided without the resource staging server URI.
  • Added reference YAML files for RBAC configs for driver and shuffle service.
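
The MB-to-MiB fix above concerns two units that are easy to confuse. The release note does not describe the original bug, so the following is only an arithmetic sketch of why the units are not interchangeable: a megabyte (Kubernetes suffix `M`) is 10^6 bytes, while a mebibyte (suffix `Mi`) is 2^20 bytes. The helper names are hypothetical and not part of the project.

```python
MB = 10**6    # megabyte (decimal): 1,000,000 bytes
MIB = 2**20   # mebibyte (binary): 1,048,576 bytes


def mb_to_bytes(mb: int) -> int:
    """Interpret a memory amount as decimal megabytes."""
    return mb * MB


def mib_to_bytes(mib: int) -> int:
    """Interpret a memory amount as binary mebibytes."""
    return mib * MIB


# The same number, 1024, read under each unit.
requested = 1024
as_mb = mb_to_bytes(requested)
as_mib = mib_to_bytes(requested)

# Bytes lost if a value meant as MiB is passed along as MB.
shortfall_bytes = as_mib - as_mb
shortfall_mib = shortfall_bytes / MIB

print(f"{requested} MB  = {as_mb} bytes")
print(f"{requested} MiB = {as_mib} bytes")
print(f"shortfall: {shortfall_bytes} bytes ({shortfall_mib} MiB)")
```

For a 1024-unit request the gap is roughly 47 MiB, which is enough to matter when a container's memory limit is sized tightly around the JVM heap plus overhead.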

Deprecations and removals:

  • Removed the previously deprecated properties spark.kubernetes.driver.labels, spark.kubernetes.driver.annotations,
    spark.kubernetes.executor.labels, and spark.kubernetes.executor.annotations. New properties supersede them;
    see the documentation for details.
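
Configurations that still set the removed properties need to be migrated. The sketch below assumes, without confirmation from these notes, that the removed properties held a comma-separated list of `key=value` pairs and that the replacements use one property per key with a prefix such as `spark.kubernetes.driver.label.`. Both the prefix and the helper `migrate_pairs` are illustrative assumptions; check the project documentation for the actual property names.

```python
from typing import Dict


def migrate_pairs(new_prefix: str, legacy_value: str) -> Dict[str, str]:
    """Split a legacy 'k1=v1,k2=v2' string into one property per key.

    new_prefix is an ASSUMED replacement prefix, e.g.
    'spark.kubernetes.driver.label.'; verify it against the docs.
    """
    props: Dict[str, str] = {}
    for pair in legacy_value.split(","):
        pair = pair.strip()
        if not pair:
            continue  # tolerate trailing commas and blank entries
        key, sep, value = pair.partition("=")
        if not sep or not key:
            raise ValueError(f"expected key=value, got {pair!r}")
        props[new_prefix + key] = value
    return props


# Hypothetical legacy value for spark.kubernetes.driver.labels.
legacy = "team=data, env=staging"
migrated = migrate_pairs("spark.kubernetes.driver.label.", legacy)

for name, value in sorted(migrated.items()):
    print(f"--conf {name}={value}")
```

A per-key layout has the practical advantage that values no longer need to avoid the `,` and `=` separators.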

v2.2.0-kubernetes-0.3.0

14 Aug 23:23

Spark-on-Kubernetes release rebased from the Apache Spark 2.2 branch

Features available with this release include:

  • Cluster-mode submission of Spark jobs to a Kubernetes cluster
  • Support for Scala, Java and PySpark
  • Static and Dynamic Allocation for Executors
  • Automatic staging of local resources onto Driver and Executor pods
  • Configurable security and credential management
  • HDFS, running on the Kubernetes cluster or externally
  • Launch jobs using kubectl proxy
  • Support for Kubernetes 1.6 - 1.7
  • Pre-built docker images
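
As a rough illustration of the cluster-mode submission listed above, the sketch below assembles a `spark-submit` argument list. It relies only on standard `spark-submit` flags (`--master`, `--deploy-mode`, `--class`, `--conf`) and the `k8s://` master URL scheme. The API server address, jar path, and helper name are placeholders assumed for the example, and any Kubernetes-specific properties this release requires (such as image settings) are deliberately left out because these notes do not list them.

```python
import shlex
from typing import Dict, List


def build_submit_command(api_server: str, main_class: str,
                         app_jar: str, conf: Dict[str, str]) -> List[str]:
    """Assemble an argv list for a cluster-mode spark-submit call."""
    cmd = [
        "bin/spark-submit",
        "--deploy-mode", "cluster",          # cluster mode, per the feature list
        "--master", f"k8s://{api_server}",   # Kubernetes master URL scheme
        "--class", main_class,
    ]
    for key, value in sorted(conf.items()):
        cmd += ["--conf", f"{key}={value}"]
    cmd.append(app_jar)                      # application jar goes last
    return cmd


# All values below are placeholders, not taken from the release notes.
command = build_submit_command(
    api_server="https://192.0.2.10:6443",
    main_class="org.apache.spark.examples.SparkPi",
    app_jar="local:///opt/spark/examples/jars/spark-examples.jar",
    conf={"spark.app.name": "spark-pi", "spark.executor.instances": "2"},
)

# Print a shell-safe rendering without executing anything.
print(" ".join(shlex.quote(arg) for arg in command))
```

Building an argv list rather than a single string avoids quoting problems if the command is later handed to `subprocess.run`.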

v2.1.0-kubernetes-0.3.1

08 Aug 22:05
Pre-release

This is a bug-fix release and contains the following changes:

  • Flag-guard expensive DNS lookup of cluster node full names, part of HDFS locality support (#412)
  • Fixed PySpark submission failure on --py-files (#407)

v2.1.0-kubernetes-0.3.0

24 Jul 16:50
Pre-release

Changelog:

  • Support specifying CPU cores and memory limits for the driver (#340)
  • Generate the application ID label irrespective of app name. (#331)
  • Create base-image and minimize layer count (#324)
  • Added log4j config for k8s unit tests. (#314)
  • Use node affinity to launch executors on preferred nodes benefitting from data locality (#316)
  • New API for custom labels and annotations. (#346)
  • Allow the Spark driver to find shuffle pods in a specified namespace (#357)
  • Bypass init-containers when possible (#348)
  • Add a config option for a hard CPU limit on pods; the default is unlimited (#356)
  • Allow number of executor cores to have fractional values (#361)
  • Python Bindings for launching PySpark Jobs from the JVM (#364)
  • Submission client redesign to use a step-based builder pattern (#365)
  • Add node selectors for driver and executor pods (#355)
  • Retry binding server to random port in the resource staging server test. (#378)
  • Set RestartPolicy=Never for executors (#367)
  • Read classpath entries from SPARK_EXTRA_CLASSPATH on executors. (#383)
  • Changes to support executor recovery behavior during static allocation. (#244)
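
Fractional executor cores (#361) map naturally onto Kubernetes CPU quantities, where one CPU equals 1000 millicores and `500m` means half a core. The changelog does not say how the project performs this mapping internally, so the following is only a sketch of the unit conversion; the function name and the round-up policy are assumptions made for the example.

```python
from decimal import Decimal, ROUND_CEILING


def cores_to_k8s_cpu(cores: str) -> str:
    """Convert a core count such as '0.5' into a Kubernetes CPU quantity.

    Kubernetes expresses CPU in millicores (1 CPU == 1000m) and does not
    accept anything finer than 1m, so fractions are rounded up here.
    """
    value = Decimal(cores)  # Decimal avoids binary float rounding surprises
    if value <= 0:
        raise ValueError(f"cores must be positive, got {cores!r}")
    millicores = (value * 1000).to_integral_value(rounding=ROUND_CEILING)
    return f"{int(millicores)}m"


for requested in ("0.5", "1.5", "2"):
    print(requested, "->", cores_to_k8s_cpu(requested))
```

Parsing the setting as a string with `Decimal` keeps values like `0.1` exact, which a plain `float` multiplication would not guarantee.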

v2.1.0-kubernetes-0.2.0

05 Jun 18:22
Pre-release

First beta release of Spark with Kubernetes support.
Based on upstream Spark 2.1.0.

New:

  • File staging server for local files
  • Dynamic allocation of executors
  • Stability and bug fixes

Limitations:

  • Applications can only run in cluster mode.
  • Only Scala and Java applications can be run.
  • No high availability (HA).

v2.1.0-kubernetes-0.1.0-alpha.1

17 Mar 23:34
Pre-release

First alpha release of Spark with Kubernetes support.
Based on upstream Spark 2.1.0.

What works:

  • Basic functionality (submission, aggregating/streaming logs, etc.)
  • Submission of local files
  • Custom images for driver/executor
  • Spark Jobs can use a fixed number of executors

Limitations:

  • Dynamic allocation is not supported.
  • Applications can only run in cluster mode.
  • Only Scala and Java applications can be run.