Configuring Polaris Spark Client

Describes how to configure Apache Polaris Spark Client for Spark.

Polaris Spark Client

Apache Polaris version 1.0.1 has released a Spark client, which helps to provide an end-to-end solution for Apache Spark to manage Iceberg tables using Polaris.

Polaris Spark client includes Iceberg libraries and all other dependencies for working with Polaris and Iceberg tables.

Configuring Polaris Spark Client

  1. Download org.apache.polaris:polaris-spark-3.5_2.12:1.0.1.0-dep-1000 jar file from Maven repository, and add to <spark_home>/jars.
  2. After that you can use spark.sql.extensions=org.apache.iceberg.spark.extensions.IcebergSparkSessionExtensions and spark.sql.catalog.<catalog_name>=org.apache.iceberg.spark.SparkCatalog to start a session with Iceberg support.

For Spark session examples on how to connect to Polaris and Iceberg tables, see https://polaris.apache.org/releases/1.0.1/polaris-spark-client/.