Configuring Polaris Spark Client
Describes how to configure Apache Polaris Spark Client for Spark.
Polaris Spark Client
Apache Polaris version 1.0.1 has released a Spark client, which helps to provide an end-to-end solution for Apache Spark to manage Iceberg tables using Polaris.
Polaris Spark client includes Iceberg libraries and all other dependencies for working with Polaris and Iceberg tables.
Configuring Polaris Spark Client
- Download
org.apache.polaris:polaris-spark-3.5_2.12:1.0.1.0-dep-1000jar file from Maven repository, and add to<spark_home>/jars. - After that you can use
spark.sql.extensions=org.apache.iceberg.spark.extensions.IcebergSparkSessionExtensionsandspark.sql.catalog.<catalog_name>=org.apache.iceberg.spark.SparkCatalogto start a session with Iceberg support.
For Spark session examples on how to connect to Polaris and Iceberg tables, see https://polaris.apache.org/releases/1.0.1/polaris-spark-client/.