Configuring a Spark Application to Access External S3 Object Storage

Describes configuration options for connecting Spark to external S3 object storage.

You can configure a Spark application to connect to an external S3 data source directly or through the S3 proxy layer in HPE Ezmeral Unified Analytics Software.

The following diagram shows the two ways that applications in Unified Analytics access external S3 data sources: through a direct connection from the application to the data source (depicted by 1), or through the S3 proxy layer (depicted by 2, 3, and 4).

The S3 proxy layer securely connects Unified Analytics to external data sources, such as AWS S3, MinIO S3, and HPE Ezmeral Data Fabric Object Store.

When you configure a Spark application to access an S3 data source through the S3 proxy layer, you do not have to provide the access credentials or ask an administrator for access to the data source. Your Unified Analytics administrator creates the connections to external S3 data sources and provides the required access credentials (access key and secret key) at that time. Your administrator also grants permissions on the data sources. Your access to the data sources is authorized through Unified Analytics.
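The difference between the two connection methods can be sketched in terms of the Spark properties an application would carry. The following Python example is a minimal illustration, not the product's documented procedure: it assumes the application uses the Hadoop S3A connector (the `spark.hadoop.fs.s3a.*` properties), and the helper name `s3a_spark_conf`, both endpoint URLs, and the placeholder keys are hypothetical. How the S3 proxy layer authenticates a request is not covered in this overview, so the proxy variant simply omits the access key and secret key.

```python
from typing import Dict, Optional


def s3a_spark_conf(endpoint: str,
                   access_key: Optional[str] = None,
                   secret_key: Optional[str] = None,
                   path_style: bool = True) -> Dict[str, str]:
    """Build Spark properties for the Hadoop S3A connector.

    Hypothetical helper for illustration only. Credentials are optional so
    that the same function can describe a direct connection (keys supplied
    by the user) and a proxied connection (no keys in the application).
    """
    if (access_key is None) != (secret_key is None):
        raise ValueError("access_key and secret_key must be given together")

    conf = {
        "spark.hadoop.fs.s3a.endpoint": endpoint,
        # MinIO and similar stores usually need path-style addressing.
        "spark.hadoop.fs.s3a.path.style.access": str(path_style).lower(),
        # Enable TLS only when the endpoint URL uses https.
        "spark.hadoop.fs.s3a.connection.ssl.enabled":
            str(endpoint.startswith("https://")).lower(),
    }
    if access_key is not None:
        conf["spark.hadoop.fs.s3a.access.key"] = access_key
        conf["spark.hadoop.fs.s3a.secret.key"] = secret_key
    return conf


# Direct connection: the application itself holds the credentials.
direct = s3a_spark_conf("https://s3.example.com",
                        "EXAMPLE_ACCESS_KEY", "EXAMPLE_SECRET_KEY")

# Proxied connection: assumed proxy endpoint, no credentials in the app.
proxied = s3a_spark_conf("http://s3-proxy.example.internal:30000")

for key, value in sorted(proxied.items()):
    print(f"{key}={value}")
```

Each key-value pair in the returned dictionary could be passed to Spark as a `--conf` argument or set in the application's Spark configuration. The topics for each method give the actual values to use in Unified Analytics.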

To see the external S3 data sources that your administrator configured for you, sign in to the Unified Analytics UI, go to Data Engineering > Data Sources, and click the Object Store Data tab.

The following image shows an example of the Object Store Data tab with tiles for each of the connected external S3 data sources.

The following topics describe each of the methods (direct or S3 proxy) for connecting Spark to an external S3 data source.


HPE Ezmeral Unified Analytics Software 1.5 Documentation
Abstract	HPE Ezmeral Unified Analytics Software is a usage-based Software-as-a-Service (SaaS) model that operationalizes hybrid and multi-cloud modern analytical workloads through a simple user interface, easily installed and deployed in minutes. HPE Ezmeral Unified Analytics Software separates compute and storage for flexible, cost-efficient scalability to securely access data stored in multiple data platforms, enabling you to run traditional and advanced analytics workloads with open-source tools.
Published	July 2025
Edition	1.5.0
Topic last updated	2024-04-08