Data Analytics

Provides a brief overview of data analytics in HPE Ezmeral Unified Analytics Software.

HPE Ezmeral Unified Analytics Software provides a single place where data engineers and data scientists can run analytical workloads with the Apache Spark Operator, work in interactive sessions through Apache Livy, and schedule jobs with Apache Airflow.

ACID (Atomicity, Consistency, Isolation, and Durability) transactions for Spark applications are supported out of the box with Delta Lake. Delta Lake defines an open protocol, the Delta Transaction Protocol, that provides ACID transactions to Apache Spark applications. You can use any Apache Spark API to read and write data with Delta Lake. Delta Lake stores the data as versioned Parquet files.
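
To illustrate how a transaction log can yield versioned files, the following is a minimal sketch in plain Python. It is a simplified model, not the Delta Lake implementation: the `commit` and `snapshot` helpers are hypothetical, the actions carry only a `path` field, and no Parquet data is written. It assumes only the general layout of the protocol, in which each commit is a numbered JSON file under `_delta_log` that lists `add` and `remove` actions.

```python
import json
import os
import tempfile

LOG_DIR = "_delta_log"  # subdirectory that holds the ordered commit files


def commit(table_path, actions):
    """Write the next numbered commit file and return its version (hypothetical helper)."""
    log_dir = os.path.join(table_path, LOG_DIR)
    os.makedirs(log_dir, exist_ok=True)
    version = len([f for f in os.listdir(log_dir) if f.endswith(".json")])
    commit_file = os.path.join(log_dir, f"{version:020d}.json")
    # Mode "x" raises FileExistsError if the file already exists, so two
    # writers cannot both claim the same version number.
    with open(commit_file, "x") as fh:
        for action in actions:
            fh.write(json.dumps(action) + "\n")
    return version


def snapshot(table_path, version=None):
    """Replay commits up to `version` and return the set of live data files."""
    log_dir = os.path.join(table_path, LOG_DIR)
    live = set()
    for name in sorted(os.listdir(log_dir)):
        if not name.endswith(".json"):
            continue
        if version is not None and int(name.split(".")[0]) > version:
            break
        with open(os.path.join(log_dir, name)) as fh:
            for line in fh:
                action = json.loads(line)
                if "add" in action:
                    live.add(action["add"]["path"])
                elif "remove" in action:
                    live.discard(action["remove"]["path"])
    return live


table = tempfile.mkdtemp()
# Version 0 adds one data file; version 1 replaces it with a rewritten file.
v0 = commit(table, [{"add": {"path": "part-0000.parquet"}}])
v1 = commit(table, [
    {"remove": {"path": "part-0000.parquet"}},
    {"add": {"path": "part-0001.parquet"}},
])
latest = snapshot(table)               # files visible at the newest version
original = snapshot(table, version=0)  # files visible at version 0
```

Because a reader only sees files listed by completed commits, a partially written data file never becomes visible, and any earlier version can be reconstructed by replaying the log up to that point.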

HPE Ezmeral Unified Analytics Software simplifies data access, data workflows, and pipelines. It connects to multiple types of internal and external data sources, which you can explore with federated SQL queries and visualize in Superset dashboards. You can also use Spark to transform raw data sets into consumable formats for a data lakehouse.
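
As a rough illustration of what a federated SQL query looks like, the sketch below joins tables from two separate databases in a single statement. It uses Python's built-in `sqlite3` module purely as a stand-in; it is not the query engine in HPE Ezmeral Unified Analytics Software, and the `crm` and `sales` sources, their tables, and the sample rows are hypothetical.

```python
import sqlite3

# One connection with two independent in-memory databases attached,
# standing in for two separate data sources.
conn = sqlite3.connect(":memory:")
conn.execute("ATTACH DATABASE ':memory:' AS crm")
conn.execute("ATTACH DATABASE ':memory:' AS sales")

conn.execute("CREATE TABLE crm.customers (id INTEGER PRIMARY KEY, name TEXT)")
conn.execute("CREATE TABLE sales.orders (customer_id INTEGER, amount REAL)")
conn.executemany(
    "INSERT INTO crm.customers VALUES (?, ?)",
    [(1, "Acme"), (2, "Globex")],
)
conn.executemany(
    "INSERT INTO sales.orders VALUES (?, ?)",
    [(1, 120.0), (1, 80.0), (2, 50.0)],
)

# A single query spans both sources by qualifying each table with its source name.
rows = conn.execute(
    """
    SELECT c.name, SUM(o.amount) AS total
    FROM crm.customers AS c
    JOIN sales.orders AS o ON o.customer_id = c.id
    GROUP BY c.name
    ORDER BY total DESC
    """
).fetchall()

for name, total in rows:
    print(name, total)
```

The point of the pattern is that the query author addresses each source by name and lets the engine combine the results, rather than copying data into one place first.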