This site contains documentation for HPE Ezmeral Data Fabric release 7.9.0, including installation, configuration, administration, and reference content, as well as content for the associated ecosystem components and drivers.
This section contains information about installing and upgrading HPE Ezmeral Data Fabric software. It also contains information about how to migrate data and applications from an Apache Hadoop cluster to an HPE Ezmeral Data Fabric cluster.
HPE Ezmeral Data Fabric is the industry-leading data platform for AI and analytics that solves enterprise business needs.
This section describes how to manage the nodes and services that make up a cluster.
This section contains information related to application development for Ezmeral ecosystem components and HPE Ezmeral Data Fabric products, including the file system, Database (Key-Value and JSON), and Event Streams.
Before you start developing applications on the HPE Ezmeral Data Fabric platform, consider how you will get the data into the platform, the storage format of the data, the type of processing or modeling that is required, and how the data will be accessed.
The following sections provide information about accessing the File Store with C and Java applications.
This section contains information about developing client applications for JSON and key-value tables.
HPE Ezmeral Data Fabric Streams supports the Apache Kafka Wire Protocol Service, a TCP/IP service that emulates a Kafka cluster backed by HPE Ezmeral Data Fabric Streams. The service makes it possible for Apache Kafka clients written in any programming language to access topics in HPE Ezmeral Data Fabric Streams.
HPE Ezmeral Data Fabric Streams brings integrated publish and subscribe messaging to HPE Ezmeral Data Fabric.
This section contains information associated with developing YARN applications.
This section describes how to leverage the capabilities of the Kubernetes Interfaces for Data Fabric.
The following sections provide information about each open-source project that is supported by the HPE Ezmeral Data Fabric.
This topic provides an overview of Apache Airflow on HPE Ezmeral Data Fabric.
This topic describes how to configure the Hive client on a Data Fabric client node.
This section guides you through configuring the MSCK REPAIR TABLE command to compare and update the partitions in the Hive Metastore and the file system.
This section describes changes made to the default Hive configuration and shows how to configure Hive after a manual installation.
The authentication method that you configure for the Hive Metastore, HiveServer2, and WebHCat determines how these Hive components access and connect to each other.
This topic describes the manual and automatic options to configure Hive for SCRAM token authentication.
When you configure encryption, the Thrift messages sent between the Hive Metastore, HiveServer2, and HiveServer2 clients are encrypted.
EEP 4.0 introduces a default configuration for Hive Metastore password encryption using the Data Fabric Installer. The password is stored in the hive-site.xml file.
HPE Ezmeral Data Fabric has built-in platform authorization that protects all data regardless of the execution engine. This topic describes alternative authorization modes you can choose to implement.
The Fallback Hive Authorizer is used by Hive DDL (Data Definition Language) tasks for access control and for checking authorization from Driver.doAuthorization().
This section describes how to enable high availability for HiveServer2 and the Hive Metastore.
Describes HPE Ezmeral Data Fabric-specific features in Hive.
This topic describes the public API changes that occurred between Hive 2.3.9 EEP 8.1.0 and Hive 3.1.3 EEP 9.0.0.
This topic describes the public API changes that occurred between Hive 2.1 EEP 5.0.0 and Hive 2.3 EEP 6.0.0.
This section describes Hive logging for Hive 2.1 and later releases and includes information about log splitting.
Apache Livy is primarily used to provide integration between Hue and Spark.
Describes the supported HPE Ezmeral Data Fabric Streams tools and clients.
This topic provides an overview of Apache NiFi on HPE Ezmeral Data Fabric.
This topic provides an overview of OpenTelemetry on HPE Ezmeral Data Fabric.
This section discusses topics associated with Maven and the HPE Ezmeral Data Fabric.
This section contains in-depth information for the developer.
HPE Ezmeral Data Fabric supports public APIs for file system, HPE Ezmeral Data Fabric Database, and HPE Ezmeral Data Fabric Streams. These APIs are available for application-development purposes.
This section contains release-independent information, including Installer documentation, ecosystem release notes, interoperability matrices, security vulnerabilities, and links to documentation for other Data Fabric versions.
Definitions for commonly used terms in MapR Converged Data Platform environments.
You can configure the following features for Hive security: