Jump to main content
HPE Data Fabric 8.0.0 Software Documentation
  • About Release 8.0.0

    This site contains documentation for HPE Data Fabric release 8.0.0, including installation, configuration, administration, and reference content, as well as content for the associated ecosystem components and drivers.

  • 8.0.0 Installation

    This section contains information about installing HPE Data Fabric software. It also contains information about how to migrate data and applications from an Apache Hadoop cluster to a HPE Ezmeral Data Fabric cluster.

  • 8.0.0 Upgrade

    This section describes how to upgrade HPE Data Fabric software.

  • 8.0.0 Data Fabric

    HPE Data Fabric is the industry-leading data platform for AI and analytics that solves enterprise business needs.

  • 8.0.0 Administration

    This section describes how to manage the nodes and services that make up a cluster.

  • 8.0.0 Development

    This section contains information related to application development for Ezmeral ecosystem components and HPE Data Fabric products, including the file system, Database (Key-Value and JSON), and Event Streams.

    • Application Development Process

      Before you start developing applications on the HPE Data Fabric platform, consider how you will get the data into the platform, the storage format of the data, the type of processing or modeling that is required, and how the data will be accessed.

    • File Store and Apps

      The following sections provide information about accessing the File Store with C and Java applications.

    • HPE Data Fabric Database and Apps

      This section contains information about developing client applications for JSON and key-value tables.

    • Apache Kafka Wire Protocol Service

      HPE Data Fabric Streams supports Apache Kafka Wire Protocol Service. Apache Kafka Wire Protocol Service is a TCP/IP service that emulates a Kafka cluster backed by HPE Data Fabric Streams. The service makes it possible for Apache Kafka clients written in any programming language to access topics in HPE Data Fabric Streams.

    • Model Context Protocol (MCP)
    • HPE Data Fabric Streams and Apps

      HPE Data Fabric Streams brings integrated publish and subscribe messaging to HPE Data Fabric.

    • MapReduce and Apps

      This section contains information associated with developing YARN applications.

    • Kubernetes Interfaces for Data Fabric

      This section describes how to leverage the capabilities of the Kubernetes Interfaces for Data Fabric.

    • Ecosystem Components

      The following sections provide information about each open-source project that is supported by the HPE Data Fabric.

      • Ecosystem Packs

      • Apache Airflow

        This topic provides an overview of Apache Airflow on HPE Data Fabric.

      • AsyncHBase

      • Cascading

      • Apache Drill
      • Apache Flink
      • Hadoop
      • HBase

      • HBase Client and HPE Data Fabric Database Binary Tables

      • HCatalog
      • Hive
      • HttpFS
      • Hue
      • Livy

        Apache Livy is primarily used to provide integration between Hue and Spark.

      • HPE Data Fabric Streams Clients and Tools

        Describes the supported HPE Data Fabric Streams tools and clients.

      • NiFi

        This topic provides an overview of Apache NiFi on HPE Data Fabric.

      • OTel

        This topic provides an overview of OpenTelemetry on HPE Data Fabric.

      • Apache Polaris
      • Ranger
      • Apache Spark
      • YARN
        • ResourceManager

          Describes the role of the ResourceManager.

        • ApplicationMaster

          Describes the role of the ApplicationMaster.

        • MapReduce Version 2

          Provides an overview of how MapReduce works.

        • How Applications Work in YARN

          Describes the data flow during application execution in YARN.

        • Direct Shuffle on YARN

          Explains the shuffle phase of a MapReduce application.

        • Apache Shuffle on YARN

          You can disable Direct Shuffle and enable Apache Shuffle by modifying the configuration options in the yarn-site.xml and mapred-site.xml files. This page describes how to configure Apache Shuffle for MapReduce applications.

        • Logging Options on YARN

          Describes the logging options that are available on YARN.

        • Support for ADLS

          Starting with MapR 6.1, you can use Azure Data Lake Store (ADLS) as a data source or destination for all applications.

          • Prerequisites for Using ADLS

            Setting up Azure Data Lake Store (ADLS) on the Azure portal enables you to access ADLS from any application.

          • Authenticating ADLS Account

            To access data stored in Azure Data Lake Store (ADLS), you must first authenticate your ADLS account using your ADLS credentials.

        • Configuring ATS 1.0 or 1.5 for Hadoop 3.3 (Required for Tez UI)

          Describes how to configure the YARN Application Timeline Server (ATS) 1.0 and 1.5 for Hadoop 3.3.x. You must complete this process in order to use the Tez UI.

        • Configuring ATS 2.0 for Hadoop 3.3

          Describes how to install and configure the YARN Application Timeline Server (ATS) 2.0 for Hadoop 3.3.

      • Zeppelin

    • Maven and the HPE Data Fabric

      This section discusses topics associated with Maven and the HPE Data Fabric.

    • Developer's Reference

      This section contains in-depth information for the developer.

    • API Documentation

      HPE Data Fabric supports public APIs for file system, HPE Data Fabric Database, and HPE Data Fabric Streams. These APIs are available for application-development purposes.

  • Other Docs

    This section contains release-independent information, including: Installer documentation, Ecosystem release notes, interoperability matrices, security vulnerabilities, and links to other Data Fabric version documentation.

  • Glossary

    Definitions for commonly used terms in MapR Converged Data Platform environments.

Prerequisites for Using ADLS

Setting up Azure Data Lake Store (ADLS) on the Azure portal enables you to access ADLS from any application.

  • Create an account on the Azure portal.
  • Create an Azure Data Lake Store (get started with Azure Data Lake Storage).
Partners Support Dev-Hub Community ALA Privacy Policy Glossary
Document information
HPE Data Fabric 8.0.0 Software Documentation
Abstract This site contains documentation for HPE Data Fabric Software version 8.0.0 including installation, configuration, administration, and reference content, as well as content for the associated bundled ecosystem components and drivers.
Published November 2025
Edition 8.0.0
EZDF 8.0.x
Search