About Release 7.9.0
This site contains documentation for HPE Ezmeral Data Fabric release 7.9.0, including installation, configuration, administration, and reference content, as well as content for the associated ecosystem components and drivers.
7.9.0 Installation
This section contains information about installing HPE Ezmeral Data Fabric software. It also contains information about how to migrate data and applications from an Apache Hadoop cluster to a HPE Ezmeral Data Fabric cluster.
7.9.0 Data Fabric
HPE Ezmeral Data Fabric is the industry-leading data platform for AI and analytics that solves enterprise business needs.
7.9.0 Administration
This section describes how to manage the nodes and services that make up a cluster.
7.9.0 Development
This section contains information related to application development for Ezmeral ecosystem components and HPE Ezmeral Data Fabric products, including the file system, Database (Key-Value and JSON), and Event Streams.
- Application Development Process
  Before you start developing applications on the HPE Ezmeral Data Fabric platform, consider how you will get the data into the platform, the storage format of the data, the type of processing or modeling that is required, and how the data will be accessed.
- File Store and Apps
  The following sections provide information about accessing the File Store with C and Java applications.
- HPE Ezmeral Data Fabric Database and Apps
  This section contains information about developing client applications for JSON and key-value tables.
- Apache Kafka Wire Protocol Service
  HPE Ezmeral Data Fabric Streams supports Apache Kafka Wire Protocol Service. Apache Kafka Wire Protocol Service is a TCP/IP service that emulates a Kafka cluster backed by HPE Ezmeral Data Fabric Streams. The service makes it possible for Apache Kafka clients written in any programming language to access topics in HPE Ezmeral Data Fabric Streams.
- HPE Ezmeral Data Fabric Streams and Apps
  HPE Ezmeral Data Fabric Streams brings integrated publish and subscribe messaging to HPE Ezmeral Data Fabric.
- MapReduce and Apps
  This section contains information associated with developing YARN applications.
- Kubernetes Interfaces for Data Fabric
  This section describes how to leverage the capabilities of the Kubernetes Interfaces for Data Fabric.
- Ecosystem Components
  The following sections provide information about each open-source project that is supported by the HPE Ezmeral Data Fabric.
- Maven and the HPE Ezmeral Data Fabric
  This section discusses topics associated with Maven and the HPE Ezmeral Data Fabric.
- Developer's Reference
  This section contains in-depth information for the developer.
  - HPE Ezmeral Data Fabric Database Shell (JSON Tables)
    The mapr dbshell is a tool that enables you to create and perform basic manipulation of JSON tables and documents. You run dbshell by typing mapr dbshell on the command line after logging into a node in a HPE Ezmeral Data Fabric cluster.
  - Utilities for HPE Ezmeral Data Fabric Database JSON Tables
    HPE Ezmeral Data Fabric Database JSON provides utilities to copy, export, and import data, compare table content, and verify the consistency of secondary indexes.
  - HPE Ezmeral Data Fabric Database HBase Shell (Binary Tables)
    You can manage HPE Ezmeral Data Fabric Database tables using HBase shell commands and additional HBase shell commands included in the HPE Ezmeral Data Fabric distribution of Hadoop.
  - Utilities for HPE Ezmeral Data Fabric Database Binary Tables
    HPE Ezmeral Data Fabric Database provides utilities to copy and compare data in HPE Ezmeral Data Fabric Database binary tables.
  - HPE Ezmeral Data Fabric Streams Utilities
  - YARN Commands
    This section describes the YARN commands.
  - Source Code for HPE Ezmeral Data Fabric Software
    HPE releases source code to the open-source community for enhancements that HPE has made to the Apache Hadoop project and other ecosystem components.
  - Hadoop Commands
    This section describes the Hadoop commands.
    - Hadoop Command Overview
      - Hadoop Syntax Summary
      - Supported Commands for Hadoop
      - Generic Options
      - Hadoop 3 API Changes
        Summarizes the API changes introduced in Hadoop 3.
    - hadoop archive
      The hadoop archive command creates a Hadoop archive, a file that contains other files. A Hadoop archive always has a *.har extension.
    - hadoop classpath
      The hadoop classpath command prints the class path needed to access the Hadoop jar and the required libraries.
    - hadoop daemonlog
      The hadoop daemonlog command gets and sets the log level for each daemon.
    - hadoop distcp
      The hadoop distcp command is a tool used for large inter- and intra-cluster copying.
    - hadoop fs
      The hadoop fs command runs a generic file system user client that interacts with the file system. Starting from EEP 7.1.0, all hadoop fs commands support operations on symlinks.
    - hadoop jar
      The hadoop jar command runs a program contained in a JAR file. Users can bundle their MapReduce code in a JAR file and execute it using this command.
    - hadoop job
      The hadoop job command enables you to manage MapReduce jobs.
    - hadoop mfs
      The hadoop mfs command displays directory information and contents, creates symbolic links and hard links, sets, gets, and removes Access Control Expressions (ACE) on files and directories, and sets compression and chunk size on a directory.
    - hadoop mradmin
      The hadoop mradmin command runs Map-Reduce administrative commands.
    - hadoop pipes
      The hadoop pipes command runs a pipes job.
    - hadoop queue
      The hadoop queue command displays job queue information.
    - hadoop version
      The hadoop version command prints the hadoop software version.
    - hadoop conf
      The hadoop conf command outputs the configuration information for this node to standard output.
- API Documentation
  HPE Ezmeral Data Fabric supports public APIs for file system, HPE Ezmeral Data Fabric Database, and HPE Ezmeral Data Fabric Streams. These APIs are available for application-development purposes.
Other Docs
This section contains release-independent information, including: Installer documentation, Ecosystem release notes, interoperability matrices, security vulnerabilities, and links to other Data Fabric version documentation.
Glossary
Definitions for commonly used terms in MapR Converged Data Platform environments.

Hadoop Command Overview

This section contains the following:

Partners Support Dev-Hub Community ALA Privacy Policy Glossary

HPE Ezmeral Data Fabric – Customer-Managed 7.9.0 Documentation
Abstract	This site contains documentation for the customer-managed platform of the HPE Ezmeral Data Fabric version 7.9.0 including installation, configuration, administration, and reference content, as well as content for the associated bundled ecosystem components and drivers.
Published	April 2025
Edition	7.9.0
Topic last updated	2022-09-22