About Release 7.10.0
This site contains documentation for HPE Ezmeral Data Fabric release 7.10.0, including installation, configuration, administration, and reference content, as well as content for the associated ecosystem components and drivers.
7.10.0 Installation
This section contains information about installing HPE Ezmeral Data Fabric software. It also contains information about how to migrate data and applications from an Apache Hadoop cluster to a HPE Ezmeral Data Fabric cluster.
7.10.0 Upgrade
This section describes how to upgrade HPE Ezmeral Data Fabric software.
7.10.0 Data Fabric
HPE Ezmeral Data Fabric is the industry-leading data platform for AI and analytics that solves enterprise business needs.
7.10.0 Administration
This section describes how to manage the nodes and services that make up a cluster.
- Working with Multiple Fabrics Using the Data Fabric UI
  This section describes how you can get started learning about, installing, and using the HPE Ezmeral Data Fabric.
- Administering Users and Clusters
  Lists topics that help manage a Data Fabric cluster.
- Administering Nodes
  Provides a synopsis of managing nodes in a cluster.
- Administering Volumes
  This section provide information about how to organize and manage data using volumes, a unique feature of HPE Ezmeral Data Fabric clusters.
- Administering Files and Directories
- Administering Tables
  Administration of the HPE Ezmeral Data Fabric Database is done primarily via the command line (maprcli) or with the Managed Control System (MCS). Regardless of whether the HPE Ezmeral Data Fabric Database table is used for binary files or JSON documents, the same types of commands are used with slightly different parameter options. HPE Ezmeral Data Fabric Database administration is associated with tables, columns and column families, and table regions.
- Administering Streams
- Administering Data Fabric Gateways
  A HPE Ezmeral Data Fabric gateway mediates one-way communication between a source HPE Ezmeral Data Fabric cluster and a destination cluster. You can replicate HPE Ezmeral Data Fabric Database tables (binary and JSON) and HPE Ezmeral Data Fabric Streams streams. HPE Ezmeral Data Fabric gateways also apply updates from JSON tables to their secondary indexes and propagate Change Data Capture (CDC) logs.
- Administering Services
- Monitoring the Cluster
  This section describes how to monitor the health and performance of a MapR cluster.
  - Monitoring Using the Control System and the CLI
    Describes the Overview page in the Control System, which displays information about the cluster.
  - Using HPE Ezmeral Data Fabric Monitoring (Spyglass Initiative)
    HPE Ezmeral Data Fabric Monitoring (part of the Spyglass initiative) provides the ability to collect, store, and view metrics and logs for nodes, services, and jobs/applications.
  - Configuring Data Fabric to Track User Behavior
    Describes how to configure Data Fabric to track user behavior.
    - Upgrade Considerations for User Behavior Tracking
      Describes the backward compatibility considerations for user behavior tracking with insight gathering, when upgrading from Data Fabric release version 7.8 to 7.9.
    - Enabling Insight Gathering in Trial Mode
      Describes how to enable insight collection in trial mode.
    - Enabling Insight Gathering in Production Mode
      Describes the steps to enable insight gathering in production mode.
- Configuring Security
  Describes how to configure security and manage secure clusters.
- Managing Secure Clusters
  Provides procedures that will enable you to use Data Fabric clusters securely.
- Administering the Data Access Gateway
  The HPE Ezmeral Data Fabric Data Access Gateway is a service that acts as a proxy and gateway for translating requests between lightweight client applications and the HPE Ezmeral Data Fabric cluster. This section describes considerations when upgrading the service, how to modify configuration settings, and how to administer and manage the service.
- Planning for High Availability
- Administrator's Reference
  This section contains in-depth reference information for the administrator.
- Troubleshooting Cluster Administration
  Lists the common errors and their solutions.
- Best Practices for Backing Up HPE Ezmeral Data Fabric Information
  Lists the best practices and performance considerations to follow when backing up HPE Ezmeral Data Fabric information.
- IPv6 Support in Data Fabric
  Describes the IPv6 support feature for Data Fabric.
7.10.0 Development
This section contains information related to application development for Ezmeral ecosystem components and HPE Ezmeral Data Fabric products, including the file system, Database (Key-Value and JSON), and Event Streams.
Other Docs
This section contains release-independent information, including: Installer documentation, Ecosystem release notes, interoperability matrices, security vulnerabilities, and links to other Data Fabric version documentation.
Glossary
Definitions for commonly used terms in MapR Converged Data Platform environments.

Enabling Insight Gathering in Production Mode

Describes the steps to enable insight gathering in production mode.

Prerequisites

The following prerequisites must be met before you can start insight gathering in production mode:

The cluster/fabric on which you wish to track user behavior has the insight service installed on all the nodes of the cluster or the desired nodes.
Hive Metastore must be installed with a production grade RDBMS like MySQL, Postgres, MariaDB. HPE recommends that you have the Hive Metastore running in high availability mode.

About this task

Insight gathering can be enabled on few nodes, but the approach does not give a complete picture of the events taking place on the cluster/fabric.

HPE recommends that insight gathering is enabled on all nodes, when you wish to gather insights in production mode. In other words, insight gathering must be enabled at the global level in production mode.

The insight service automatically runs in production mode when Hive Metastore is configured with a production-grade RBDMS. The insight service picks the audit logs directly from the audit log files, and adds them to the respective Iceberg tables. Audit streaming is not required for insight gathering in production mode.

Insight gathering is more efficient and more scalable for file or production mode as files containing audit data are distributed on the global level.

Follow the steps given below to enable insight gathering in production mode.

Procedure

Enable audit. See Enabling and Disabling Auditing of Cluster Administration to enable auditing for cluster administration.
Configure a production-grade database for the Hive Metastore. Restart Hive Metastore after the database is changed.
Enable insight. See insight cluster to enable insight.

Results

Insight gathering automatically begins in production mode after Hive Metastore successfully configured with a production-grade RDBMS.

The insight data is gathered on the following Apache Iceberg tables.

Data from cldb audit file is pushed to the cldb_is table.
Data from auth audit file is pushed to the auth_is table.
Data from mfs audit file is pushed to the mfs_is table.
Data from s3 audit file is pushed to the s3_is table.

Partners Support Dev-Hub Community ALA Privacy Policy Glossary

HPE Ezmeral Data Fabric 7.10.0 Documentation
Abstract	This site contains documentation for the customer-managed platform of the HPE Ezmeral Data Fabric version 7.10.0 including installation, configuration, administration, and reference content, as well as content for the associated bundled ecosystem components and drivers.
Published	October 2025
Edition	7.10.0
Topic last updated	2025-04-25