About Release 7.9.0
This site contains documentation for HPE Ezmeral Data Fabric release 7.9.0, including installation, configuration, administration, and reference content, as well as content for the associated ecosystem components and drivers.
7.9.0 Installation
This section contains information about installing HPE Ezmeral Data Fabric software. It also contains information about how to migrate data and applications from an Apache Hadoop cluster to a HPE Ezmeral Data Fabric cluster.
7.9.0 Data Fabric
HPE Ezmeral Data Fabric is the industry-leading data platform for AI and analytics that solves enterprise business needs.
7.9.0 Administration
This section describes how to manage the nodes and services that make up a cluster.
- Administering Users and Clusters
  Lists topics that help manage a Data Fabric cluster.
- Administering Nodes
  Provides a synopsis of managing nodes in a cluster.
- Administering Volumes
  This section provide information about how to organize and manage data using volumes, a unique feature of HPE Ezmeral Data Fabric clusters.
- Administering Files and Directories
- Administering Tables
  Administration of the HPE Ezmeral Data Fabric Database is done primarily via the command line (maprcli) or with the Managed Control System (MCS). Regardless of whether the HPE Ezmeral Data Fabric Database table is used for binary files or JSON documents, the same types of commands are used with slightly different parameter options. HPE Ezmeral Data Fabric Database administration is associated with tables, columns and column families, and table regions.
- Administering Streams
- Administering Data Fabric Gateways
  A HPE Ezmeral Data Fabric gateway mediates one-way communication between a source HPE Ezmeral Data Fabric cluster and a destination cluster. You can replicate HPE Ezmeral Data Fabric Database tables (binary and JSON) and HPE Ezmeral Data Fabric Streams streams. HPE Ezmeral Data Fabric gateways also apply updates from JSON tables to their secondary indexes and propagate Change Data Capture (CDC) logs.
- Administering Services
- Monitoring the Cluster
  This section describes how to monitor the health and performance of a MapR cluster.
  - Monitoring Using the Control System and the CLI
    Describes the Overview page in the Control System, which displays information about the cluster.
  - Using HPE Ezmeral Data Fabric Monitoring (Spyglass Initiative)
    HPE Ezmeral Data Fabric Monitoring (part of the Spyglass initiative) provides the ability to collect, store, and view metrics and logs for nodes, services, and jobs/applications.
    - HPE Ezmeral Data Fabric Monitoring Architecture
      HPE Ezmeral Data Fabric Monitoring integrates with open-source components to collect, aggregate, store, and visualize metrics and logs.
    - Metric Collection
      Metrics are collected from each node in the cluster so that administrators can use the data to monitor the cluster. In general, the collectd service collects metrics every 10 seconds. The exception is volume metrics which are collected every 10 minutes.
      - Configure Metric Retention
        By default, OpenTSDB stores two weeks of metrics. Based on your requirements, you can change the metric-retention period.
      - Configure Queue Filters for mapr.rm.<value> Metrics
        The YARN application metrics that are collected by JMX have the metric name syntax mapr.rm.<metric_name> and the metric values are aggregated among all the queues in the default queue. However, you can configure collectd to create a filter for each queue. As an alternative, you can use the REST API queue metrics (mapr.rm_queue.<metric_name>) which are by default set up for filtering by queue.
      - Configure the Collectd Service Heap Size
        The collectd service uses an embedded JVM when it gathers metrics from the CLDB, Node Manager, Resource Manager, and Drill. You can edit the Plugin Java section of collectd.conf to configure limits to the collectd virtual memory footprint.
      - CPU Metrics
        Every 10 seconds, the collectd service uses the cpu plugin to gather the following CPU metrics on each node in the cluster.
      - Disk Free Metrics
        Every 10 seconds, the collectd service uses the df plugin to gather the following disk free metrics on each node in the cluster.
      - Disk Metrics
        Every 10 seconds, the collectd service uses the disk plugin to gather the following disk metrics on each node in the cluster.
      - Drill Metrics
        Every 10 seconds, the collectd service uses the plugin to gather the following Drill metrics on each node in the cluster.
      - Hive JMX Metrics
        Every 10 seconds, the collectd service uses the Hive plug-in to gather the following Hive JMX metrics on each node in the cluster. Descriptions for the Hive metrics are not currently available.
      - Kafka JMX Metrics
        Starting in EEP 9.0.0, you can enable metrics collection for Kafka consumers and producers. When enabled, JMX collects Kafka producer and consumer metrics from client applications. You can view the metrics through the JConsole UI or JMXTerm CLI.
      - Load Metrics
        Every 10 seconds, the collectd service uses the load plugin to gather the following load metrics on each node in the cluster.
      - Alarm Metrics
        Every 10 seconds, the collectd service uses a plugin to gather the cluster alarms.
      - Cache Metrics
        Every 10 seconds, the collectd service uses a plugin to gather the following Data Fabric file system cache metrics on each node in the cluster.
      - CLDB Metrics
        Every 10 seconds, the collectd service uses a HPE Ezmeral Data Fabric plugin to gather the following CLDB metrics on the primary CLDB node in the cluster.
      - HPE Ezmeral Data Fabric Database Metrics
        Every 10 seconds, the collectd service uses a plugin to gather HPE Ezmeral Data Fabric Database metrics on each node in the cluster. HPE Ezmeral Data Fabric Database provides both node and table metrics.
      - HPE Ezmeral Data Fabric Streams Metrics
        Every 10 seconds, the collectd service uses a plugin to gather the following Streams metrics on each node in the cluster.
      - file system Metrics
        Every 10 seconds, the collectd service uses a HPE Data Fabric plugin to gather the following file system metrics on each node in the cluster.
      - Process Metrics
        Every 10 seconds, the collectd service uses a plugin to gather the following process metrics on each node in the cluster.
      - I/O Metrics
        Every 10 seconds, the collectd service uses a plugin to gather the following I/O metrics on each node in the cluster.
      - RPC Metric
        Every 10 seconds, the collectd service uses a plugin to gather the following RPC metrics on each node in the cluster.
      - Spark JMX Metrics
        Every 10 seconds, the collectd service gathers the following Spark JMX metrics on each node in the cluster.
      - Topology Metrics
        Every 60 seconds, the collectd service uses a plugin to gather the following topology metrics on each node in the cluster. Use these metrics to understand disk utilization across a topology or rack. By default, these metrics include all racks and topologies associated with the cluster. However, you can use tags to specify which rack(s) or topologies(s) to include. Note: Racks and topologies can span multiple nodes and one rack can be associated with multiple topologies.
      - Volume Metrics
        Every 10 seconds, the collectd service uses a plugin to gather the following volume metrics on each CLDB node in the cluster.
      - Virtual Memory Metrics
        Every 10 seconds, the collectd service uses the vmem plugin to gather the following memory metrics on each node in the cluster.
      - Memory Metrics
        Every 10 seconds, the collectd service uses the memory and swap plugins to gather the following memory metrics on each node in the cluster.
      - Network Metrics
        Every 10 seconds, the collectd service uses the interface plugin to gather network metrics on each node in the cluster.
      - Node Manager Metrics
        Every 10 seconds, the collectd service uses a plugin to gather the following Node Manager metrics on each node in the cluster.
      - Resource Manager Metrics
        Every 10 seconds, the collectd service uses a HPE Ezmeral Data Fabric plugin to gather Resource Manager metrics on the active Resource Manager. Collectd gathers metrics on the Resource Manager JVM process, YARN applications, and nodes that are managed by the Resource Manager. The method used to gather the metrics differs based on the metric type.
    - Configure the OpenTSDB Service Heap Size
      By default, the OpenTSDB service is configured to use a default heap size of 6 gigabytes. The default heap size can be adjusted by modifying certain configuration files.
    - Metric Visualization
      Use dashboards to visualize metrics across multiple nodes and clusters.
    - Log Collection
      Fluentd collects log events from each node in the cluster and stores them in a centralized location so that administrators can search the logs when troubleshooting issues in the cluster. The process that fluentd uses to parse and send log events to Elasticsearch differs based on the formatting of log events in each log file.
    - Log Aggregation and Storage
      Fluentd uses a round-robin approach when writing logs to Elasticsearch nodes. If an Elasticsearch node in unavailable, Fluentd can fail over log storage to another Elasticsearch node.
    - Log Visualization
      Use dashboards to visualize the logs across multiple nodes and clusters.
    - HPE Ezmeral Data Fabric Monitoring Tips and Troubleshooting
      Lists the nuances of monitoring clusters.
    - Reconfiguring Data Fabric Monitoring
      Changes to an existing cluster, such as the addition of services, may require additional steps to enable the collection of metrics and logs.
  - Configuring Data Fabric to Track User Behavior
    Describes how to configure Data Fabric to track user behavior.
- Configuring Security
  Describes how to configure security and manage secure clusters.
- Managing Secure Clusters
  Provides procedures that will enable you to use Data Fabric clusters securely.
- Administering the Data Access Gateway
  The HPE Ezmeral Data Fabric Data Access Gateway is a service that acts as a proxy and gateway for translating requests between lightweight client applications and the HPE Ezmeral Data Fabric cluster. This section describes considerations when upgrading the service, how to modify configuration settings, and how to administer and manage the service.
- Planning for High Availability
- Administrator's Reference
  This section contains in-depth reference information for the administrator.
- Troubleshooting Cluster Administration
  Lists the common errors and their solutions.
- Best Practices for Backing Up HPE Ezmeral Data Fabric Information
  Lists the best practices and performance considerations to follow when backing up HPE Ezmeral Data Fabric information.
- IPv6 Support in Data Fabric
  Describes the IPv6 support feature for Data Fabric.
7.9.0 Development
This section contains information related to application development for Ezmeral ecosystem components and HPE Ezmeral Data Fabric products, including the file system, Database (Key-Value and JSON), and Event Streams.
Other Docs
This section contains release-independent information, including: Installer documentation, Ecosystem release notes, interoperability matrices, security vulnerabilities, and links to other Data Fabric version documentation.
Glossary
Definitions for commonly used terms in MapR Converged Data Platform environments.

Metric Collection

Metrics are collected from each node in the cluster so that administrators can use the data to monitor the cluster. In general, the collectd service collects metrics every 10 seconds. The exception is volume metrics which are collected every 10 minutes.

When collectd writes metrics to streams, tags are assigned to each metric so that administrators can filter metric data to create dashboards that are specific to their needs.

By default, each metric contains the following tags:

fqdn: Displays values for a specified node.
clusterid: Displays values for a specific cluster.
clustername: As of EEP 3.0, displays values for a specific cluster.

However, many metrics have additional tags that you can use to filter metric data.

Streams store metrics in OpenTSDB with the following schema:

<metrictype.name> <fqdn:fqdnvalue> <clusterid:clusteridvalue> <clustername:clusternamevalue>[<AdditionalTagA:AdditionalTagAvalue> <AdditionalTagB:AdditionalTagBvalue>...] <metricvalue> <timestamp>

NOTE

A negative value shown in the metrics indicates that the maximum value configured for that metric is exceeded. The maximum value for GUT metrics is int32 (2^31-1).

For more information on using tags and dashboards, see Metric Visualization.

Partners Support Dev-Hub Community ALA Privacy Policy Glossary

HPE Ezmeral Data Fabric – Customer-Managed 7.9.0 Documentation
Abstract	This site contains documentation for the customer-managed platform of the HPE Ezmeral Data Fabric version 7.9.0 including installation, configuration, administration, and reference content, as well as content for the associated bundled ecosystem components and drivers.
Published	April 2025
Edition	7.9.0
Topic last updated	2021-02-01