Topic Partitions

Partitions, which exist within topics, are parallel, ordered, immutable sequences of messages that are continually appended to.

Topics can contain multiple partitions, which make topics scalable by spreading the load for a topic across multiple servers.
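
As a rough illustration of how partitions spread load while keeping messages ordered within each partition, the following sketch models a topic as a set of independent, append-only lists. It is a toy in-memory model, not the Data Fabric Streams API: the `Topic` class, the `publish` method, and the rule that a CRC32 hash of the key selects the partition are all assumptions made for this example.

```python
import zlib


class Topic:
    """Toy in-memory model of a partitioned topic (illustration only)."""

    def __init__(self, name, partitions):
        self.name = name
        # Each partition is an independent, append-only sequence of messages.
        self.partitions = [[] for _ in range(partitions)]

    def publish(self, key, value):
        # Assumed placement rule: a stable hash of the key picks the partition,
        # so messages with the same key always land in the same partition.
        index = zlib.crc32(key.encode("utf-8")) % len(self.partitions)
        log = self.partitions[index]
        log.append((key, value))
        # The offset is the position of the message within its partition.
        return index, len(log) - 1


topic = Topic("sensor-readings", partitions=3)
for i in range(9):
    key = f"sensor-{i % 4}"
    partition, offset = topic.publish(key, f"reading-{i}")
    print(f"{key} -> partition {partition}, offset {offset}")
```

In this model, ordering is guaranteed only within a partition, because each partition is appended to independently of the others.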

Downstream applications that read messages can read from multiple partitions within a topic for faster performance than would be possible if they read from a single partition per topic. Downstream applications can also scale by having separate instances read from separate partitions.
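
One way to picture separate instances reading from separate partitions is a simple round-robin split of partition IDs across instances. The sketch below is a hypothetical helper written for this page; it does not show how Data Fabric Streams actually assigns partitions to consumers, and the name `assign_partitions` is an assumption.

```python
def assign_partitions(partition_count, instance_count):
    """Split partition IDs across reader instances round-robin (illustrative)."""
    assignment = {instance: [] for instance in range(instance_count)}
    for partition in range(partition_count):
        # Partition p goes to instance p mod instance_count.
        assignment[partition % instance_count].append(partition)
    return assignment


# A topic with four partitions read by one, two, and four instances.
for instances in (1, 2, 4):
    print(instances, assign_partitions(4, instances))
```

With four partitions and two instances, the helper returns `{0: [0, 2], 1: [1, 3]}`, so each instance reads half of the partitions. If there are more instances than partitions, the extra instances in this model receive an empty list and have nothing to read.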

When you create or edit a stream, you can specify a default number of partitions for that stream's topics. Topics inherit the stream's default, but a topic can override it by setting its own number of partitions.
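
The inherit-or-override rule can be sketched as a small model. The `Stream` class, the `create_topic` method, the stream path, and the default value of 1 are assumptions for illustration only. In a real cluster you would set these values with the `-defaultpartitions` and `-partitions` parameters of the maprcli commands listed under Reference.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class Stream:
    """Toy model of a stream that carries a partition default for its topics."""

    path: str
    default_partitions: int = 1  # assumed default for this sketch
    topics: dict = field(default_factory=dict)

    def create_topic(self, name: str, partitions: Optional[int] = None) -> int:
        # An explicit count overrides the stream default; otherwise inherit it.
        count = self.default_partitions if partitions is None else partitions
        self.topics[name] = count
        return count


stream = Stream("/apps/sample-stream", default_partitions=3)
print(stream.create_topic("clicks"))                # inherits the stream default
print(stream.create_topic("orders", partitions=8))  # overrides the default
```

Here `clicks` ends up with 3 partitions (the stream default) and `orders` with 8 (its own setting).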

Performance

The default number of partitions for Data Fabric streams and topics can affect performance. Depending on the volume of messages being published to a topic, you might need to increase the default number of partitions so that messages can be consumed efficiently.

When a high volume of messages is being published to a topic:

- Multiple consumers, in consumer groups, reading from multiple partitions are handled more efficiently.
- Individual consumers, each reading from a single partition, are handled less efficiently.

Reference

The following topics provide more detailed information:

- See maprcli stream create for information about creating streams with the -defaultpartitions parameter.
- See maprcli stream edit for information about editing streams with the -defaultpartitions parameter.
- See maprcli stream topic create for information about creating topics with the -partitions parameter.
- See maprcli stream topic edit for information about modifying topics with the -partitions parameter.
- See maprcli stream topic info for information about topic data, including the -partitions parameter.
- See the HPE Ezmeral Data Fabric Streams Java API Library for the methods used to create and edit streams and to create and edit topics.

HPE Ezmeral Data Fabric – Customer-Managed 7.9.0 Documentation
Abstract	This site contains documentation for the customer-managed platform of the HPE Ezmeral Data Fabric version 7.9.0 including installation, configuration, administration, and reference content, as well as content for the associated bundled ecosystem components and drivers.
Published	April 2025
Edition	7.9.0
Topic last updated	2025-03-07