Preparing Clusters for Querying using Secondary Indexes on JSON Tables
Describes the tasks needed to prepare your environment so you can query HPE Ezmeral Data Fabric Database JSON tables using secondary indexes.
Installing with the Data Fabric Installer
To install Data Fabric using the Data Fabric installer, follow the steps outlined at Installing with the Installer.
Starting with release 6.0.1, you do not have to enable a separate query service to use secondary indexes. The Operational Applications with HPE Ezmeral Data Fabric Database template installs and configures the replication gateways needed to update secondary indexes in HPE Ezmeral Data Fabric Database JSON and includes the components needed to run OJAI queries.
You must enable the OJAI Distributed Query Service to use certain features. The following table summarizes the differences in the functionality of OJAI queries when you do and do not have the service enabled:
Service Not Enabled | Service Enabled |
---|---|
|
|
Selecting any of the following templates enables the OJAI Distributed Query Service:
- Operational Applications with HPE Ezmeral Data Fabric Database and Distributed Query Service
- Data Fabric Cluster: Batch, interactive and real-time analytics
- Analytics with HPE Ezmeral Data Fabric Database
You can also explicitly enable the OJAI Distributed Query Service by selecting the service in the Custom Services template.
For more information about installer templates, see Auto-Provisioning Templates.
For more information about how secondary index selection and execution works in HPE Ezmeral Data Fabric Database JSON, see Selection and Execution of Secondary Indexes.
Installing without the Data Fabric Installer
Other sections of the documentation describe the detailed steps for installing and configuring without the Data Fabric installer. Generally, you need to perform the following steps:
- Install software.
To install Data Fabric without using the Data Fabric installer, follow the steps outlined at Installing without the Installer. In addition to installing Data Fabric core packages, you also need to install Drill if you want advanced secondary index selection, sorts on large data sets, and parallel query execution. When installing Drill, make sure to Configure the OJAI Distributed Query Service.
- Install and configure replication gateways.
Updates are propagated from the JSON tables using the Gateways for Replicating HPE Ezmeral Data Fabric Database Tables. You need to install the replication gateways. Since the source JSON table and the secondary index are on the same volume within a cluster, configure an intracluster gateway. In this type of gateway, the source and destination clusters are the same.
If your gateways are running on the same nodes as CLDB, then no additional configuration steps are required. See Configuring Gateways for Table and Stream Replication for details about this scenario and other options for configuring your gateways.
Upgrades
Other sections of the documentation describe the detailed steps for upgrades. Generally, you need to perform the following steps:
- Upgrade your Data Fabric software by following the instructions at Upgrading Core or EEP Components.
- Install Drill, if you have not already done so and want to sort large data sets and run queries in parallel.
- When installing Drill, make sure to Configure the OJAI Distributed Query Service.
- If you are upgrading without the Data Fabric installer, follow step 2 in the previous section to install and configure replication gateways.
- Enable the replication support needed to propagate index updates by running the
following
command:
maprcli cluster feature enable -name mfs.feature.db.streams.v6.support
See Step 4: Enable New Features for further details.