Apache Zeppelin is an open source, Web-based data-science notebook. You can use it with Data Fabric components to conduct data discovery, ETL, machine learning, and data visualization.

You can run the package-based Zeppelin product only on a Data Fabric node (and not on an edge node). Out of the box, Zeppelin is integrated with open-source data-processing engines such as Apache Spark, Apache Drill, and Apache Hive, as well as with native Data Fabric engines (file system, HPE Ezmeral Data Fabric Database, and HPE Ezmeral Data Fabric Streams). Using the notebook simply requires connecting to Zeppelin through your browser.

Zeppelin provides the following benefits for your data-engineering and data-science use cases:

  • An interactive development environment for writing, testing, and sharing data processing code snippets
  • Support for a variety of interpreters for integrating with different backend components
  • Support for extensible visualization libraries

For release-specific information, see Zeppelin Release Notes (Package-Based).

For installation information, see Installing Zeppelin.

For additional information about Zeppelin, refer to the open source documentation.