Airflow 2.5.1.100 - 2405 (EEP 8.1.2) Release Notes

The following notes relate specifically to the HPE Ezmeral Data Fabric Distribution for Apache Airflow. You may also be interested in the Apache Airflow home page.
Airflow Version               2.5.1.100
Release Date                  May 2024
HPE Version Interoperability  See EEP Components and OS Support.
Source on GitHub              https://github.com/mapr/airflow
GitHub Release Tag            2.5.1.100-eep-812
Maven Artifacts               https://repository.mapr.com/maven/
Package Names                 Navigate to http://package.ezmeral.hpe.com/releases/MEP/, and select your EEP (MEP) and OS to view the list of package names.
Documentation

New in This Release

This release updates the Airflow component to version 2.5.1.100.

Fixes

This HPE release includes the following fixes on top of the base release:
Commit      Date (YYYY-MM-DD)  Comment
88c0ef18fd  2024-03-19         AIRFLOW-210: Created scheduler log directory for next day to avoid race condition issues
0fd7e86056  2024-02-26         AIRFLOW-205: Added log information for the case when the recorded PID does not equal the task PID
f5eec18873  2023-10-17         AIRFLOW-185: Check and create scheduler log directory before scheduling

Known Issues and Limitations

  • Airflow is not supported on FIPS-enabled nodes.
  • Starting with EEP 8.1.1, Apache PySpark is removed from the default Airflow dependencies. To use the Spark JDBC operator/hook from Apache, install PySpark as follows:
    1. Run source <airflow_home>/build/env/bin/activate.
    2. Run pip install pyspark==<spark_version>.
    3. Run deactivate.
    NOTE: This process does not affect the Ezmeral Spark provider.
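
The PySpark installation steps above can also be scripted. The following is a minimal sketch, not an HPE-provided tool. It assumes the Airflow environment under <airflow_home>/build/env follows the standard Python virtual environment layout, so that calling the environment's own interpreter with -m pip installs into that environment without a separate activate and deactivate. The function names, the example path, and the example version are hypothetical.

```python
import subprocess
from pathlib import Path
from typing import List


def pyspark_install_command(airflow_home: str, spark_version: str) -> List[str]:
    """Build the pip command that installs PySpark into Airflow's environment."""
    # Assumption: standard virtual environment layout, so the interpreter
    # sits next to the activate script in <airflow_home>/build/env/bin.
    venv_python = Path(airflow_home) / "build" / "env" / "bin" / "python"
    # Pin PySpark to the Spark version in use, as in step 2 of the notes.
    return [str(venv_python), "-m", "pip", "install", f"pyspark=={spark_version}"]


def install_pyspark(airflow_home: str, spark_version: str) -> None:
    """Run the install; raises CalledProcessError if pip exits with an error."""
    subprocess.run(pyspark_install_command(airflow_home, spark_version), check=True)


# Hypothetical usage; substitute your own Airflow home and Spark version:
# install_pyspark("/path/to/airflow_home", "<spark_version>")
```

Building the command separately from running it makes it possible to log or review the exact pip invocation before it changes the environment.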

Resolved Issues

  • None.