Airflow 2.9.3.0 (EEP 9.3.1) Release Notes

The following notes relate specifically to the HPE Ezmeral Data Fabric Distribution for Apache Airflow. You may also be interested in the Apache Airflow home page.
Airflow Version 2.9.3.0
Release Date October 2024
HPE Version Interoperability See EEP Components and OS Support.
Source on GitHub https://github.com/mapr/airflow
GitHub Release Tag 2.9.3.0-eep-931
Package Names Navigate to http://package.ezmeral.hpe.com/releases/MEP/, and select your EEP(MEP) and OS to view the list of package names.
Documentation

New in This Release

This release:
  • Updates the Airflow component to version 2.9.3.0.
  • Adds MapR-FS as the schema for remote logging.
  • Sets secure cookies to true for secure clusters by default.

Known Issues and Limitations

  • The Installer can install Airflow, but cannot set up MySQL as the backend database for Airflow. The default Airflow database is SQLite.​
  • Apache PySpark has many CVEs and is removed from the default Airflow dependencies. To use the Spark JDBC operator/hook from Apache, install PySpark as follows:
    1. Run <airflow_home>/build/env/bin/activate.
    2. Run pip install pyspark==3.3.3.
    3. Run deactivate.
    4. NOTE
      This process does not affect the HPE Ezmeral Spark provider.
  • If the repair_pip_depends.sh script fails with the following error, you must run the script again:
    subprocess.CalledProcessError: Command 'krb5-config --libs gssapi' returned non-zero exit      status 127. 
     [end of output] 

Resolved Issues

None.