Airflow 2.7.3.0 - 2401 (EEP 9.2.1) Release Notes

The following notes relate specifically to the HPE Ezmeral Data Fabric Distribution for Apache Airflow. You may also be interested in the Apache Airflow home page.
Airflow Version: 2.7.3.0
Release Date: January 2024
HPE Version Interoperability: See EEP Components and OS Support.
Source on GitHub: https://github.com/mapr/airflow
GitHub Release Tag: 2.7.3.0-eep-921
Maven Artifacts: https://repository.mapr.com/maven/
Package Names: Navigate to http://package.ezmeral.hpe.com/releases/MEP/, and select your EEP (MEP) and OS to view the list of package names.
Documentation

New in This Release

  • This release updates the Airflow component to version 2.7.3.0.
  • The airflow db migrate command replaces db init and db upgrade as the command to create or upgrade the Airflow database.
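As a minimal sketch, the database step in an install or upgrade script could be wrapped as shown below. The function name migrate_airflow_db is hypothetical, and the sketch assumes that the airflow CLI from this release is on the PATH of the user running it.

```shell
# Hypothetical helper: create or upgrade the Airflow metadata database.
# Assumes the airflow CLI from this release is on PATH.
migrate_airflow_db() {
    # Replaces the older two-command flow:
    #   airflow db init      (first-time creation)
    #   airflow db upgrade   (schema upgrade)
    airflow db migrate
}
```

Calling migrate_airflow_db covers both a fresh installation and an upgrade, so a script no longer needs to choose between the two older commands.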

Fixes

None.

Known Issues and Limitations

  • The Installer can install Airflow, but cannot set up MySQL as the backend database for Airflow. The default Airflow database is SQLite.
  • Apache PySpark has many CVEs and is removed from the default Airflow dependencies. To use the Spark JDBC operator/hook from Apache, install PySpark as follows:
    1. Run source <airflow_home>/build/env/bin/activate.
    2. Run pip install pyspark==3.3.2.
    3. Run deactivate.
    NOTE: This process does not affect the Ezmeral Spark provider.
  • If the repair_pip_depends.sh script fails with the following error, run the script again:
    subprocess.CalledProcessError: Command 'krb5-config --libs gssapi' returned non-zero exit status 127.
    [end of output]
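The PySpark installation steps and the rerun of repair_pip_depends.sh described above could be scripted roughly as follows. This is a sketch under stated assumptions, not a supported procedure: the function names install_pyspark and run_with_one_retry are hypothetical, and the sketch assumes that <airflow_home>/build/env is a standard Python virtual environment whose activate script must be sourced rather than executed.

```shell
# Hypothetical helper: install PySpark into the Airflow virtual environment.
# $1 is the Airflow home directory (the <airflow_home> placeholder above).
install_pyspark() {
    airflow_home="$1"
    (
        # The subshell keeps the activated environment out of the caller's shell.
        # Assumption: activate is a standard venv script and must be sourced.
        . "$airflow_home/build/env/bin/activate" || exit 1
        pip install pyspark==3.3.2
        status=$?
        deactivate
        exit "$status"
    )
}

# Hypothetical helper: run a command, and run it once more if it fails.
run_with_one_retry() {
    "$@" || "$@"
}
```

For example, install_pyspark followed by the path to your Airflow home performs steps 1 through 3, and run_with_one_retry followed by the path to repair_pip_depends.sh on your node reruns the script a single time if the first attempt fails. The location of the script depends on your installation and is not given here.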

Resolved Issues

  • AIRFLOW-164: Airflow fails to start on a FIPS-enabled node. Extra steps are needed to install Airflow on a FIPS node. See Installation on a FIPS Node.