0.24.0 - 2023-05-03
Added
- Support custom transport types
#1795@nataliezeller1
Adds a new interface,TransportBuilder, for creating custom transport types without having to modify core components of OpenLineage. - Airflow: dbt Cloud integration
#1418@howardyoo
Adds a new OpenLineage extractor for dbt Cloud that uses the dbt Cloud hook provided by Airflow to communicate with dbt Cloud via its API. - Spark: support dataset name modification using regex
#1796@pawel-big-lebowski
It is a common scenario to write Spark output datasets with a location path ending with/year=2023/month=04. The Spark parameterspark.openlineage.dataset.removePath.patternintroduced here allows for removing certain elements from a path with a regex pattern.
Fixed
- Spark: catch exception when trying to obtain details of non-existing table.
#1798@pawel-big-lebowski
This mostly happens when getting table details on START event while the table is still not created. - Spark: LogicalPlanSerializer
#1792@pawel-big-lebowski
ChangesLogicalPlanSerializerto make use of non-shaded Jackson classes in order to serializeLogicalPlans. Note: class names are no longer serialized. - Flink: fix Flink CI
#1801@pawel-big-lebowski
Specifies an older image version that succeeds on CI in order to fix the Flink integration.