1.17.1 - 2024-06-21
Added
- Java: dataset namespace resolver feature
#2720@pawel-big-lebowski
Adds a dataset namespace resolving mechanism that resolves dataset namespaces based on the resolvers configured. The core mechanism is implemented in openlineage-java and can be used within the Flink and Spark integrations. - Spark: add transformation extraction
#2758@tnazarew
Adds a transformation type extraction mechanism. - Spark: add GCP run and job facets
#2643@codelixir
AddsGCPRunFacetBuilderandGCPJobFacetBuilderto report additional facets when running on Google Cloud Platform. - Spark: improve namespace format for SQLServer
#2773@dolfinus
Improves the namespace format for SQLServer. - Spark: verify jar content after build
#2698@pawel-big-lebowski
Adds a tool to verifyshadowJarcontent and prevent reported issues. These are hard to prevent currently and require manual verification of manually unpacked jar content. - Spec: add transformation type info
#2756@tnazarew
Adds information about the transformation type inColumnLineageDatasetFacet.transformationTypeandtransformationDescriptionare marked as deprecated. - Spec: implementing facet registry (following #2161)
#2729@harels
Introduces the foundations of the new facet Registry into the repo. - Spec: register GCP common job facet
#2740@ngorchakova
Registers the GCP job facet that contains common attributes that will improve the way lineage is parsed and displayed by the GCP platform. Based on the proposal, GCP Lineage would like to define facets that are expected from integrations. The list of support facets is not final and will be extended further by next PR.