Spark: Add iceberg s3 tables catalog support #4558 @mobuchowski
Fix incorrect Glue table symlink attachment when using S3 Tables as the Iceberg catalog by adding dedicated S3 Tables catalog support and preventing Glue fallback when the S3 Tables catalog is active.
Spark: Add Lakehouse Hive Catalog handling #4574 @tnazarew
Add support for the Lakehouse (formerly BigLake) Hive Catalog used by Spark jobs running on Managed Spark (formerly Dataproc).
Spark: Add Snowflake Iceberg REST catalog (Horizon) support #4546 @adnanhemani
Add support for the Snowflake Horizon Iceberg REST catalog, enabling the Spark integration to emit correct dataset identifiers with Snowflake namespaces for Iceberg tables managed through Snowflake's REST catalog.
Client/Java: Upgrade Kinesis Producer Library to 1.0.7 and AWS SDK to 2.44.3 #4536 @W-Ely
Upgrade the Amazon Kinesis Producer Library (KPL) from 0.15.12 to 1.0.7 and update AWS SDK dependencies to 2.44.3 to maintain compatibility and pick up upstream fixes.
Python: Log if JSON parsing fails for env var #4552 @kacpermuda
Add a warning log message when JSON parsing of OpenLineage configuration from environment variables fails, making configuration errors easier to diagnose.
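The behavior described above can be sketched roughly as follows. This is a minimal illustration, not the client's actual code: the helper name `load_json_env` and the variable name used in the usage note are hypothetical.

```python
import json
import logging
import os

log = logging.getLogger(__name__)


def load_json_env(name: str):
    """Return the parsed JSON value of env var `name` (hypothetical helper).

    Returns None when the variable is unset or does not contain valid JSON.
    """
    raw = os.environ.get(name)
    if raw is None:
        return None
    try:
        return json.loads(raw)
    except json.JSONDecodeError as exc:
        # Warn instead of failing silently so a malformed value is visible.
        log.warning("Could not parse %s as JSON: %s", name, exc)
        return None
```

With a hypothetical variable `EXAMPLE_CONFIG` set to `{bad`, the call `load_json_env("EXAMPLE_CONFIG")` returns `None` and emits a warning that names the variable and the parse error.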
Spark: Make iceberg catalog identifiers optional #4557 @mobuchowski
Change BaseCatalogTypeHandler.getIdentifier() to return Optional<DatasetIdentifier> instead of a nullable value, making the contract for Iceberg catalog type handlers more explicit and null-safe.
Spark: Refactor of Dataset builder #4587 @tnazarew
Refactor the Spark integration's dataset-building logic from overloaded methods to a more flexible builder pattern, improving extensibility and maintainability.
Python: Detect GitHub pull request head refs #4561 @hcthakur2004
Fix pull request number detection in the Python client to recognize refs/pull/<number>/head format (used in GitHub Actions pull_request events) in addition to the existing refs/pull/<number>/merge form.
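As an illustration of the two ref forms, a matcher along these lines accepts both. The function name and regular expression are assumptions made for this sketch and are not taken from the client's source.

```python
import re
from typing import Optional

# Accept both refs/pull/<number>/merge and refs/pull/<number>/head.
_PR_REF = re.compile(r"refs/pull/(\d+)/(?:merge|head)")


def pull_request_number(ref: str) -> Optional[int]:
    """Return the pull request number encoded in `ref`, or None (hypothetical helper)."""
    match = _PR_REF.fullmatch(ref)
    return int(match.group(1)) if match else None
```

Branch refs such as `refs/heads/main` do not match and yield `None`.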
Spark: Fix SnowflakeCatalogTypeHandler.getIdentifier() return type #4571 @mobuchowski
Fix SnowflakeCatalogTypeHandler.getIdentifier() to return Optional<DatasetIdentifier> as required by BaseCatalogTypeHandler, correcting a type mismatch introduced when the Snowflake Iceberg REST catalog handler was first added.
Spark: Map sqlserver to MsSqlDialect #4556 @dbathie-wtg
Map Spark JDBC sqlserver URLs to MsSqlDialect so SQL Server queries using bracketed identifiers ([schema].[table]) are parsed correctly during lineage extraction.
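The idea of choosing a SQL dialect from the JDBC URL can be sketched as below. The real change lives in the Spark integration, not in Python; the lookup table, the function name, and the `GenericDialect` fallback are all assumptions made for illustration. Only the sqlserver to MsSqlDialect pairing comes from the entry above.

```python
# Hypothetical lookup table; only the sqlserver entry is taken from the changelog.
DIALECT_BY_SUBPROTOCOL = {
    "sqlserver": "MsSqlDialect",
}


def dialect_for_jdbc_url(url: str, default: str = "GenericDialect") -> str:
    """Pick a dialect name from the subprotocol of a JDBC URL (hypothetical helper)."""
    # A JDBC URL has the shape jdbc:<subprotocol>:<rest>.
    parts = url.split(":", 2)
    if len(parts) < 3 or parts[0].lower() != "jdbc":
        return default
    return DIALECT_BY_SUBPROTOCOL.get(parts[1].lower(), default)
```

A URL such as `jdbc:sqlserver://localhost:1433;databaseName=sales` resolves to `MsSqlDialect`, while an unrecognized subprotocol falls back to the assumed default.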