1.49.0 - 2026-06-10
Added
- Java: Add CassandraJdbcExtractor
#4610 @matveeysv Add CassandraJdbcExtractor to parse Cassandra JDBC URLs according to the driver specification, enabling lineage tracking for Cassandra databases via JDBC.
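
To illustrate the kind of parsing involved, here is a minimal sketch. It is not the CassandraJdbcExtractor implementation. It assumes only the simple single-host URL form jdbc:cassandra://host:port/keyspace?params, assumes a dataset namespace of the form cassandra://host:port, and falls back to Cassandra's default native port 9042. The class and method names are hypothetical.

```java
import java.net.URI;
import java.net.URISyntaxException;

// Hypothetical sketch; names and the supported URL form are assumptions,
// not the OpenLineage implementation.
final class CassandraJdbcUrlSketch {
    static final String PREFIX = "jdbc:cassandra://";
    static final int DEFAULT_PORT = 9042; // Cassandra native protocol default

    /** Holds the parts of the URL that matter for dataset naming. */
    static final class Parsed {
        final String host;
        final int port;
        final String keyspace; // null when the URL has no keyspace path

        Parsed(String host, int port, String keyspace) {
            this.host = host;
            this.port = port;
            this.keyspace = keyspace;
        }

        // Assumed namespace form: cassandra://host:port
        String namespace() {
            return "cassandra://" + host + ":" + port;
        }
    }

    static Parsed parse(String jdbcUrl) {
        if (jdbcUrl == null || !jdbcUrl.startsWith(PREFIX)) {
            throw new IllegalArgumentException("Not a Cassandra JDBC URL: " + jdbcUrl);
        }
        URI uri;
        try {
            // Drop the "jdbc:" prefix so java.net.URI can parse the remainder.
            uri = new URI(jdbcUrl.substring("jdbc:".length()));
        } catch (URISyntaxException e) {
            throw new IllegalArgumentException("Malformed URL: " + jdbcUrl, e);
        }
        String host = uri.getHost();
        if (host == null) {
            throw new IllegalArgumentException("No host in URL: " + jdbcUrl);
        }
        int port = uri.getPort() == -1 ? DEFAULT_PORT : uri.getPort();
        String path = uri.getPath();
        String keyspace = (path == null || path.length() <= 1) ? null : path.substring(1);
        // Query parameters are ignored here, so they never reach the namespace.
        return new Parsed(host, port, keyspace);
    }

    public static void main(String[] args) {
        Parsed p = parse("jdbc:cassandra://localhost:9042/sales?localdatacenter=dc1");
        System.out.println(p.namespace() + " keyspace=" + p.keyspace);
    }
}
```

Multi-host contact points and other driver-specific URL variants are deliberately out of scope for this sketch.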
Fixed
- dbt: Add missing column lineage for structured log processor
#4599 @fm100 Fix missing column lineage facet in DbtStructuredLogsProcessor when using --consume-structured-logs option by attaching the column lineage facet to the output dataset on node finished events.
- dbt: Use fully qualified job ID for externalQuery run facet for BigQuery
#4591 @fm100 Use a fully qualified job ID (with project ID and location) for the externalQueryId in the externalQuery run facet when using the dbt-bigquery adapter; also adds externalQuery run facet support when using --consume-structured-logs.
- Spark: Fix GCP Dataproc jobId when job has multiple attempts
#4598 @codelixir Exclude the dataproc_job_attempt_timestamp tag prefix when reading the job ID from Yarn tags in the GCP Dataproc facet, preventing the attempt timestamp from being reported as the job ID on retried jobs.
- Spark: Fix lakehouse detection mechanism
#4611 @tnazarew Fix incorrect Spark configuration properties used in the Lakehouse Hive Catalog detection logic and extend catalog detection to V2SessionCatalogHandler used by DatasetBuilders.
- Spark: Fix Snowflake column lineage for quoted identifiers
#4602 @mrpalash-amz Fix missing column lineage when the Snowflake Spark connector uses quoted identifiers (e.g. in AWS Glue ETL with dbtable path) by applying stripQuotes() normalization on both sides of identifier comparisons in ColumnLevelLineageBuilder and SqlCollector.
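
The effect of normalizing both sides of an identifier comparison can be sketched as follows. This is an illustrative assumption, not the code from the fix: the stripQuotes() shown here is a hypothetical stand-in that removes one pair of matching double quotes or backticks from a single identifier, and the class and helper names are invented for the example.

```java
import java.util.Objects;

// Hypothetical sketch; the real stripQuotes() in OpenLineage may behave differently.
final class QuotedIdentifierSketch {

    /** Removes one pair of surrounding double quotes or backticks, if present. */
    static String stripQuotes(String identifier) {
        if (identifier == null) {
            return null;
        }
        String s = identifier.trim();
        if (s.length() >= 2) {
            char first = s.charAt(0);
            char last = s.charAt(s.length() - 1);
            if (first == last && (first == '"' || first == '`')) {
                return s.substring(1, s.length() - 1);
            }
        }
        return s;
    }

    /** Compares two identifiers after normalizing both sides. */
    static boolean sameIdentifier(String left, String right) {
        return Objects.equals(stripQuotes(left), stripQuotes(right));
    }

    public static void main(String[] args) {
        // A raw string comparison misses the match, so no lineage edge is found.
        System.out.println("\"ORDER_ID\"".equals("ORDER_ID"));
        // Normalizing both sides makes the quoted and unquoted forms compare equal.
        System.out.println(sameIdentifier("\"ORDER_ID\"", "ORDER_ID"));
    }
}
```

Stripping only one side would still fail whenever the quoted form appears on the other side of the comparison, which is why the sketch normalizes both arguments.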