0.20.4 - 2023-02-07
Added
- Airflow: add new extractor for
GCSToGCSOperator#1495@sekikn
Adds a new extractor for this operator. - Flink: resolve topic names from regex, support 1.16.0
#1522@pawel-big-lebowski
Adds support for Flink 1.16.0 and makes the integration resolve topic names from Kafka topic patterns. - Proxy: implement lineage event validator for client proxy
#1469@fm100
Implements logic in the proxy (which is still in development) for validating and handling lineage events.
Changed
- CI: use
ruffinstead of flake8, isort, etc., for linting and formatting#1526@mobuchowski
Adopts theruffpackage, which combines several linters and formatters into one fast binary.
Fixed
- Airflow: make the Trino catalog non-mandatory
#1572@JDarDagran
Makes the Trino catalog optional in the Trino extractor. - Common: add explicit SQL dependency
#1532@mobuchowski
Addresses a 0.19.2 breaking change to the GE integration by including the SQL dependency explicitly. - DBT: adjust
tqdmlogging indbt-ol#1549@JdarDagran
Adjuststqdmto show the correct number of iterations and adds START events for parent runs. - DBT: fix typo in log output
#1493@denimalpaca
Fixes 'emittled' typo in log output. - Great Expectations/Airflow: follow Snowflake dataset naming rules
#1527@mobuchowski
Normalizes Snowflake dataset and datasource naming rules among DBT/Airflow/GE; canonizes old Snowflake account paths around making them all full-size with account, region and cloud names. - Java and Python Clients: Kafka does not initialize properties if they are empty; check and notify about Confluent-Kafka requirement
#1556@mobuchowski
Fixes the failure to initializeKafkaTransportin the Java client and adds an exception if the requiredconfluent-kafkamodule is missing from the Python client. - Spark: add square brackets for list-based Spark configs
#1507@Varunvaruns9
Adds a condition to treat configs with[]as lists. Note:[]will be required for list-based configs starting with 0.21.0. - Spark: fix several Spark/BigQuery-related issues
#1557@mobuchowski
Fixes the assumption that a version is always a number; adds support forHadoopMapReduceWriteConfigUtil; makes the integration accessBigQueryUtilandgetTableIdusing reflection, which supports all BigQuery versions; makes logs provide the full serialized LogicalPlan ondebug. - SQL: only report partial failures `#1479 @mobuchowski
Changes the parser so it reports partial failures instead of failing the whole extraction.