Package io.openlineage.client
Class OpenLineage
java.lang.Object
io.openlineage.client.OpenLineage
Usage:
URI producer = URI.create("http://my.producer/uri");
OpenLineage ol = new OpenLineage(producer);
UUID runId = UUID.randomUUID();
RunFacets runFacets =
ol.newRunFacetsBuilder().nominalTime(ol.newNominalTimeRunFacet(now, now)).build();
Run run = ol.newRun(runId, runFacets);
String name = "jobName";
String namespace = "namespace";
JobFacets jobFacets = ol.newJobFacetsBuilder().build();
Job job = ol.newJob(namespace, name, jobFacets);
List<InputDataset> inputs = Arrays.asList(ol.newInputDataset("ins", "input", null, null));
List<OutputDataset> outputs = Arrays.asList(ol.newOutputDataset("ons", "output", null, null));
RunEvent runStateUpdate =
ol.newRunEvent(now, OpenLineage.RunEvent.EventType.START, run, job, inputs, outputs);
-
Nested Class Summary
Modifier and TypeClassDescriptionstatic interface
Interface for BaseEventstatic interface
Interface for BaseFacetstatic interface
static final class
model class for ColumnLineageDatasetFacetfinal class
builder class for ColumnLineageDatasetFacetstatic final class
model class for ColumnLineageDatasetFacetFieldsstatic final class
model class for ColumnLineageDatasetFacetFieldsAdditionalstatic final class
builder class for ColumnLineageDatasetFacetFieldsAdditionalstatic final class
model class for ColumnLineageDatasetFacetFieldsAdditionalInputFieldsstatic final class
builder class for ColumnLineageDatasetFacetFieldsAdditionalInputFieldsstatic final class
model class for ColumnLineageDatasetFacetFieldsAdditionalInputFieldsTransformationsstatic final class
builder class for ColumnLineageDatasetFacetFieldsAdditionalInputFieldsTransformationsstatic final class
builder class for ColumnLineageDatasetFacetFieldsstatic final class
model class for DataQualityAssertionsDatasetFacetstatic final class
model class for DataQualityAssertionsDatasetFacetAssertionsstatic final class
builder class for DataQualityAssertionsDatasetFacetAssertionsfinal class
builder class for DataQualityAssertionsDatasetFacetstatic final class
model class for DataQualityMetricsInputDatasetFacetfinal class
builder class for DataQualityMetricsInputDatasetFacetstatic final class
model class for DataQualityMetricsInputDatasetFacetColumnMetricsstatic final class
model class for DataQualityMetricsInputDatasetFacetColumnMetricsAdditionalstatic final class
builder class for DataQualityMetricsInputDatasetFacetColumnMetricsAdditionalstatic final class
model class for DataQualityMetricsInputDatasetFacetColumnMetricsAdditionalQuantilesstatic final class
builder class for DataQualityMetricsInputDatasetFacetColumnMetricsAdditionalQuantilesstatic final class
builder class for DataQualityMetricsInputDatasetFacetColumnMetricsstatic interface
Interface for Datasetstatic final class
model class for DatasetEventfinal class
builder class for DatasetEventstatic interface
Interface for DatasetFacetstatic final class
model class for DatasetFacetsstatic final class
builder class for DatasetFacetsstatic final class
model class for DatasetVersionDatasetFacetfinal class
builder class for DatasetVersionDatasetFacetstatic final class
model class for DatasourceDatasetFacetfinal class
builder class for DatasourceDatasetFacetstatic class
static class
static class
static class
static class
static final class
model class for DocumentationDatasetFacetfinal class
builder class for DocumentationDatasetFacetstatic final class
model class for DocumentationJobFacetfinal class
builder class for DocumentationJobFacetstatic final class
model class for ErrorMessageRunFacetfinal class
builder class for ErrorMessageRunFacetstatic final class
model class for ExternalQueryRunFacetfinal class
builder class for ExternalQueryRunFacetstatic final class
model class for ExtractionErrorRunFacetfinal class
builder class for ExtractionErrorRunFacetstatic final class
model class for ExtractionErrorRunFacetErrorsstatic final class
builder class for ExtractionErrorRunFacetErrorsstatic final class
model class for GcpDataprocSparkRunFacetfinal class
builder class for GcpDataprocSparkRunFacetstatic final class
model class for GcpLineageJobFacetfinal class
builder class for GcpLineageJobFacetstatic final class
model class for GcpLineageJobFacetOriginstatic final class
builder class for GcpLineageJobFacetOriginstatic final class
model class for InputDatasetstatic final class
builder class for InputDatasetstatic interface
Interface for InputDatasetFacetstatic final class
model class for InputDatasetInputFacetsstatic final class
builder class for InputDatasetInputFacetsstatic final class
model class for Jobstatic final class
builder class for Jobstatic final class
model class for JobEventfinal class
builder class for JobEventstatic interface
Interface for JobFacetstatic final class
model class for JobFacetsstatic final class
builder class for JobFacetsstatic final class
model class for JobTypeJobFacetfinal class
builder class for JobTypeJobFacetstatic final class
model class for LifecycleStateChangeDatasetFacetfinal class
builder class for LifecycleStateChangeDatasetFacetstatic final class
model class for LifecycleStateChangeDatasetFacetPreviousIdentifierstatic final class
builder class for LifecycleStateChangeDatasetFacetPreviousIdentifierstatic final class
model class for NominalTimeRunFacetfinal class
builder class for NominalTimeRunFacetstatic final class
model class for OutputDatasetstatic final class
builder class for OutputDatasetstatic interface
Interface for OutputDatasetFacetstatic final class
model class for OutputDatasetOutputFacetsstatic final class
builder class for OutputDatasetOutputFacetsstatic final class
model class for OutputStatisticsOutputDatasetFacetfinal class
builder class for OutputStatisticsOutputDatasetFacetstatic final class
model class for OwnershipDatasetFacetfinal class
builder class for OwnershipDatasetFacetstatic final class
model class for OwnershipDatasetFacetOwnersstatic final class
builder class for OwnershipDatasetFacetOwnersstatic final class
model class for OwnershipJobFacetfinal class
builder class for OwnershipJobFacetstatic final class
model class for OwnershipJobFacetOwnersstatic final class
builder class for OwnershipJobFacetOwnersstatic final class
model class for ParentRunFacetfinal class
builder class for ParentRunFacetstatic final class
model class for ParentRunFacetJobstatic final class
builder class for ParentRunFacetJobstatic final class
model class for ParentRunFacetRunstatic final class
builder class for ParentRunFacetRunstatic final class
model class for ProcessingEngineRunFacetfinal class
builder class for ProcessingEngineRunFacetstatic final class
model class for Runstatic final class
builder class for Runstatic final class
model class for RunEventfinal class
builder class for RunEventstatic interface
Interface for RunFacetstatic final class
model class for RunFacetsstatic final class
builder class for RunFacetsstatic final class
model class for SchemaDatasetFacetfinal class
builder class for SchemaDatasetFacetstatic final class
model class for SchemaDatasetFacetFieldsstatic final class
builder class for SchemaDatasetFacetFieldsstatic final class
model class for SourceCodeJobFacetfinal class
builder class for SourceCodeJobFacetstatic final class
model class for SourceCodeLocationJobFacetfinal class
builder class for SourceCodeLocationJobFacetstatic final class
model class for SQLJobFacetfinal class
builder class for SQLJobFacetstatic final class
model class for StaticDatasetstatic final class
builder class for StaticDatasetstatic final class
model class for StorageDatasetFacetfinal class
builder class for StorageDatasetFacetstatic final class
model class for SymlinksDatasetFacetfinal class
builder class for SymlinksDatasetFacetstatic final class
model class for SymlinksDatasetFacetIdentifiersstatic final class
builder class for SymlinksDatasetFacetIdentifiers -
Constructor Summary
-
Method Summary
Modifier and TypeMethodDescriptionFactory method for ColumnLineageDatasetFacetCreates a builder for ColumnLineageDatasetFacetFactory method for ColumnLineageDatasetFacetFieldsnewColumnLineageDatasetFacetFieldsAdditional
(List<OpenLineage.ColumnLineageDatasetFacetFieldsAdditionalInputFields> inputFields, String transformationDescription, String transformationType) Factory method for ColumnLineageDatasetFacetFieldsAdditionalCreates a builder for ColumnLineageDatasetFacetFieldsAdditionalnewColumnLineageDatasetFacetFieldsAdditionalInputFields
(String namespace, String name, String field, List<OpenLineage.ColumnLineageDatasetFacetFieldsAdditionalInputFieldsTransformations> transformations) Factory method for ColumnLineageDatasetFacetFieldsAdditionalInputFieldsCreates a builder for ColumnLineageDatasetFacetFieldsAdditionalInputFieldsnewColumnLineageDatasetFacetFieldsAdditionalInputFieldsTransformations
(String type, String subtype, String description, Boolean masking) Factory method for ColumnLineageDatasetFacetFieldsAdditionalInputFieldsTransformationsCreates a builder for ColumnLineageDatasetFacetFieldsAdditionalInputFieldsTransformationsCreates a builder for ColumnLineageDatasetFacetFieldsnewDataQualityAssertionsDatasetFacet
(List<OpenLineage.DataQualityAssertionsDatasetFacetAssertions> assertions) Factory method for DataQualityAssertionsDatasetFacetnewDataQualityAssertionsDatasetFacetAssertions
(String assertion, Boolean success, String column) Factory method for DataQualityAssertionsDatasetFacetAssertionsCreates a builder for DataQualityAssertionsDatasetFacetAssertionsCreates a builder for DataQualityAssertionsDatasetFacetnewDataQualityMetricsInputDatasetFacet
(Long rowCount, Long bytes, Long fileCount, OpenLineage.DataQualityMetricsInputDatasetFacetColumnMetrics columnMetrics) Factory method for DataQualityMetricsInputDatasetFacetCreates a builder for DataQualityMetricsInputDatasetFacetFactory method for DataQualityMetricsInputDatasetFacetColumnMetricsnewDataQualityMetricsInputDatasetFacetColumnMetricsAdditional
(Long nullCount, Long distinctCount, Double sum, Double count, Double min, Double max, OpenLineage.DataQualityMetricsInputDatasetFacetColumnMetricsAdditionalQuantiles quantiles) Factory method for DataQualityMetricsInputDatasetFacetColumnMetricsAdditionalCreates a builder for DataQualityMetricsInputDatasetFacetColumnMetricsAdditionalFactory method for DataQualityMetricsInputDatasetFacetColumnMetricsAdditionalQuantilesCreates a builder for DataQualityMetricsInputDatasetFacetColumnMetricsAdditionalQuantilesCreates a builder for DataQualityMetricsInputDatasetFacetColumnMetricsnewDatasetEvent
(ZonedDateTime eventTime, OpenLineage.StaticDataset dataset) Factory method for DatasetEventCreates a builder for DatasetEventnewDatasetFacets
(OpenLineage.DocumentationDatasetFacet documentation, OpenLineage.DatasourceDatasetFacet dataSource, OpenLineage.DatasetVersionDatasetFacet version, OpenLineage.SchemaDatasetFacet schema, OpenLineage.OwnershipDatasetFacet ownership, OpenLineage.StorageDatasetFacet storage, OpenLineage.ColumnLineageDatasetFacet columnLineage, OpenLineage.SymlinksDatasetFacet symlinks, OpenLineage.LifecycleStateChangeDatasetFacet lifecycleStateChange) Factory method for DatasetFacetsCreates a builder for DatasetFacetsnewDatasetVersionDatasetFacet
(String datasetVersion) Factory method for DatasetVersionDatasetFacetCreates a builder for DatasetVersionDatasetFacetnewDatasourceDatasetFacet
(String name, URI uri) Factory method for DatasourceDatasetFacetCreates a builder for DatasourceDatasetFacetnewDocumentationDatasetFacet
(String description) Factory method for DocumentationDatasetFacetCreates a builder for DocumentationDatasetFacetnewDocumentationJobFacet
(String description) Factory method for DocumentationJobFacetCreates a builder for DocumentationJobFacetnewErrorMessageRunFacet
(String message, String programmingLanguage, String stackTrace) Factory method for ErrorMessageRunFacetCreates a builder for ErrorMessageRunFacetnewExternalQueryRunFacet
(String externalQueryId, String source) Factory method for ExternalQueryRunFacetCreates a builder for ExternalQueryRunFacetnewExtractionErrorRunFacet
(Long totalTasks, Long failedTasks, List<OpenLineage.ExtractionErrorRunFacetErrors> errors) Factory method for ExtractionErrorRunFacetCreates a builder for ExtractionErrorRunFacetnewExtractionErrorRunFacetErrors
(String errorMessage, String stackTrace, String task, Long taskNumber) Factory method for ExtractionErrorRunFacetErrorsCreates a builder for ExtractionErrorRunFacetErrorsnewGcpDataprocSparkRunFacet
(String appId, String appName, String batchId, String batchUuid, String clusterName, String clusterUuid, String jobId, String jobUuid, String projectId, String queryNodeName, String sessionId, String sessionUuid) Factory method for GcpDataprocSparkRunFacetCreates a builder for GcpDataprocSparkRunFacetnewGcpLineageJobFacet
(String displayName, OpenLineage.GcpLineageJobFacetOrigin origin) Factory method for GcpLineageJobFacetCreates a builder for GcpLineageJobFacetnewGcpLineageJobFacetOrigin
(String sourceType, String name) Factory method for GcpLineageJobFacetOriginCreates a builder for GcpLineageJobFacetOriginnewInputDataset
(String namespace, String name, OpenLineage.DatasetFacets facets, OpenLineage.InputDatasetInputFacets inputFacets) Factory method for InputDatasetCreates a builder for InputDatasetnewInputDatasetInputFacets
(OpenLineage.DataQualityAssertionsDatasetFacet dataQualityAssertions, OpenLineage.DataQualityMetricsInputDatasetFacet dataQualityMetrics) Factory method for InputDatasetInputFacetsCreates a builder for InputDatasetInputFacetsnewJob
(String namespace, String name, OpenLineage.JobFacets facets) Factory method for JobCreates a builder for JobnewJobEvent
(ZonedDateTime eventTime, OpenLineage.Job job, List<OpenLineage.InputDataset> inputs, List<OpenLineage.OutputDataset> outputs) Factory method for JobEventCreates a builder for JobEventnewJobFacets
(OpenLineage.JobTypeJobFacet jobType, OpenLineage.SourceCodeJobFacet sourceCode, OpenLineage.OwnershipJobFacet ownership, OpenLineage.GcpLineageJobFacet gcp_lineage, OpenLineage.SQLJobFacet sql, OpenLineage.SourceCodeLocationJobFacet sourceCodeLocation, OpenLineage.DocumentationJobFacet documentation) Factory method for JobFacetsCreates a builder for JobFacetsnewJobTypeJobFacet
(String processingType, String integration, String jobType) Factory method for JobTypeJobFacetCreates a builder for JobTypeJobFacetnewLifecycleStateChangeDatasetFacet
(OpenLineage.LifecycleStateChangeDatasetFacet.LifecycleStateChange lifecycleStateChange, OpenLineage.LifecycleStateChangeDatasetFacetPreviousIdentifier previousIdentifier) Factory method for LifecycleStateChangeDatasetFacetCreates a builder for LifecycleStateChangeDatasetFacetnewLifecycleStateChangeDatasetFacetPreviousIdentifier
(String name, String namespace) Factory method for LifecycleStateChangeDatasetFacetPreviousIdentifierCreates a builder for LifecycleStateChangeDatasetFacetPreviousIdentifiernewNominalTimeRunFacet
(ZonedDateTime nominalStartTime, ZonedDateTime nominalEndTime) Factory method for NominalTimeRunFacetCreates a builder for NominalTimeRunFacetnewOutputDataset
(String namespace, String name, OpenLineage.DatasetFacets facets, OpenLineage.OutputDatasetOutputFacets outputFacets) Factory method for OutputDatasetCreates a builder for OutputDatasetnewOutputDatasetOutputFacets
(OpenLineage.OutputStatisticsOutputDatasetFacet outputStatistics) Factory method for OutputDatasetOutputFacetsCreates a builder for OutputDatasetOutputFacetsnewOutputStatisticsOutputDatasetFacet
(Long rowCount, Long size, Long fileCount) Factory method for OutputStatisticsOutputDatasetFacetCreates a builder for OutputStatisticsOutputDatasetFacetFactory method for OwnershipDatasetFacetCreates a builder for OwnershipDatasetFacetnewOwnershipDatasetFacetOwners
(String name, String type) Factory method for OwnershipDatasetFacetOwnersCreates a builder for OwnershipDatasetFacetOwnersFactory method for OwnershipJobFacetCreates a builder for OwnershipJobFacetnewOwnershipJobFacetOwners
(String name, String type) Factory method for OwnershipJobFacetOwnersCreates a builder for OwnershipJobFacetOwnersFactory method for ParentRunFacetCreates a builder for ParentRunFacetnewParentRunFacetJob
(String namespace, String name) Factory method for ParentRunFacetJobCreates a builder for ParentRunFacetJobnewParentRunFacetRun
(UUID runId) Factory method for ParentRunFacetRunCreates a builder for ParentRunFacetRunnewProcessingEngineRunFacet
(String version, String name, String openlineageAdapterVersion) Factory method for ProcessingEngineRunFacetCreates a builder for ProcessingEngineRunFacetnewRun
(UUID runId, OpenLineage.RunFacets facets) Factory method for RunCreates a builder for RunnewRunEvent
(ZonedDateTime eventTime, OpenLineage.RunEvent.EventType eventType, OpenLineage.Run run, OpenLineage.Job job, List<OpenLineage.InputDataset> inputs, List<OpenLineage.OutputDataset> outputs) Factory method for RunEventCreates a builder for RunEventnewRunFacets
(OpenLineage.ErrorMessageRunFacet errorMessage, OpenLineage.ExternalQueryRunFacet externalQuery, OpenLineage.ExtractionErrorRunFacet extractionError, OpenLineage.ParentRunFacet parent, OpenLineage.NominalTimeRunFacet nominalTime, OpenLineage.GcpDataprocSparkRunFacet gcp_dataproc_spark, OpenLineage.ProcessingEngineRunFacet processing_engine) Factory method for RunFacetsCreates a builder for RunFacetsFactory method for SchemaDatasetFacetCreates a builder for SchemaDatasetFacetnewSchemaDatasetFacetFields
(String name, String type, String description, List<OpenLineage.SchemaDatasetFacetFields> fields) Factory method for SchemaDatasetFacetFieldsCreates a builder for SchemaDatasetFacetFieldsnewSourceCodeJobFacet
(String language, String sourceCode) Factory method for SourceCodeJobFacetCreates a builder for SourceCodeJobFacetnewSourceCodeLocationJobFacet
(String type, URI url, String repoUrl, String path, String version, String tag, String branch) Factory method for SourceCodeLocationJobFacetCreates a builder for SourceCodeLocationJobFacetnewSQLJobFacet
(String query) Factory method for SQLJobFacetCreates a builder for SQLJobFacetnewStaticDataset
(String namespace, String name, OpenLineage.DatasetFacets facets) Factory method for StaticDatasetCreates a builder for StaticDatasetnewStorageDatasetFacet
(String storageLayer, String fileFormat) Factory method for StorageDatasetFacetCreates a builder for StorageDatasetFacetFactory method for SymlinksDatasetFacetCreates a builder for SymlinksDatasetFacetnewSymlinksDatasetFacetIdentifiers
(String namespace, String name, String type) Factory method for SymlinksDatasetFacetIdentifiersCreates a builder for SymlinksDatasetFacetIdentifiers
-
Constructor Details
-
OpenLineage
Starting point to create OpenLineage objects. Use the OpenLineage instance to create events and facets- Parameters:
producer
- the identifier of the library using the client to generate OpenLineage events
-
-
Method Details
-
newRunEvent
public OpenLineage.RunEvent newRunEvent(ZonedDateTime eventTime, OpenLineage.RunEvent.EventType eventType, OpenLineage.Run run, OpenLineage.Job job, List<OpenLineage.InputDataset> inputs, List<OpenLineage.OutputDataset> outputs) Factory method for RunEvent- Parameters:
eventTime
- the time the event occurred ateventType
- the current transition of the run state. It is required to issue 1 START event and 1 of [ COMPLETE, ABORT, FAIL ] event per run. Additional events with OTHER eventType can be added to the same run. For example to send additional metadata after the run is completerun
- the runjob
- the jobinputs
- The set of **input** datasets.outputs
- The set of **output** datasets.- Returns:
- RunEvent
-
newRunEventBuilder
Creates a builder for RunEvent- Returns:
- a new builder for RunEvent
-
newSchemaDatasetFacetFields
public OpenLineage.SchemaDatasetFacetFields newSchemaDatasetFacetFields(String name, String type, String description, List<OpenLineage.SchemaDatasetFacetFields> fields) Factory method for SchemaDatasetFacetFields- Parameters:
name
- The name of the field.type
- The type of the field.description
- The description of the field.fields
- Nested struct fields.- Returns:
- SchemaDatasetFacetFields
-
newSchemaDatasetFacetFieldsBuilder
Creates a builder for SchemaDatasetFacetFields- Returns:
- a new builder for SchemaDatasetFacetFields
-
newParentRunFacet
public OpenLineage.ParentRunFacet newParentRunFacet(OpenLineage.ParentRunFacetRun run, OpenLineage.ParentRunFacetJob job) Factory method for ParentRunFacet- Parameters:
run
- the runjob
- the job- Returns:
- ParentRunFacet
-
newParentRunFacetBuilder
Creates a builder for ParentRunFacet- Returns:
- a new builder for ParentRunFacet
-
newOutputStatisticsOutputDatasetFacet
public OpenLineage.OutputStatisticsOutputDatasetFacet newOutputStatisticsOutputDatasetFacet(Long rowCount, Long size, Long fileCount) Factory method for OutputStatisticsOutputDatasetFacet- Parameters:
rowCount
- The number of rows written to the datasetsize
- The size in bytes written to the datasetfileCount
- The number of files written to the dataset- Returns:
- OutputStatisticsOutputDatasetFacet
-
newOutputStatisticsOutputDatasetFacetBuilder
public OpenLineage.OutputStatisticsOutputDatasetFacetBuilder newOutputStatisticsOutputDatasetFacetBuilder()Creates a builder for OutputStatisticsOutputDatasetFacet- Returns:
- a new builder for OutputStatisticsOutputDatasetFacet
-
newExtractionErrorRunFacetErrors
public OpenLineage.ExtractionErrorRunFacetErrors newExtractionErrorRunFacetErrors(String errorMessage, String stackTrace, String task, Long taskNumber) Factory method for ExtractionErrorRunFacetErrors- Parameters:
errorMessage
- Text representation of extraction error message.stackTrace
- Stack trace of extraction error messagetask
- Text representation of task that failed. This can be, for example, SQL statement that parser could not interpret.taskNumber
- Order of task (counted from 0).- Returns:
- ExtractionErrorRunFacetErrors
-
newExtractionErrorRunFacetErrorsBuilder
Creates a builder for ExtractionErrorRunFacetErrors- Returns:
- a new builder for ExtractionErrorRunFacetErrors
-
newJobTypeJobFacet
public OpenLineage.JobTypeJobFacet newJobTypeJobFacet(String processingType, String integration, String jobType) Factory method for JobTypeJobFacet- Parameters:
processingType
- Job processing type like: BATCH or STREAMINGintegration
- OpenLineage integration type of this job: for example SPARK|DBT|AIRFLOW|FLINKjobType
- Run type, for example: QUERY|COMMAND|DAG|TASK|JOB|MODEL. This is an integration-specific field.- Returns:
- JobTypeJobFacet
-
newJobTypeJobFacetBuilder
Creates a builder for JobTypeJobFacet- Returns:
- a new builder for JobTypeJobFacet
-
newRun
Factory method for Run- Parameters:
runId
- The globally unique ID of the run associated with the job.facets
- The run facets.- Returns:
- Run
-
newRunBuilder
Creates a builder for Run- Returns:
- a new builder for Run
-
newDataQualityMetricsInputDatasetFacetColumnMetricsAdditionalQuantiles
public OpenLineage.DataQualityMetricsInputDatasetFacetColumnMetricsAdditionalQuantiles newDataQualityMetricsInputDatasetFacetColumnMetricsAdditionalQuantiles()Factory method for DataQualityMetricsInputDatasetFacetColumnMetricsAdditionalQuantiles- Returns:
- DataQualityMetricsInputDatasetFacetColumnMetricsAdditionalQuantiles
-
newDataQualityMetricsInputDatasetFacetColumnMetricsAdditionalQuantilesBuilder
public OpenLineage.DataQualityMetricsInputDatasetFacetColumnMetricsAdditionalQuantilesBuilder newDataQualityMetricsInputDatasetFacetColumnMetricsAdditionalQuantilesBuilder()Creates a builder for DataQualityMetricsInputDatasetFacetColumnMetricsAdditionalQuantiles- Returns:
- a new builder for DataQualityMetricsInputDatasetFacetColumnMetricsAdditionalQuantiles
-
newExternalQueryRunFacet
public OpenLineage.ExternalQueryRunFacet newExternalQueryRunFacet(String externalQueryId, String source) Factory method for ExternalQueryRunFacet- Parameters:
externalQueryId
- Identifier for the external systemsource
- source of the external query- Returns:
- ExternalQueryRunFacet
-
newExternalQueryRunFacetBuilder
Creates a builder for ExternalQueryRunFacet- Returns:
- a new builder for ExternalQueryRunFacet
-
newOutputDatasetOutputFacets
public OpenLineage.OutputDatasetOutputFacets newOutputDatasetOutputFacets(OpenLineage.OutputStatisticsOutputDatasetFacet outputStatistics) Factory method for OutputDatasetOutputFacets- Parameters:
outputStatistics
- the outputStatistics- Returns:
- OutputDatasetOutputFacets
-
newOutputDatasetOutputFacetsBuilder
Creates a builder for OutputDatasetOutputFacets- Returns:
- a new builder for OutputDatasetOutputFacets
-
newNominalTimeRunFacet
public OpenLineage.NominalTimeRunFacet newNominalTimeRunFacet(ZonedDateTime nominalStartTime, ZonedDateTime nominalEndTime) Factory method for NominalTimeRunFacet- Parameters:
nominalStartTime
- An [ISO-8601](https://en.wikipedia.org/wiki/ISO_8601) timestamp representing the nominal start time (included) of the run. AKA the schedule timenominalEndTime
- An [ISO-8601](https://en.wikipedia.org/wiki/ISO_8601) timestamp representing the nominal end time (excluded) of the run. (Should be the nominal start time of the next run)- Returns:
- NominalTimeRunFacet
-
newNominalTimeRunFacetBuilder
Creates a builder for NominalTimeRunFacet- Returns:
- a new builder for NominalTimeRunFacet
-
newRunFacet
- Returns:
- RunFacet
-
newOwnershipJobFacetOwners
Factory method for OwnershipJobFacetOwners- Parameters:
name
- the identifier of the owner of the Job. It is recommended to define this as a URN. For example application:foo, user:jdoe, team:datatype
- The type of ownership (optional)- Returns:
- OwnershipJobFacetOwners
-
newOwnershipJobFacetOwnersBuilder
Creates a builder for OwnershipJobFacetOwners- Returns:
- a new builder for OwnershipJobFacetOwners
-
newGcpLineageJobFacet
public OpenLineage.GcpLineageJobFacet newGcpLineageJobFacet(String displayName, OpenLineage.GcpLineageJobFacetOrigin origin) Factory method for GcpLineageJobFacet- Parameters:
displayName
- The name of the job to be used on UIorigin
- the origin- Returns:
- GcpLineageJobFacet
-
newGcpLineageJobFacetBuilder
Creates a builder for GcpLineageJobFacet- Returns:
- a new builder for GcpLineageJobFacet
-
newInputDatasetFacet
- Returns:
- InputDatasetFacet
-
newJobEvent
public OpenLineage.JobEvent newJobEvent(ZonedDateTime eventTime, OpenLineage.Job job, List<OpenLineage.InputDataset> inputs, List<OpenLineage.OutputDataset> outputs) Factory method for JobEvent- Parameters:
eventTime
- the time the event occurred atjob
- the jobinputs
- The set of **input** datasets.outputs
- The set of **output** datasets.- Returns:
- JobEvent
-
newJobEventBuilder
Creates a builder for JobEvent- Returns:
- a new builder for JobEvent
-
newExtractionErrorRunFacet
public OpenLineage.ExtractionErrorRunFacet newExtractionErrorRunFacet(Long totalTasks, Long failedTasks, List<OpenLineage.ExtractionErrorRunFacetErrors> errors) Factory method for ExtractionErrorRunFacet- Parameters:
totalTasks
- The number of distinguishable tasks in a run that were processed by OpenLineage, whether successfully or not. Those could be, for example, distinct SQL statements.failedTasks
- The number of distinguishable tasks in a run that were processed not successfully by OpenLineage. Those could be, for example, distinct SQL statements.errors
- the errors- Returns:
- ExtractionErrorRunFacet
-
newExtractionErrorRunFacetBuilder
Creates a builder for ExtractionErrorRunFacet- Returns:
- a new builder for ExtractionErrorRunFacet
-
newOwnershipDatasetFacetOwners
public OpenLineage.OwnershipDatasetFacetOwners newOwnershipDatasetFacetOwners(String name, String type) Factory method for OwnershipDatasetFacetOwners- Parameters:
name
- the identifier of the owner of the Dataset. It is recommended to define this as a URN. For example application:foo, user:jdoe, team:datatype
- The type of ownership (optional)- Returns:
- OwnershipDatasetFacetOwners
-
newOwnershipDatasetFacetOwnersBuilder
Creates a builder for OwnershipDatasetFacetOwners- Returns:
- a new builder for OwnershipDatasetFacetOwners
-
newColumnLineageDatasetFacetFieldsAdditional
public OpenLineage.ColumnLineageDatasetFacetFieldsAdditional newColumnLineageDatasetFacetFieldsAdditional(List<OpenLineage.ColumnLineageDatasetFacetFieldsAdditionalInputFields> inputFields, String transformationDescription, String transformationType) Factory method for ColumnLineageDatasetFacetFieldsAdditional- Parameters:
inputFields
- the inputFieldstransformationDescription
- a string representation of the transformation appliedtransformationType
- IDENTITY|MASKED reflects a clearly defined behavior. IDENTITY: exact same as input; MASKED: no original data available (like a hash of PII for example)- Returns:
- ColumnLineageDatasetFacetFieldsAdditional
-
newColumnLineageDatasetFacetFieldsAdditionalBuilder
public OpenLineage.ColumnLineageDatasetFacetFieldsAdditionalBuilder newColumnLineageDatasetFacetFieldsAdditionalBuilder()Creates a builder for ColumnLineageDatasetFacetFieldsAdditional- Returns:
- a new builder for ColumnLineageDatasetFacetFieldsAdditional
-
newDataQualityMetricsInputDatasetFacetColumnMetricsAdditional
public OpenLineage.DataQualityMetricsInputDatasetFacetColumnMetricsAdditional newDataQualityMetricsInputDatasetFacetColumnMetricsAdditional(Long nullCount, Long distinctCount, Double sum, Double count, Double min, Double max, OpenLineage.DataQualityMetricsInputDatasetFacetColumnMetricsAdditionalQuantiles quantiles) Factory method for DataQualityMetricsInputDatasetFacetColumnMetricsAdditional- Parameters:
nullCount
- The number of null values in this column for the rows evaluateddistinctCount
- The number of distinct values in this column for the rows evaluatedsum
- The total sum of values in this column for the rows evaluatedcount
- The number of values in this columnmin
- the minmax
- the maxquantiles
- The property key is the quantile. Examples: 0.1 0.25 0.5 0.75 1- Returns:
- DataQualityMetricsInputDatasetFacetColumnMetricsAdditional
-
newDataQualityMetricsInputDatasetFacetColumnMetricsAdditionalBuilder
public OpenLineage.DataQualityMetricsInputDatasetFacetColumnMetricsAdditionalBuilder newDataQualityMetricsInputDatasetFacetColumnMetricsAdditionalBuilder()Creates a builder for DataQualityMetricsInputDatasetFacetColumnMetricsAdditional- Returns:
- a new builder for DataQualityMetricsInputDatasetFacetColumnMetricsAdditional
-
newDataQualityMetricsInputDatasetFacetColumnMetrics
public OpenLineage.DataQualityMetricsInputDatasetFacetColumnMetrics newDataQualityMetricsInputDatasetFacetColumnMetrics()Factory method for DataQualityMetricsInputDatasetFacetColumnMetrics- Returns:
- DataQualityMetricsInputDatasetFacetColumnMetrics
-
newDataQualityMetricsInputDatasetFacetColumnMetricsBuilder
public OpenLineage.DataQualityMetricsInputDatasetFacetColumnMetricsBuilder newDataQualityMetricsInputDatasetFacetColumnMetricsBuilder()Creates a builder for DataQualityMetricsInputDatasetFacetColumnMetrics- Returns:
- a new builder for DataQualityMetricsInputDatasetFacetColumnMetrics
-
newDataQualityMetricsInputDatasetFacet
public OpenLineage.DataQualityMetricsInputDatasetFacet newDataQualityMetricsInputDatasetFacet(Long rowCount, Long bytes, Long fileCount, OpenLineage.DataQualityMetricsInputDatasetFacetColumnMetrics columnMetrics) Factory method for DataQualityMetricsInputDatasetFacet- Parameters:
rowCount
- The number of rows evaluatedbytes
- The size in bytesfileCount
- The number of files evaluatedcolumnMetrics
- The property key is the column name- Returns:
- DataQualityMetricsInputDatasetFacet
-
newDataQualityMetricsInputDatasetFacetBuilder
public OpenLineage.DataQualityMetricsInputDatasetFacetBuilder newDataQualityMetricsInputDatasetFacetBuilder()Creates a builder for DataQualityMetricsInputDatasetFacet- Returns:
- a new builder for DataQualityMetricsInputDatasetFacet
-
newDocumentationJobFacet
Factory method for DocumentationJobFacet- Parameters:
description
- The description of the job.- Returns:
- DocumentationJobFacet
-
newDocumentationJobFacetBuilder
Creates a builder for DocumentationJobFacet- Returns:
- a new builder for DocumentationJobFacet
-
newJobFacets
public OpenLineage.JobFacets newJobFacets(OpenLineage.JobTypeJobFacet jobType, OpenLineage.SourceCodeJobFacet sourceCode, OpenLineage.OwnershipJobFacet ownership, OpenLineage.GcpLineageJobFacet gcp_lineage, OpenLineage.SQLJobFacet sql, OpenLineage.SourceCodeLocationJobFacet sourceCodeLocation, OpenLineage.DocumentationJobFacet documentation) Factory method for JobFacets- Parameters:
jobType
- the jobTypesourceCode
- the sourceCodeownership
- the ownershipgcp_lineage
- the gcp_lineagesql
- the sqlsourceCodeLocation
- the sourceCodeLocationdocumentation
- the documentation- Returns:
- JobFacets
-
newJobFacetsBuilder
Creates a builder for JobFacets- Returns:
- a new builder for JobFacets
-
newInputDatasetInputFacets
public OpenLineage.InputDatasetInputFacets newInputDatasetInputFacets(OpenLineage.DataQualityAssertionsDatasetFacet dataQualityAssertions, OpenLineage.DataQualityMetricsInputDatasetFacet dataQualityMetrics) Factory method for InputDatasetInputFacets- Parameters:
dataQualityAssertions
- the dataQualityAssertionsdataQualityMetrics
- the dataQualityMetrics- Returns:
- InputDatasetInputFacets
-
newInputDatasetInputFacetsBuilder
Creates a builder for InputDatasetInputFacets- Returns:
- a new builder for InputDatasetInputFacets
-
newDatasetFacet
- Returns:
- DatasetFacet
-
newDeletedDatasetFacet
- Returns:
- a deleted DatasetFacet
-
newOwnershipDatasetFacet
public OpenLineage.OwnershipDatasetFacet newOwnershipDatasetFacet(List<OpenLineage.OwnershipDatasetFacetOwners> owners) Factory method for OwnershipDatasetFacet- Parameters:
owners
- The owners of the dataset.- Returns:
- OwnershipDatasetFacet
-
newOwnershipDatasetFacetBuilder
Creates a builder for OwnershipDatasetFacet- Returns:
- a new builder for OwnershipDatasetFacet
-
newSQLJobFacet
Factory method for SQLJobFacet- Parameters:
query
- the query- Returns:
- SQLJobFacet
-
newSQLJobFacetBuilder
Creates a builder for SQLJobFacet- Returns:
- a new builder for SQLJobFacet
-
newDatasetFacets
public OpenLineage.DatasetFacets newDatasetFacets(OpenLineage.DocumentationDatasetFacet documentation, OpenLineage.DatasourceDatasetFacet dataSource, OpenLineage.DatasetVersionDatasetFacet version, OpenLineage.SchemaDatasetFacet schema, OpenLineage.OwnershipDatasetFacet ownership, OpenLineage.StorageDatasetFacet storage, OpenLineage.ColumnLineageDatasetFacet columnLineage, OpenLineage.SymlinksDatasetFacet symlinks, OpenLineage.LifecycleStateChangeDatasetFacet lifecycleStateChange) Factory method for DatasetFacets- Parameters:
documentation
- the documentationdataSource
- the dataSourceversion
- the versionschema
- the schemaownership
- the ownershipstorage
- the storagecolumnLineage
- the columnLineagesymlinks
- the symlinkslifecycleStateChange
- the lifecycleStateChange- Returns:
- DatasetFacets
-
newDatasetFacetsBuilder
Creates a builder for DatasetFacets- Returns:
- a new builder for DatasetFacets
-
newOwnershipJobFacet
public OpenLineage.OwnershipJobFacet newOwnershipJobFacet(List<OpenLineage.OwnershipJobFacetOwners> owners) Factory method for OwnershipJobFacet- Parameters:
owners
- The owners of the job.- Returns:
- OwnershipJobFacet
-
newOwnershipJobFacetBuilder
Creates a builder for OwnershipJobFacet- Returns:
- a new builder for OwnershipJobFacet
-
newParentRunFacetJob
Factory method for ParentRunFacetJob- Parameters:
namespace
- The namespace containing that jobname
- The unique name for that job within that namespace- Returns:
- ParentRunFacetJob
-
newParentRunFacetJobBuilder
Creates a builder for ParentRunFacetJob- Returns:
- a new builder for ParentRunFacetJob
-
newGcpLineageJobFacetOrigin
public OpenLineage.GcpLineageJobFacetOrigin newGcpLineageJobFacetOrigin(String sourceType, String name) Factory method for GcpLineageJobFacetOrigin- Parameters:
sourceType
- Type of the source. Possible values can be found in GCP documentation (https://cloud.google.com/data-catalog/docs/reference/data-lineage/rest/v1/projects.locations.processes#SourceType)name
- If the sourceType isn't CUSTOM, the value of this field should be a GCP resource name of the system, which reports lineage. The project and location parts of the resource name must match the project and location of the lineage resource being created. More details in GCP documentation https://cloud.google.com/data-catalog/docs/reference/data-lineage/rest/v1/projects.locations.processes#origin- Returns:
- GcpLineageJobFacetOrigin
-
newGcpLineageJobFacetOriginBuilder
Creates a builder for GcpLineageJobFacetOrigin- Returns:
- a new builder for GcpLineageJobFacetOrigin
-
newOutputDataset
public OpenLineage.OutputDataset newOutputDataset(String namespace, String name, OpenLineage.DatasetFacets facets, OpenLineage.OutputDatasetOutputFacets outputFacets) Factory method for OutputDataset- Parameters:
namespace
- The namespace containing that datasetname
- The unique name for that dataset within that namespacefacets
- The facets for this datasetoutputFacets
- The output facets for this dataset- Returns:
- OutputDataset
-
newOutputDatasetBuilder
Creates a builder for OutputDataset- Returns:
- a new builder for OutputDataset
-
newErrorMessageRunFacet
public OpenLineage.ErrorMessageRunFacet newErrorMessageRunFacet(String message, String programmingLanguage, String stackTrace) Factory method for ErrorMessageRunFacet- Parameters:
message
- A human-readable string representing error message generated by observed systemprogrammingLanguage
- Programming language the observed system uses.stackTrace
- A language-specific stack trace generated by observed system- Returns:
- ErrorMessageRunFacet
-
newErrorMessageRunFacetBuilder
Creates a builder for ErrorMessageRunFacet- Returns:
- a new builder for ErrorMessageRunFacet
-
newJobFacet
- Returns:
- JobFacet
-
newDeletedJobFacet
- Returns:
- a deleted JobFacet
-
newRunFacets
public OpenLineage.RunFacets newRunFacets(OpenLineage.ErrorMessageRunFacet errorMessage, OpenLineage.ExternalQueryRunFacet externalQuery, OpenLineage.ExtractionErrorRunFacet extractionError, OpenLineage.ParentRunFacet parent, OpenLineage.NominalTimeRunFacet nominalTime, OpenLineage.GcpDataprocSparkRunFacet gcp_dataproc_spark, OpenLineage.ProcessingEngineRunFacet processing_engine) Factory method for RunFacets- Parameters:
errorMessage
- the errorMessageexternalQuery
- the externalQueryextractionError
- the extractionErrorparent
- the parentnominalTime
- the nominalTimegcp_dataproc_spark
- the gcp_dataproc_sparkprocessing_engine
- the processing_engine- Returns:
- RunFacets
-
newRunFacetsBuilder
Creates a builder for RunFacets- Returns:
- a new builder for RunFacets
-
newSchemaDatasetFacet
public OpenLineage.SchemaDatasetFacet newSchemaDatasetFacet(List<OpenLineage.SchemaDatasetFacetFields> fields) Factory method for SchemaDatasetFacet- Parameters:
fields
- The fields of the data source.- Returns:
- SchemaDatasetFacet
-
newSchemaDatasetFacetBuilder
Creates a builder for SchemaDatasetFacet- Returns:
- a new builder for SchemaDatasetFacet
-
newSourceCodeJobFacet
Factory method for SourceCodeJobFacet- Parameters:
language
- Language in which source code of this job was written.sourceCode
- Source code of this job.- Returns:
- SourceCodeJobFacet
-
newSourceCodeJobFacetBuilder
Creates a builder for SourceCodeJobFacet- Returns:
- a new builder for SourceCodeJobFacet
-
newDocumentationDatasetFacet
Factory method for DocumentationDatasetFacet- Parameters:
description
- The description of the dataset.- Returns:
- DocumentationDatasetFacet
-
newDocumentationDatasetFacetBuilder
Creates a builder for DocumentationDatasetFacet- Returns:
- a new builder for DocumentationDatasetFacet
-
newSourceCodeLocationJobFacet
public OpenLineage.SourceCodeLocationJobFacet newSourceCodeLocationJobFacet(String type, URI url, String repoUrl, String path, String version, String tag, String branch) Factory method for SourceCodeLocationJobFacet- Parameters:
type
- the source control systemurl
- the full http URL to locate the filerepoUrl
- the URL to the repositorypath
- the path in the repo containing the source filesversion
- the current version deployed (not a branch name, the actual unique version)tag
- optional tag namebranch
- optional branch name- Returns:
- SourceCodeLocationJobFacet
-
newSourceCodeLocationJobFacetBuilder
Creates a builder for SourceCodeLocationJobFacet- Returns:
- a new builder for SourceCodeLocationJobFacet
-
newDataQualityAssertionsDatasetFacet
public OpenLineage.DataQualityAssertionsDatasetFacet newDataQualityAssertionsDatasetFacet(List<OpenLineage.DataQualityAssertionsDatasetFacetAssertions> assertions) Factory method for DataQualityAssertionsDatasetFacet- Parameters:
assertions
- the assertions- Returns:
- DataQualityAssertionsDatasetFacet
-
newDataQualityAssertionsDatasetFacetBuilder
public OpenLineage.DataQualityAssertionsDatasetFacetBuilder newDataQualityAssertionsDatasetFacetBuilder()Creates a builder for DataQualityAssertionsDatasetFacet- Returns:
- a new builder for DataQualityAssertionsDatasetFacet
-
newColumnLineageDatasetFacet
public OpenLineage.ColumnLineageDatasetFacet newColumnLineageDatasetFacet(OpenLineage.ColumnLineageDatasetFacetFields fields) Factory method for ColumnLineageDatasetFacet- Parameters:
fields
- Column level lineage that maps output fields into input fields used to evaluate them.- Returns:
- ColumnLineageDatasetFacet
-
newColumnLineageDatasetFacetBuilder
Creates a builder for ColumnLineageDatasetFacet- Returns:
- a new builder for ColumnLineageDatasetFacet
-
newDatasetEvent
public OpenLineage.DatasetEvent newDatasetEvent(ZonedDateTime eventTime, OpenLineage.StaticDataset dataset) Factory method for DatasetEvent- Parameters:
eventTime
- the time the event occurred atdataset
- the dataset- Returns:
- DatasetEvent
-
newDatasetEventBuilder
Creates a builder for DatasetEvent- Returns:
- a new builder for DatasetEvent
-
newProcessingEngineRunFacet
public OpenLineage.ProcessingEngineRunFacet newProcessingEngineRunFacet(String version, String name, String openlineageAdapterVersion) Factory method for ProcessingEngineRunFacet- Parameters:
version
- Processing engine version. Might be Airflow or Spark version.name
- Processing engine name, e.g. Airflow or SparkopenlineageAdapterVersion
- OpenLineage adapter package version. Might be e.g. OpenLineage Airflow integration package version- Returns:
- ProcessingEngineRunFacet
-
newProcessingEngineRunFacetBuilder
Creates a builder for ProcessingEngineRunFacet- Returns:
- a new builder for ProcessingEngineRunFacet
-
newGcpDataprocSparkRunFacet
public OpenLineage.GcpDataprocSparkRunFacet newGcpDataprocSparkRunFacet(String appId, String appName, String batchId, String batchUuid, String clusterName, String clusterUuid, String jobId, String jobUuid, String projectId, String queryNodeName, String sessionId, String sessionUuid) Factory method for GcpDataprocSparkRunFacet- Parameters:
appId
- Application ID set in the spark configuration of the current context. Its format depends on the resource manager.appName
- App name set in the spark configuration of the current context. It may be provided by the user.batchId
- Populated only for Dataproc serverless batches. The resource id of the batch.batchUuid
- Populated only for Dataproc serverless batches. A UUID generated by the service when it creates the batch.clusterName
- Populated only for Dataproc GCE workloads. The cluster name is unique within a GCP project.clusterUuid
- Populated only for Dataproc GCE workloads. A UUID generated by the service at the time of cluster creation.jobId
- Populated only for Dataproc GCE workloads. If not specified by the user, the job ID will be provided by the service.jobUuid
- Populated only for Dataproc GCE workloads. A UUID that uniquely identifies a job within the project over time.projectId
- The GCP project ID that the resource belongs to.queryNodeName
- The name of the query node in the executed Spark Plan. Often used to describe the command being executed.sessionId
- Populated only for Dataproc serverless interactive sessions. The resource id of the session, used for URL generation.sessionUuid
- Populated only for Dataproc serverless interactive sessions. A UUID generated by the service when it creates the session.- Returns:
- GcpDataprocSparkRunFacet
-
newGcpDataprocSparkRunFacetBuilder
Creates a builder for GcpDataprocSparkRunFacet- Returns:
- a new builder for GcpDataprocSparkRunFacet
-
newLifecycleStateChangeDatasetFacetPreviousIdentifier
public OpenLineage.LifecycleStateChangeDatasetFacetPreviousIdentifier newLifecycleStateChangeDatasetFacetPreviousIdentifier(String name, String namespace) Factory method for LifecycleStateChangeDatasetFacetPreviousIdentifier- Parameters:
name
- the namenamespace
- the namespace- Returns:
- LifecycleStateChangeDatasetFacetPreviousIdentifier
-
newLifecycleStateChangeDatasetFacetPreviousIdentifierBuilder
public OpenLineage.LifecycleStateChangeDatasetFacetPreviousIdentifierBuilder newLifecycleStateChangeDatasetFacetPreviousIdentifierBuilder()Creates a builder for LifecycleStateChangeDatasetFacetPreviousIdentifier- Returns:
- a new builder for LifecycleStateChangeDatasetFacetPreviousIdentifier
-
newJob
Factory method for Job- Parameters:
namespace
- The namespace containing that jobname
- The unique name for that job within that namespacefacets
- The job facets.- Returns:
- Job
-
newJobBuilder
Creates a builder for Job- Returns:
- a new builder for Job
-
newDatasetVersionDatasetFacet
Factory method for DatasetVersionDatasetFacet- Parameters:
datasetVersion
- The version of the dataset.- Returns:
- DatasetVersionDatasetFacet
-
newDatasetVersionDatasetFacetBuilder
Creates a builder for DatasetVersionDatasetFacet- Returns:
- a new builder for DatasetVersionDatasetFacet
-
newSymlinksDatasetFacetIdentifiers
public OpenLineage.SymlinksDatasetFacetIdentifiers newSymlinksDatasetFacetIdentifiers(String namespace, String name, String type) Factory method for SymlinksDatasetFacetIdentifiers- Parameters:
namespace
- The dataset namespacename
- The dataset nametype
- Identifier type- Returns:
- SymlinksDatasetFacetIdentifiers
-
newSymlinksDatasetFacetIdentifiersBuilder
public OpenLineage.SymlinksDatasetFacetIdentifiersBuilder newSymlinksDatasetFacetIdentifiersBuilder()Creates a builder for SymlinksDatasetFacetIdentifiers- Returns:
- a new builder for SymlinksDatasetFacetIdentifiers
-
newDataQualityAssertionsDatasetFacetAssertions
public OpenLineage.DataQualityAssertionsDatasetFacetAssertions newDataQualityAssertionsDatasetFacetAssertions(String assertion, Boolean success, String column) Factory method for DataQualityAssertionsDatasetFacetAssertions- Parameters:
assertion
- Type of expectation test that dataset is subjected tosuccess
- the successcolumn
- Column that expectation is testing. It should match the name provided in SchemaDatasetFacet. If column field is empty, then expectation refers to whole dataset.- Returns:
- DataQualityAssertionsDatasetFacetAssertions
-
newDataQualityAssertionsDatasetFacetAssertionsBuilder
public OpenLineage.DataQualityAssertionsDatasetFacetAssertionsBuilder newDataQualityAssertionsDatasetFacetAssertionsBuilder()Creates a builder for DataQualityAssertionsDatasetFacetAssertions- Returns:
- a new builder for DataQualityAssertionsDatasetFacetAssertions
-
newParentRunFacetRun
Factory method for ParentRunFacetRun- Parameters:
runId
- The globally unique ID of the run associated with the job.- Returns:
- ParentRunFacetRun
-
newParentRunFacetRunBuilder
Creates a builder for ParentRunFacetRun- Returns:
- a new builder for ParentRunFacetRun
-
newColumnLineageDatasetFacetFieldsAdditionalInputFields
public OpenLineage.ColumnLineageDatasetFacetFieldsAdditionalInputFields newColumnLineageDatasetFacetFieldsAdditionalInputFields(String namespace, String name, String field, List<OpenLineage.ColumnLineageDatasetFacetFieldsAdditionalInputFieldsTransformations> transformations) Factory method for ColumnLineageDatasetFacetFieldsAdditionalInputFields- Parameters:
namespace
- The input dataset namespacename
- The input dataset namefield
- The input fieldtransformations
- the transformations- Returns:
- ColumnLineageDatasetFacetFieldsAdditionalInputFields
-
newColumnLineageDatasetFacetFieldsAdditionalInputFieldsBuilder
public OpenLineage.ColumnLineageDatasetFacetFieldsAdditionalInputFieldsBuilder newColumnLineageDatasetFacetFieldsAdditionalInputFieldsBuilder()Creates a builder for ColumnLineageDatasetFacetFieldsAdditionalInputFields- Returns:
- a new builder for ColumnLineageDatasetFacetFieldsAdditionalInputFields
-
newOutputDatasetFacet
- Returns:
- OutputDatasetFacet
-
newStaticDataset
public OpenLineage.StaticDataset newStaticDataset(String namespace, String name, OpenLineage.DatasetFacets facets) Factory method for StaticDataset- Parameters:
namespace
- The namespace containing that datasetname
- The unique name for that dataset within that namespacefacets
- The facets for this dataset- Returns:
- StaticDataset
-
newStaticDatasetBuilder
Creates a builder for StaticDataset- Returns:
- a new builder for StaticDataset
-
newColumnLineageDatasetFacetFields
Factory method for ColumnLineageDatasetFacetFields- Returns:
- ColumnLineageDatasetFacetFields
-
newColumnLineageDatasetFacetFieldsBuilder
public OpenLineage.ColumnLineageDatasetFacetFieldsBuilder newColumnLineageDatasetFacetFieldsBuilder()Creates a builder for ColumnLineageDatasetFacetFields- Returns:
- a new builder for ColumnLineageDatasetFacetFields
-
newSymlinksDatasetFacet
public OpenLineage.SymlinksDatasetFacet newSymlinksDatasetFacet(List<OpenLineage.SymlinksDatasetFacetIdentifiers> identifiers) Factory method for SymlinksDatasetFacet- Parameters:
identifiers
- the identifiers- Returns:
- SymlinksDatasetFacet
-
newSymlinksDatasetFacetBuilder
Creates a builder for SymlinksDatasetFacet- Returns:
- a new builder for SymlinksDatasetFacet
-
newStorageDatasetFacet
public OpenLineage.StorageDatasetFacet newStorageDatasetFacet(String storageLayer, String fileFormat) Factory method for StorageDatasetFacet- Parameters:
storageLayer
- Storage layer provider with allowed values: iceberg, delta.fileFormat
- File format with allowed values: parquet, orc, avro, json, csv, text, xml.- Returns:
- StorageDatasetFacet
-
newStorageDatasetFacetBuilder
Creates a builder for StorageDatasetFacet- Returns:
- a new builder for StorageDatasetFacet
-
newColumnLineageDatasetFacetFieldsAdditionalInputFieldsTransformations
public OpenLineage.ColumnLineageDatasetFacetFieldsAdditionalInputFieldsTransformations newColumnLineageDatasetFacetFieldsAdditionalInputFieldsTransformations(String type, String subtype, String description, Boolean masking) Factory method for ColumnLineageDatasetFacetFieldsAdditionalInputFieldsTransformations- Parameters:
type
- The type of the transformation. Allowed values are: DIRECT, INDIRECTsubtype
- The subtype of the transformationdescription
- a string representation of the transformation appliedmasking
- is transformation masking the data or not- Returns:
- ColumnLineageDatasetFacetFieldsAdditionalInputFieldsTransformations
-
newColumnLineageDatasetFacetFieldsAdditionalInputFieldsTransformationsBuilder
public OpenLineage.ColumnLineageDatasetFacetFieldsAdditionalInputFieldsTransformationsBuilder newColumnLineageDatasetFacetFieldsAdditionalInputFieldsTransformationsBuilder()Creates a builder for ColumnLineageDatasetFacetFieldsAdditionalInputFieldsTransformations- Returns:
- a new builder for ColumnLineageDatasetFacetFieldsAdditionalInputFieldsTransformations
-
newInputDataset
public OpenLineage.InputDataset newInputDataset(String namespace, String name, OpenLineage.DatasetFacets facets, OpenLineage.InputDatasetInputFacets inputFacets) Factory method for InputDataset- Parameters:
namespace
- The namespace containing that datasetname
- The unique name for that dataset within that namespacefacets
- The facets for this datasetinputFacets
- The input facets for this dataset.- Returns:
- InputDataset
-
newInputDatasetBuilder
Creates a builder for InputDataset- Returns:
- a new builder for InputDataset
-
newDatasourceDatasetFacet
Factory method for DatasourceDatasetFacet- Parameters:
name
- the nameuri
- the uri- Returns:
- DatasourceDatasetFacet
-
newDatasourceDatasetFacetBuilder
Creates a builder for DatasourceDatasetFacet- Returns:
- a new builder for DatasourceDatasetFacet
-
newLifecycleStateChangeDatasetFacet
public OpenLineage.LifecycleStateChangeDatasetFacet newLifecycleStateChangeDatasetFacet(OpenLineage.LifecycleStateChangeDatasetFacet.LifecycleStateChange lifecycleStateChange, OpenLineage.LifecycleStateChangeDatasetFacetPreviousIdentifier previousIdentifier) Factory method for LifecycleStateChangeDatasetFacet- Parameters:
lifecycleStateChange
- The lifecycle state change.previousIdentifier
- Previous name of the dataset in case of renaming it.- Returns:
- LifecycleStateChangeDatasetFacet
-
newLifecycleStateChangeDatasetFacetBuilder
public OpenLineage.LifecycleStateChangeDatasetFacetBuilder newLifecycleStateChangeDatasetFacetBuilder()Creates a builder for LifecycleStateChangeDatasetFacet- Returns:
- a new builder for LifecycleStateChangeDatasetFacet
-