Symlinks Facet
The symlinks facet is used to list alternative identifiers for a single dataset. A dataset might be referenced by its physical location (e.g., a file path) in one context and by a logical name (e.g., a database table name) in another. This facet allows OpenLineage to understand that these different identifiers refer to the same entity, creating a unified lineage graph.
Fields Description
identifiers
: An array containing one or more alternative identifiers for the dataset.namespace
: The namespace of the alternative identifier (e.g., Glue Catalog).name
: The name of the dataset within the given namespace (e.g., Glue Table).type
: A string describing the type of the identifier.
namespace
, name
and type
are required fields
Example:
{
...
"inputs": {
"namespace": "s3://{bucket name}",
"name": "{object key}",
"facets": {
"symlinks": {
"_producer": "https://some.producer.com/version/1.0",
"_schemaURL": "https://openlineage.io/spec/facets/1-0-1/SymlinksDatasetFacet.json",
"identifiers": [
"namespace": "arn:aws:glue:{region}:{account id}",
"name": "table/{database name}/{table name}",
"type": "TABLE"
]
}
}
}
...
}
The facet specification can be found here.