Interface LineageRelation
public interface LineageRelation
Interface to be implemented by extension classes that extend
org.apache.spark.sql.sources.BaseRelation.
Implementing this interface allows for the extraction of lineage information from
BaseRelation objects. The methods getNamespace and getName, provided by DatasetIdentifier, must be implemented by the classes that implement this interface. This
identifier must follow the naming conventions outlined in the OpenLineage Naming Specification, ensuring
consistency across datasets for lineage tracking and data cataloging.
-
Method Summary
Modifier and TypeMethodDescriptionio.openlineage.client.utils.DatasetIdentifiergetLineageDatasetIdentifier(String sparkListenerEventName, io.openlineage.client.OpenLineage openLineage) Returns aDatasetIdentifiercontaining the namespace and name of the dataset for lineage tracking purposes.
-
Method Details
-
getLineageDatasetIdentifier
io.openlineage.client.utils.DatasetIdentifier getLineageDatasetIdentifier(String sparkListenerEventName, io.openlineage.client.OpenLineage openLineage) Returns aDatasetIdentifiercontaining the namespace and name of the dataset for lineage tracking purposes.- Parameters:
sparkListenerEventName- the name of the Spark listener event triggering the lineage extractionopenLineage- an instance ofOpenLineageused for lineage-related operations- Returns:
- a
DatasetIdentifierrepresenting the dataset associated with the event
-