FindMatches class
Identifies matching records in the input DynamicFrame
and creates a new DynamicFrame
with a unique identifier assigned to each group of matching records.
To import:
from awsglueml.transforms import FindMatches
Methods
apply(frame, transformId, transformation_ctx = "", info = "", stageThreshold = 0, totalThreshold = 0, enforcedMatches = none, computeMatchConfidenceScores = 0)
Identifies matching records in the input DynamicFrame
and creates a new DynamicFrame
with a unique identifier assigned to each group of matching records.
frame
– TheDynamicFrame
to apply the FindMatches transform. Required.transformId
– The unique ID associated with the FindMatches transform to apply on records in theDynamicFrame
. Required.transformation_ctx
– A unique string that is used to identify stats/state information. Optional.info
– A string to be associated with errors in the transformation. Optional.stageThreshold
– The maximum number of errors that can occur in the transformation before it errors out. Optional. The default is zero.totalThreshold
– The maximum number of errors that can occur overall before processing errors out. Optional. The default is zero.enforcedMatches
– TheDynamicFrame
used to enforce matches. Optional. The default is None.computeMatchConfidenceScores
– A Boolean value indicating whether to compute a confidence score for each group of matching records. Optional. The default is false.
Returns a new DynamicFrame
with a unique identifier assigned to each group of matching records.