FindMatches class - AWS Glue

FindMatches class

Identifies matching records in the input DynamicFrame and creates a new DynamicFrame with a unique identifier assigned to each group of matching records.

To import:

from awsglueml.transforms import FindMatches

Methods

apply(frame, transformId, transformation_ctx = "", info = "", stageThreshold = 0, totalThreshold = 0, enforcedMatches = none, computeMatchConfidenceScores = 0)

Identifies matching records in the input DynamicFrame and creates a new DynamicFrame with a unique identifier assigned to each group of matching records.

  • frame – The DynamicFrame to apply the FindMatches transform. Required.

  • transformId – The unique ID associated with the FindMatches transform to apply on records in the DynamicFrame. Required.

  • transformation_ctx – A unique string that is used to identify stats/state information. Optional.

  • info – A string to be associated with errors in the transformation. Optional.

  • stageThreshold – The maximum number of errors that can occur in the transformation before it errors out. Optional. The default is zero.

  • totalThreshold – The maximum number of errors that can occur overall before processing errors out. Optional. The default is zero.

  • enforcedMatches – The DynamicFrame used to enforce matches. Optional. The default is None.

  • computeMatchConfidenceScores – A Boolean value indicating whether to compute a confidence score for each group of matching records. Optional. The default is false.

Returns a new DynamicFrame with a unique identifier assigned to each group of matching records.