AWS Glue FindMatches now supports incrementally matching new data against an existing dataset

The FindMatches ML transform in AWS Glue can now match newly arrived data against an existing, already-matched dataset. FindMatches identifies duplicate or matching records in a dataset, even when the records have no common unique identifier and no fields match exactly. This makes it faster and easier to clean and deduplicate datasets.
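
To make the idea of incremental matching concrete, here is a minimal sketch in plain Python. It is not the FindMatches algorithm or the AWS Glue API: FindMatches uses a trained ML model, while this toy uses `difflib` string similarity from the standard library. The function names, the `match_id` field, the sample records, and the 0.8 threshold are all assumptions made for illustration.

```python
from difflib import SequenceMatcher


def similarity(a, b, fields):
    """Average per-field string similarity in [0, 1] (a toy stand-in for a trained model)."""
    scores = [
        SequenceMatcher(None, str(a[f]).lower(), str(b[f]).lower()).ratio()
        for f in fields
    ]
    return sum(scores) / len(scores)


def match_incremental(existing, incoming, fields, threshold=0.8):
    """Label each incoming record with the match_id of its most similar
    existing record, or with a fresh id if nothing reaches the threshold.
    The existing records are never re-matched against each other."""
    next_id = max((r["match_id"] for r in existing), default=0) + 1
    result = []
    for rec in incoming:
        best = max(existing, key=lambda e: similarity(rec, e, fields), default=None)
        if best is not None and similarity(rec, best, fields) >= threshold:
            match_id = best["match_id"]
        else:
            match_id = next_id
            next_id += 1
        result.append({**rec, "match_id": match_id})
    return result


# Hypothetical, already-matched dataset (no shared unique identifier).
existing = [
    {"name": "Jonathan Smith", "city": "Seattle", "match_id": 1},
    {"name": "Maria Garcia", "city": "Austin", "match_id": 2},
]

# Hypothetical newly arrived records; no field matches an existing one exactly.
incoming = [
    {"name": "Jonathon Smyth", "city": "Seatle"},
    {"name": "Wei Chen", "city": "Boston"},
]

matched = match_incremental(existing, incoming, ["name", "city"])
for rec in matched:
    print(rec)
```

With these sample records, the misspelled "Jonathon Smyth" inherits the `match_id` of "Jonathan Smith", while "Wei Chen" receives a new id. The point of the sketch is that only the incoming records are scored, so the existing dataset does not need to be reprocessed.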

Source: Amazon AWS