-
Help with Blocking Keys for Phone Numbers not generating results
Hi all, I'm trying to create two blocking keys for phone numbers, one that works off the first 5 digits, and one that works off the last 5 digits. I've tried several variations of this both in with other blocking keys and now on their own, and they are not generating any results, can anybody help with suggestions of where…
-
How long can/should the "Analyze blocking keys" function take?
Morning all, I'm getting up and running with the find duplicates feature and have created my own blocking keys and matching rules now which are producing good results for the clustering, however when I try to analyse it in "Find Duplicates Workbench" the "Analyze blocking keys" tool is taking an extremely long time to…
-
How to identify blocking keys for Find Duplicates
Blocking keys identifies records that are similar, creating blocks or potential groups of matches. Let’s look at an example where you have a list of names and date of birth that may contain duplicates. The rule of thumb is to be able to identify any possible chances of matches. Which elements would you use to say that any…
-
Tips and Tricks to make Find Duplicates Blocking Keys and Ruleset more readable
What is your first impression of the blocking keys and ruleset definitions required for Find Duplicates? We have observed that it may take a bit of learning to understand the syntax and structure to correctly update the keys and rules. Here are a few tips and tricks to help ease your experience with reading and updating…
-
Tuning Blocking Keys
Review Find Duplicates step results Reviewing the Find Duplicates results may not be the best way to confirm the effectiveness of the blocking keys. However, it does help to reveal obvious issues that may trigger further investigation. Once a set of Blocking Keys and Rules has been established at the Find Duplicates…
-
Building Rules
In order to start testing the Find Duplicates step, the Find Duplicates settings will also need to have a ruleset defined in addition to the blocking keys. When building rules, we will have to think about the following: How the Ruleset relates to Blocking Keys Blocking keys identifies potential matches. Rules determines if…
-
Exact match, fuzzy match and de-duplication with Find Duplicates
Hi everyone, I'm starting a series of articles all about matching and linking records to find duplicates in Aperture Data Studio with the intention to encourage some learning and interaction. Start here: Why worry about duplicated data? Simple ways to identify and resolve duplicated data in Aperture Data studio: Exact…
-
Quick introduction to Blocking Keys and Rules for Find Duplicates
The concept of Blocking Keys and Rules may be foreign to you if you haven’t already used the Find Duplicates step in Aperture Data Studio. Blocking keys identifies records that are similar, creating blocks or potential groups of matches. Rules compares every set of records in the resulting blocks, returning…
-
How do we expand on the provided base blocking keys and rule sets on Aperture Data Studio?
Hi team I am asking this question on behalf of one of our Credit Services team members. They are looking to create Blocking keys and Rules that accommodate for a text string (Drivers License) and date of birth, but is running into issues. The rules that they are currently using are part of the attachments. Is there an easy…