Spelling Check

Marco_13112001
edited December 2023 in General

I have quite a few datasets with spelling problems, or words where a number appears instead of a letter and vice versa, e.g. Flat I (the 1 is replaced by an "l") or FIeming (the "l" is replaced by a capital "i"). What is your advice for handling this issue?

Best Answer

  • Josh Boxer
    Administrator
    Answer ✓

    This is a good question. Spell check would not work for addresses, because so many 'correct' words would be flagged.

    If there are a handful of common issues that you wanted to flag then you could build out a Contains function:

    [image: screenshot of the Contains function]

    If you are confident that the correct things are being detected then work on updating this to a Replace.
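
    Since the screenshot is not available here, the flag-then-replace idea can be sketched outside the tool. This is only an illustration in Python, not the product's own function syntax; the lookup table and the function names are hypothetical, and the entries are taken from the examples in the question.

```python
import re

# Hypothetical lookup of known bad values and their corrections.
# "FIeming" has a capital "i" where an "l" belongs;
# "Flat l" has a lowercase "L" where the digit 1 belongs.
KNOWN_FIXES = {
    "FIeming": "Fleming",
    "Flat l": "Flat 1",
}


def _pattern(bad):
    # Word boundaries stop "Flat l" from matching inside "Flat lane".
    return re.compile(r"\b" + re.escape(bad) + r"\b")


def flag_known_issues(value):
    """Stage 1 ("Contains"): list the known bad values found in the text."""
    return [bad for bad in KNOWN_FIXES if _pattern(bad).search(value)]


def replace_known_issues(value):
    """Stage 2 ("Replace"): apply the corrections once the flags look right."""
    for bad, good in KNOWN_FIXES.items():
        value = _pattern(bad).sub(good, value)
    return value
```

    Running the flagging stage on its own first makes it easy to review what would change before any value is overwritten.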

    You could make it more generic using Regex to find any words containing numbers for example:

    [image: screenshot of the Regex function]

    Might want to expand to ignore 'words' that are just numbers.
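
    As a rough sketch of that idea (again in Python rather than the tool's own function editor, with a hypothetical pattern and function name), the regex below finds tokens that mix letters and digits while skipping tokens made only of digits:

```python
import re

# A token qualifies only if it contains at least one letter (first lookahead)
# and at least one digit (second lookahead), so "12" and "Road" are ignored.
MIXED_TOKEN = re.compile(
    r"\b(?=[A-Za-z0-9]*[A-Za-z])(?=[A-Za-z0-9]*\d)[A-Za-z0-9]+\b"
)


def find_mixed_tokens(value):
    """Return every token in the text that mixes letters and digits."""
    return MIXED_TOKEN.findall(value)
```

    Note that a legitimate house number such as 12A would also be flagged, so the results are probably best treated as candidates for review rather than confirmed errors.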

    An example function using Regex here: https://community.experianaperture.io/discussion/570/invalid-character-for-names

    Interested to see if anyone has any better ideas to solve this issue.

Answers

  • @Sueann See no, we are not using the validate addresses step. The dataset was created with these issues so we could simulate the process, and I couldn't find an answer in the "normal" function list. However, your answer, combined with the one from @Josh Boxer, gave me a path to solve the issue.