Row Quality is not accurate
Postcode | Postcode - Weight | Postcode should not be null for a UK address | Postcode should be a valid UK postcode | First Address Line | First Address Line - Weight | First Address Line should not be null | First Address Line should not contain special characters | Primary Email address | Primary Email address - Weight | Primary Email Address should not be null | Primary Email Address should be in a valid format | Internal Reference | Internal Reference - Weight | Internal Reference should not be null | Country of Tax Residence | Country of Tax Residence - Weight | Country of Tax Residence should not be null | External Reference | External Reference - Weight | External Reference should not be null | External Reference should be in a valid format | Row quality |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Pass | 1 | Pass | Pass | Pass | 1 | Pass | Pass | Fail | 1 | Fail | Ignore | Pass | 1 | Pass | Pass | 1 | Pass | Pass | 1 | Pass | Pass | 88.89 |
Hi, have a look at this example above. This data has been copied and pasted from a validate step choosing the option 'Show all rows'. Why is the row quality 88.89% rather than 90%? There are 10 rules and only 1 of them failed for this particular row.
Comments
-
Postcode
Postcode - Weight
Postcode should not be null for a UK address
Postcode should be a valid UK postcode
First Address Line
First Address Line - Weight
First Address Line should not be null
First Address Line should not contain special characters
Primary Email address
Primary Email address - Weight
Primary Email Address should not be null
Primary Email Address should be in a valid format
Internal Reference
Internal Reference - Weight
Internal Reference should not be null
Country of Tax Residence
Country of Tax Residence - Weight
Country of Tax Residence should not be null
External Reference
External Reference - Weight
External Reference should not be null
External Reference should be in a valid format
Row quality
Pass
1
Pass
Pass
Pass
1
Pass
Pass
Pass
1
Pass
Pass
Pass
1
Pass
Pass
1
Pass
Fail
1
Pass
Fail
90
Here's another row example from the same validate step. This row has also failed only 1 rule, and the row quality is 90% which is expected.
0 -
There are 9 rules, not 10, as 'Primary Email Address should be in a valid format' has returned 'Ignore' for this row.
1 / 9 = 0.1111 »» 100 - 11.11 = 88.89
0 -
I see. This affects the solution you mentioned here (I can't seem to comment on my other post):
Is there a way to replicate the COUNTIF formula? — Experian Data Quality Community
In one of the steps I need to specify the number of rules, but how can I do that dynamically for each row, taking into account any ignored rules in each row?
0 -
Suggested another approach in that thread using Replace and Add
0
