Experian Data Quality Community
Learn, collaborate and solve problems with like-minded data quality enthusiasts.
Discussion List
-
Unioning data example workflow with customer data
This workflow combines the sample data of Customer V1 and Customer V3 to demonstrate that the data …
-
Joining Data example for Purchase Order Headers and Lines
This workflow uses PO sample data. The PO header and rows are joined using the order number. This wo…
-
Changing source file schemas and data definitions
Legacy system files often come without a schema or data definition embedded in the file, and if the…
-
JDBC Performance issues can simply be about security
In Data Studio you have a number of different ways of accessing data: File Based methods, Data conne…
-
Refresh your sources
Occasionally you may find that the staged sources fall out of sync in how Data Studio renders them…
-
How to resolve an unmapped source on a workflow
In a previous tip I suggested you might want to export and import your workflow between systems. …
-
Data Wrangling - is it so bad?
An interesting perspective from Pete Aven of MarkLogic popped up in my feed this week, written on …
-
Why might my Find Duplicates results look different?
If I use the Find Duplicates step on its own, in some instances I get more clusters of records (clu…
-
Do businesses run on premium data? New study assesses variables in data quality tools
Lisa Ehrlinger from Johannes Kepler Universität Linz, Austria, and her team have identified 66…
-
Jobs with large lookup files
At St. James's Place, prior to highlighting whether client details need to be quality checked, we n…
-
Dealing with PII
Talking to a prospect today about PII - identifying it on data loaded into the system and then proc…
-
When should I use a Match Lookup rather than a join?
I have a large amount of data (hundreds of millions of transactional records) that I need to match …
-
What's better: invalid data or missing data?
A discussion that seems to come up from time to time is whether it is better to have gaps in your …
-
Extract a number (6-char SIC code) from a string and perform a lookup (2-char SIC to get category)
Using some of the Companies House open data, I recently built a simple workflow that extracts the S…
-
Floating Point numbers with Microsoft Excel Open XML Spreadsheet (.XLSX) files
When using .XLSX files in Aperture, be aware that floating point numbers will be treated differentl…
-
The importance of profiling when using Find duplicates
When implementing Find duplicates in Aperture Data Studio we've seen many examples of the importanc…
-
Great article on building loyalty with a Single Customer View!
Just sharing a great article written by @aysha_aktemur on how Aperture Data Studio empowers loyalty…
-
Removing an unwanted intermittent string from the start of an alphanumeric field
At St. James’s Place we needed to remove a string, that occurred intermittently, from the start of …
-
What happened to the Python step in Data Studio?
In Data Studio v1.1 there was a Python Step - what has happened to that and what are the options fo…
-
I cannot access any files I previously loaded through Data Explorer
Sometimes you will find that the view of files in the data explorer is out of sync with what is sto…
-
Working with Dates
I have a set of data that contains some dates. What I would like to do is split the day, month and …
-
How do you work out where the row comes from?
When you're combining data from multiple data sources that potentially have duplicated data, you ma…
-
Solving for literal null values
Depending on the source that you are working with, you may find that you actually have literal null…
-
Dealing with character sets
Sometimes, the file that you want to use with Data Studio will contain characters that would come into…
-
Fuzzy Matching logic
How does the fuzzy matching in the Find Duplicates step work?
-
Find Duplicates Step configuration
What is the difference between the Name, Household and Address/Location choices in the default Fin…