Experian Data Quality Community
Learn, collaborate and solve problems with like-minded data quality enthusiasts.
Discussion List
-
Post Code improvement
If a UK postcode is listed in format AB123CD AB12CD A12CD How can I strip this out to a normalised …
-
Is Unique
I've had a few questions around checking if a value is unique in a column. Voila: (Thanks @Ian Thor…
-
Extracting a specific piece of information from an address using regular expressions and lookups
Data Studio has several built-in functions to allow you to parse strings of text and extract a particu…
-
Workflow comments and notes
Is there any way to create and display comments or notes within a workflow? An example might be som…
-
How to trigger an e-mail notification if a job fails
Data Studio has a range of default notification options which allow you to send a custom email to d…
-
Checking data like social security numbers (SSN) for correctness
Social Security Numbers, National Identity numbers and the like are, at face value, simply strings o…
-
Removing value (NULL) from Excel spreadsheets
Hi, quick question: can Aperture handle a replace task for a specific set of values while loading?…
-
Sharing database connections across teams
What's the recommended approach to sharing tables/views and connections to databases across a wider…
-
Getting started with Regular Expressions in Data Studio
A regular expression (from here on we’ll call them “regex” for short) is a well-known and powerful …
-
Tips to improve workflow performance
If you ever find that a workflow is taking a long time to execute, you might want to think of ways …
-
Using a discriminant for Find Duplicates clustering
Find Duplicates will use fuzzy matching to link records. However, in some cases you may have a disc…
-
Data Studio v1.6.0 is now available!
The latest version of Data Studio introduces new features, existing functionality enhancements, per…
-
Creating a new data connection to Salesforce from Data Studio via JDBC
It is likely that quite soon after you start to use Data Studio you will want to connect Data Studi…
-
Validate Customer data example Workflow
This simple validation workflow leverages the Sample Data of Customer V1. The FILTER step filters t…
-
Unioning data example workflow with customer data
This workflow combines the sample data of Customer V1 and Customer V3 to demonstrate that the data …
-
Joining Data example for Purchase Order Headers and Lines
This workflow uses PO sample data. The PO header and rows are joined using the order number. This wo…
-
Changing source file schemae and data definitions
Legacy system files often come without a schema or data definition embedded in the file, and if the…
-
JDBC Performance issues can simply be about security
In Data Studio you have a number of different ways of accessing data. File Based methods Data conne…
-
Refresh your sources
Occasionally you may find that the staged sources fall out of synch in how Data Studio renders them…
-
How to resolve an unmapped source on a workflow
In a previous tip I suggested you might want to export and import your workflow between systems. …
-
Data Wrangling - is it so bad?
An interesting perspective from Pete Aven of MarkLogic popped up in my feed this week, written on …
-
Why might my Find Duplicates results look different?
If I use the Find Duplicates step on its own, in some instances I get more clusters of records (clu…
-
Do businesses run on premium data? New study assesses variables in data quality tools
Lisa Ehrlinger from Johannes Kepler Universität Linz, Austria, and her team have identified 66…
-
Jobs with large lookup files
At St. James's Place, prior to highlighting whether client details need to be quality checked, we n…
-
Data Studio v1.6.0 is coming soon!
Coming up in the next version of Data Studio: ability to select a predefined custom layout for the…
-
Finding your User Id in Aperture Data Studio
How do I find the import directory for my user (amongst others)? On the server, if you head to your…
-
Dealing with PII
I was talking to a prospect today about PII - identifying it on data loaded into the system and then proc…
-
Using 'Matches' functions to identify personal information (and automate the detection of this)
Following on from @Steve's discussion (here), I wanted to share some further thoughts on this subje…
-
When should I use a Match Lookup rather than a join?
I have a large amount of data (hundreds of millions of transactional records) that I need to match …
-
What's better: invalid data or missing data?
A discussion that seems to come up from time to time is whether it is better to have gaps in your …
-
Extract a number (6-char SIC code) from a string and perform a lookup (2-char SIC to get category)
Using some of the Companies House open data, I recently built a simple workflow that extracts the S…
-
Floating Point numbers with Microsoft Excel Open XML Spreadsheet (.XLSX) files
When using .XLSX files in Aperture, be aware that floating point numbers will be treated differentl…
-
Case sensitive menu and function search
When searching for functions in the left-hand side of the workflow, I have noticed that the search …
-
The importance of profiling when using Find duplicates
When implementing Find duplicates in Aperture Data Studio we've seen many examples of the importanc…
-
Great article on building loyalty with a Single Customer View!
Just sharing a great article written by @aysha_aktemur on how Aperture Data Studio empowers loyalty…
-
Removing an unwanted intermittent string from the start of an alphanumeric field
At St. James’s Place we needed to remove a string that occurred intermittently from the start of …
-
How can I drop and create a table when writing to a database?
I have a workflow which creates a table on an Oracle SQL Server. When the workflow is re-run, I wan…
-
Timeout during export
I have a workflow that is exporting to a new table in an external database, using the option to cre…
-
What happened to the Python step in Data Studio?
In Data Studio v1.1 there was a Python step. What has happened to that, and what are the options fo…
-
Data Studio v1.5.1 is now available!
This maintenance release resolves workflow and custom step backward compatibility issues as well as…
-
I cannot access any files I previously loaded through Data Explorer
Sometimes you will find that the view of files in the data explorer is out of sync with what is sto…
-
Data Studio v1.5.0 is now available!
The latest version of Data Studio introduces new features, existing functionality enhancements, per…
-
What is cross-field matching?
What do I do if I have multiple addresses or multiple phone number fields in my web forms but my cu…
-
Data Studio v1.5.0 is coming soon!
Coming up in the next version of Data Studio: Updated version of the address validation engine New…
-
Working with Dates
I have a set of data that contains some dates. What I would like to do is split the day, month and …
-
How do you work out where the row comes from?
When you're combining data from multiple data sources that potentially have duplicated data, you ma…
-
Solving for literal null values
Depending on the source that you are working with, you may find that you actually have literal null…
-
Dealing with character sets
Sometimes, the file that you want to use with Data Studio will contain characters that would come into…