Experian Data Quality Community
Learn, collaborate and solve problems with like-minded data quality enthusiasts.
Discussion List
-
Data Studio v1.6.0 is now available!The latest version of Data Studio introduces new features, existing functionality enhancements, per…
-
Creating a new data connection to Salesforce from Data Studio via JDBCIt is likely that quite soon after you start to use Data Studio you will want to connect Data Studi…
-
Validate Customer data example WorkflowThis simple validation workflow leverages the Sample Data of Customer V1, The FILTER step filters t…
-
Unioning data example workflow with customer dataThis workflow combines the sample data of Customer V1 and Customer V3 to demonstrate that the data …
-
Joining Data example for Purchase Order Headers and LinesThis workflow uses PO sample data The PO header and rows, is joined using the order number This wo…
-
Changing source file schemae and data definitionsLegacy system files often come without a schema or data definition embedded in the file, and if the…
-
JDBC Performance issues can simply be about securityIn Data Studio you have a number of different ways of accessing data. File Based methods Data conne…
-
Refresh your sourcesOccasionally you may find that the staged sources fall out of synch in how Data Studio renders them…
-
How to resolve a unmapped source on a workflowIn a previous tip I suggested you might want to export and import your workflow between systems. …
-
Importing and Exporting WorkflowsOne of the very first things that you will probably wonder, is just how portable are my workflows t…
-
Data Wrangling - is it so bad?An interesting perspective from Pete Aven of Marklogic, popped up in my feed this week, written on …
-
Why might my Find Duplicates results look different?If I use the Find Duplicates step on its own, in some instances I get more clusters of records (clu…
-
Do businesses run on premium data? New study assesses variables in data quality toolsLisa Ehrlinger from Johannes Kepler Universität Linz Linz, Austria, and her team have identified 66…
-
Jobs with large lookup filesAt St. James's Place, prior to highlighting whether client details need to be quality checked, we n…
-
Data Studio v1.6.0 is coming soon!Coming up in the next version of Data Studio: ability to select a predefined custom layout for the…
-
Finding your User Id in Aperture Data StudioHow do I find the import directory for my user (amongst others)? On the server, if you head to your…
-
Dealing with PIITalking to a prospect today about PII - identifying it on data loaded into the system and then proc…
-
Using 'Matches' functions to identify personal information (and automate the detection of this)Following on from @Steve's discussion (here), I wanted to share some further thoughts on this subje…
-
When should I use a Match Lookup rather than a joinI have a large amount of data (hundreds of millions of transactional records) that I need to match …
-
What's better invalid data or missing data?A discussion that seems to come up from time-to-time is, whether it is better to have gaps in your …
-
Extract a number (6-char SIC code) from a string and perform a lookup (2-char SIC to get category)Using some of the Company's House open data, I recently built a simple workflow that extracts the S…
-
Floating Point numbers with Microsoft Excel Open XML Spreadsheet (.XLSX) filesWhen using .XLSX files in Aperture, be aware that floating point numbers will be treated differentl…
-
Case sensitive menu and function searchWhen searching for functions in the left hand side of the workflow, I have noticed that the search …
-
The importance of profiling when using Find duplicatesWhen implementing Find duplicates in Aperture Data Studio we've seen many examples of the importanc…
-
Great article on building loyalty with a Single Customer View!Just sharing a great article written by @aysha_aktemur on how Aperture Data Studio empowers loyalty…
-
-
Removing an unwanted intermittent string from the start of an alphanumeric fieldAt St. James’s Place we needed to remove a string, that occurred intermittently, from the start of …
-
How can I drop and create a table when writing to a database ?I have a workflow which creates a table on an Oracle SQL Server. When the workflow is re-run, I wan…
-
Timeout during exportI have a workflow that is exporting to a new table in an external database, using the option to cre…
-
What happened to the Python step in Data Studio?In Data Studio v1.1 there was a Python Step - what has happened to that and what are the options fo…
-
Data Studio v1.5.1 is now available!This maintenance release resolves workflow and custom step backward compatibility issues as well as…
-
I cannot access any files I previously loaded through Data ExplorerSometimes you will find that the view of files in the data explorer is out of sync with what is sto…
-
Data Studio v1.5.0 is now available!The latest version of Data Studio introduces new features, existing functionality enhancements, per…
-
What is cross-field matching?What do i do if i have multiple addresses or multiple phone number fields in my web forms but my cu…
-
Data Studio v1.5.0 is coming soon!Coming up in the next version of Data Studio: Updated version of the address validation engine New…
-
Working with DatesI have a set of data that contains some dates What I would like to do, is split the day, month and …
-
How do you work out where the row comes from ?When you're combining data from multiple data sources that potentially have duplicated data, you ma…
-
Solving for literal null valuesDepending on the source that you are working with, you may find that you actually have literal null…
-
Dealing with character setsSometimes, the file that want to use with Data Studio will contain characters that would come into…
-
Fuzzy Matching logicHow does the fuzzy matching in the Find Duplicates step work?
-
Find Duplicates Step configurationWhat is the difference between the Name, Household and Address/Location choices in the default Fin…
-
Find duplicates step - why should I create custom rules?Find duplicates step - why should I create custom rules? The find duplicate step comes with three p…
-
What's your most used function?Which function in Data Studio do you use the most?
-
Work with all the dataOne of the key reasons to work with Data Studio is that you have the choice to work with all of the…
-
Data Studio v1.4.0 is now available!The latest version of Data Studio introduces several new features, performance improvements and a n…
-
What's the main reason you're using Data Studio?Feel free to leave a comment as well.