Merge & Fuzzy Merge
Combining two datasets into one
Beyond connecting a single dataset, you can create a new dataset by merging two or more datasets. By clicking the blue plus sign you can add a second dataset. We have added 'Supplemental Churn'. Your data sources do not need to be the same format, for example, you can have one from a csv and another from salesforce.
Next select the blue plus sign again and go to 'Prepare data'. Select 'Merge'. You can now select the Primary Dataset and Supplementary Dataset to merge, as well as the column to match on. In the below example, two customer churn datasets are merged on the customerID column. Once the columns are selected press 'Merge Datasets'.
For additional control there are advanced options available as part of the merge function.
By default, the match sensitivity is a “fuzzy merge,” which means that the datasets will be merged, even if the column names aren’t completely identical.
Alternatively, you can select “Exact Match Only” under the match sensitivity section, which will only merge the datasets on exact column matches.
The next setting is “Merge Type,” which allows you to select between these two options:
- Keep all rows in primary dataset
- Keep only rows which appear in both datasets
Scored 0-1 this value demonstrates how similar the records are with 1 being a perfect match and 0 being completely different.