[Beta]{class="badge informative"}

Deduplication deduplication

The Deduplication activity is a Targeting activity. This activity allows you to delete duplicates in the result(s) of the inbound activities. The Deduplication activity is generally used following targeting activities and before activities that allow the use of targeted data.

Configuration

Follow these steps to configure the Scheduler activity:

  1. Add a Deduplication activity to your workflow.

  2. In the Fields to identify duplicates section, click the Add attribute button to specify the fields for which the identical values allow the duplicates to be identified: email address, first name, last name, etc. The order of the fields allows you to specify those to process first.

  3. Select the number of unique Duplicates to keep. The default value for this field is 1. The value 0 allows you to keep all the duplicates.

  4. Select the Deduplication method to use:

    • Random selection: randomly selects the record to be kept out of the duplicates.
    • Using an expression: this lets you keep the records in which the value of the expression entered is the smallest or the biggest. ++ Expression ++ Sort
    • Following a list of values: lets you define a value priority for one or more fields. To define the values, click Attribute to select a field or create an expression, then add the value(s) into the appropriate table. To define a new field, click the Add button located above the list of values. ++ Sort
  5. Check the Generate complement option if you wish to exploit the remaining population. The complement consists of all the duplicates. An additional transition will then be added to the activity.

Example

recommendation-more-help