Look-alike audiences guide

IMPORTANT
Look-alike insights and Look-alike audiences are only available in the B2C edition.

In Adobe Experience Platform, Look-alike audiences provide intelligent insights on each of your audiences, leveraging machine-learning-based insights to identify and target high-value customers with your marketing campaigns.

With Look-alike audiences, you can create expanded audiences that target customers similar to your high-performing audiences or target customers similar to previously converted audiences.

Terminology terminology

Before getting started with Look-alike audiences, make sure to understand the following concepts:

  • Base audience: The base audience is the audience that you want to find out more insights about. This is the audience that the look-alike model is based on.
  • Look-alike model: A look-alike model is a machine learning model that is trained on every eligible base audience without any customer input. Each look-alike model creates the influential factors and similarity graphs. A look-alike model does not get scored.
  • Look-alike audience: A Look-alike audience is the audience that is created when a look-alike model with a selected similarity threshold is applied to the base audience. You can create multiple Look-alike audiences using the same look-alike model. The Look-alike audience is what gets scored.
  • Total addressable audience size: The total addressable audience size is the total number of profiles in the past 30 days minus the base audience population in the past 30 days. For example, if a customer has 10 million profiles in the past 30 days, and the base audience has 1 million profiles in the past 30 days, the total addressable audience size is 9 million profiles.

Eligibility eligibility

In order to use look-alike insights, the base audience must meet the following eligibility criteria:

  • The base audience must be created within Platform.
    • Externally-generated audiences are not eligible for look-alike insights.
  • The base audience must be on the default merge policy.
  • The base audience must not use fields that are restricted by data governance.

Look-alike model details details

In Adobe Experience Platform, the look-alike model consumes three different types of data points:

  • Audience membership over the past 30 days
  • Experience events over the past 30 days that have been ingested in the Real-Time Customer Profile
  • Profile attributes over the past 30 days that have been ingested in the Real-Time Customer Profile

All of these data points are turned into key value pairs which are fed into the look-alike model. Only the key value pairs with a significant percentage of profiles matching will be kept.

At this time, the look-alike model is run every 24 hours, creating and re-creating the influential factors and similarity graphs for the base audiences. Scoring for the Look-alike audiences is also run frequently.

Entitlements entitlements

The following entitlements apply for usage of Look-alike audiences:

  • Real-Time CDP Prime customers are entitled to 5 active Look-alike audiences in production sandboxes
  • Real-Time CDP Ultimate customers are entitled to 20 active Look-alike audiences in production sandboxes
  • Development sandboxes are limited to 5 Look-alike audiences for all Real-Time CDP customers

Add-on packs, which will be available at a later date, increase the entitlements for production sandboxes by 20 Look-alike audiences per pack.

To confirm if you have access to Look-alike audiences, please contact your Adobe representative.

View look-alike insights view

Look-alike insights is built-in with the audience details page. To look at the look-alike insights for an audience, select Audiences in the left navigation bar, followed by Browse, and the audience you want to view the insights for.

The Audiences button is highlighted, as well as the base audience being used for look-alike modeling.

The audience details page appears. Select Look-alike insights tab to view the audience’s look-alike insights. The Look-alike insights page is displayed. This page has three main elements - the similarity and reach graph, the Look-alike audiences, and the influential factors.

The Look-alike insights tab is highlighted, displaying the look-alike insights for the base audience.

Similarity and reach similarity-and-reach

The similarity and reach section displays a graph that plots the expected reach of a Look-alike audience consisting of profiles above a given similarity score. The similarity score represents the distance of similarity between the base audience’s profile and the look-alike insight’s profile.

The similarity and reach graph is highlighted.

On this graph, the x-axis measures the similarity percentage between a profile and members of the selected audience. The similarity score ranges from 0% to 100%, with a higher similarity score indicating that a profile is closer, in terms of influential factor values, to members of the selected audience.

The y-axis shows the expected count of profiles with the similarity percentage that corresponds with the matching value of the x-axis. This expected count of profiles ranges from 0 to the total addressable audience size or 25 million profiles, whichever is lower. This axis is measured on a logarithmic scale to improve the readability of the graph.

Please note that the graph is cumulative from right to left. This means that at any point in the graph, the value of the y-axis is the number of profiles that have a similarity above the similarity threshold. For example, if the x-axis is at 60% and the y-axis is 10 million, this means that there are 10 million profiles which have a similarity at or above 60% to the base audience.

You can hover over a specific point in the graph to display the similarity percentage and the expected profile count for the currently highlighted point.

Look-alike audiences list

The Look-alike audiences section displays a list of all the Look-alike audiences that have been previously created for the selected base audience.

The Look-alike audiences section is highlighted.

Influential factors influential-factors

The influential factors section displays the top 100 factors that influence the look-alike model for the selected base audience. These influential factors are the profile attributes, the experience events, and the audience memberships that are the most important in explaining similarities in the base audience. Understanding the top influential factors lets you better personalize your marketing content for this audience and any Look-alike audience you create from it. Please note that not all the influential factors that affect the look-alike model will be displayed.

For influential factors that are numeric, the key value pairs may be put into buckets, depending on the number of different values that key has. For example, if you have a key of income, there most likely would be many unique values. As a result, the key value pairs will be placed into buckets tha could look like income=[0 -> 30000], income=[30000 -> 50000], and income=[50000 -> 100000].

These buckets are regularly re-computed to ensure the data is kept up-to-date.

The influential factors section is highlighted.

NOTE
The influential factors are sorted in order of importance and are independent of each other.
Field
Description
Type
The type of data that the influential factor is derived from. This can be a profile attribute, an experience event, or an audience membership.
Key
The name of the data field. For keys of the audience membership type, this value represents the namespace of the audience where the data comes from. Possible values include ups (Segmentation Service) and AO (Audience Orchestration). For keys of other types, this value represents the XDM field path. For example, if the company Luma has a custom field called income, the key would be _luma.income
Value
The value varies depending on the influential factor that it represents. For profile attributes or experience events, this field represents the value or value range of the data field that indicates the similarity to the members of the base audience. The value range is written in the form [A -> B], where A represents the lower range while B represents the higher range. For audience memberships, this field is the name of the audience.
Importance
The relative level of importance of the influential factor. This can be high, medium, or low.

Create a Look-alike audience create

IMPORTANT
You cannot use a Look-alike audience as the base audience for another Look-alike audience. That is to say, you cannot create chained Look-alike audiences.

To create a Look-alike audience, you’ll need to select the audience you want to base the Look-alike audience off of. To access your list of available audiences, select Audiences in the left navigation bar, followed by Browse. The list of audiences appears. On this page, you can select the audience you want to use as your base audience.

The Audiences button is highlighted, as well as the base audience being used for look-alike modeling.

On the audience details page, select Create look-alike audience to begin the process of creating a Look-alike audience.

The Create look-alike audience button is highlighted.

The Create a look-alike audience popover appears. On this page, you can set the similarity percentage for the Look-alike audience.

The Create a look-alike audience popover is displayed.

You can set this similarity percentage in three different ways:

  • Move the slider to set the similarity percentage
  • Enter the similarity percentage in the numeric entry box next to the slider
  • Hover over the graph and select the desired location to set the similarity percentage

You can also update details about the Look-alike audience, including its name and description. By default, the Look-alike audience’s name will be generated based on the base audience’s name and the similarity percentage previously specified.

The basic information is highlighted within the Create a look-alike audience popover.

Select Create to finish creating your Look-alike audience.

The create button is highlighted within the Create a look-alike audience popover.

The newly created Look-alike audience can be accessed in the Look-alike audiences section of the audience details page, and is also available in the Audience Portal and for other downstream usages. Please note that it will take some time for the Look-alike audience to be scored. Until it is scored, the profile count will be appear to be 0.

View Look-alike audience details view-details

To view details of a Look-alike audience, select the Look-alike audience in the Look-alike audiences section of the base audience.

The Look-alike audiences section is highlighted.

The audience details page appears. For more information on this page, please read the audience details section of the Audience Portal overview.

Details of the Look-alike audience are displayed.

Exclude data fields from look-alike modeling exclude

IMPORTANT
You are responsible for ensuring that data, including sensitive data, is labeled appropriately and that the data usage policies have been defined and enabled to comply with the legal and regulatory obligations under which you operate. You should also be aware that the data fields or segment memberships that are not directly correlated with data fields typically associated with sensitive or protected data types can be a source of potential bias. You are responsible in analyzing your data to identify, label, and apply the appropriate data usage policies to your data, including any data fields that may proxy for sensitive or protected data types and should be excluded from modeling.

Look-alike audiences can be configured to exclude data fields that are restricted for the “Data Science” marketing action by applying the relevant data usage labels and policies. Data that is labeled as restricted from use for data science will be removed from consideration when training a Look-alike audience model and when generating a Look-alike audience from the trained model.

NOTE
Changes to the data usage labels on the base audience may take up to 48 hours to take effect.

The standard “C9” label can be used to label data that should not be used for data science and can be enforced by enabling the standard “Restrict data science” policy. You can also create additional policies to restrict data with other labels, including sensitive labels, from usage for data science. For more information on managing data usage policies, please read the data usage policies UI guide. For more information on managing data usage labels, please read the data usage labels UI guide.

By default, if a base audience has no contract labels, the modeling process for Look-alike audiences will exclude any field, dataset, or audience based on the enabled privacy policy for your organization.

Next steps

After reading this guide, you have learned how to view look-alike insights and create Look-alike audiences based on these insights. For more information on audiences in the Adobe Experience Platform UI, please read the Segmentation Service UI guide.

recommendation-more-help
770bc05d-534a-48a7-9f07-017ec1e14871