Get Started with Datasets

All data that is ingested into Adobe Experience Platform is persisted within the Data Lake as datasets. A dataset is a storage and management construct for a collection of data, typically a table, that contains a schema (columns) and fields (rows).

Access datasets

The Datasets workspace in the Adobe Journey Optimizer user interface allows you to explore data and create datasets.

Select Datasets in the left navigation to open the Datasets dashboard.

Adding data to Adobe Experience Platform is the foundation for building a Profile. You can then leverage profiles in Adobe Journey Optimizer. First define schemas, then use ETL tools to prepare and standardize your data, and finally create datasets based on your schemas.
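For readers who script this step instead of using the interface, the following is a minimal sketch of what a request to create a dataset from an existing schema could look like. It only builds the request and sends nothing. The endpoint URL, header names, and the `schemaRef` body shape are assumptions based on the Experience Platform Catalog Service API and should be verified against the API reference. The function name `build_create_dataset_request` and all placeholder values are hypothetical.

```python
import json

# Assumed endpoint and content type; verify both against the Catalog Service API reference.
CATALOG_DATASETS_URL = "https://platform.adobe.io/data/foundation/catalog/dataSets"
XDM_CONTENT_TYPE = "application/vnd.adobe.xed+json;version=1"


def build_create_dataset_request(name, schema_id, token, api_key, org_id, sandbox):
    """Return (url, headers, json_body) for a create-dataset call. Nothing is sent."""
    headers = {
        "Authorization": f"Bearer {token}",
        "x-api-key": api_key,
        "x-gr-org-id": org_id,
        "x-sandbox-name": sandbox,
        "Content-Type": "application/json",
    }
    body = {
        "name": name,
        # The dataset adheres to the schema referenced here.
        "schemaRef": {"id": schema_id, "contentType": XDM_CONTENT_TYPE},
    }
    return CATALOG_DATASETS_URL, headers, json.dumps(body)


# Placeholder values only; replace them with your own credentials and schema ID.
url, headers, payload = build_create_dataset_request(
    name="Loyalty members",
    schema_id="{SCHEMA_ID}",
    token="{ACCESS_TOKEN}",
    api_key="{API_KEY}",
    org_id="{ORG_ID}",
    sandbox="{SANDBOX_NAME}",
)
print(payload)
```

Keeping request construction separate from sending makes the payload easy to inspect before any call is made.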

Select the Browse tab to display the list of all available datasets for your organization. Details are displayed for each listed dataset, including its name, the schema the dataset adheres to, and the status of the most recent ingestion run.

By default, only the datasets that you have ingested data into are shown. To see the system-generated datasets, enable the Show system datasets toggle in the filter.

Select the name of a dataset to access its Dataset activity screen and see details of the dataset you selected. The activity tab includes a graph visualizing the rate of messages being consumed as well as a list of successful and failed batches.
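The batch list on the activity screen amounts to a per-status tally. The sketch below illustrates that idea on made-up records. The field names (`id`, `status`, `records`) and the status values are hypothetical and do not reflect the actual batch format.

```python
from collections import Counter

# Hypothetical batch records; field names and status values are illustrative only.
batches = [
    {"id": "batch-001", "status": "success", "records": 1200},
    {"id": "batch-002", "status": "failed", "records": 0},
    {"id": "batch-003", "status": "success", "records": 800},
]


def summarize_batches(batches):
    """Count successful and failed batches and total the records from successful ones."""
    counts = Counter(batch["status"] for batch in batches)
    ingested = sum(b["records"] for b in batches if b["status"] == "success")
    return {
        "successful": counts["success"],
        "failed": counts["failed"],
        "records_ingested": ingested,
    }


summary = summarize_batches(batches)
print(summary)
```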

Preview datasets

From the Dataset activity screen, select Preview dataset near the top-right corner of your screen to preview the most recent successful batch in this dataset. When a dataset is empty, the preview link is deactivated.

Create datasets

To create a new dataset, start by selecting Create dataset in the Datasets dashboard.

You can:

Watch this video to learn how to create a dataset, map it to a schema, add data to it, and confirm that the data has been ingested.

Data Governance

In a dataset, browse the Data Governance tab to check labels at the dataset and field level. Data Governance labels categorize data according to the type of policies that apply.
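To make the two levels concrete, here is a minimal sketch of how dataset-level and field-level labels could combine. It assumes that a label applied to the dataset also applies to every field in it. The label codes and field paths are illustrative, and the helper `effective_labels` is hypothetical.

```python
# Hypothetical label assignments; label codes and field paths are illustrative.
dataset_labels = {"C2"}
field_labels = {
    "person.name": {"I1"},
    "loyalty.points": set(),
}


def effective_labels(dataset_labels, field_labels):
    """Combine dataset-level labels with each field's own labels (assumed inheritance)."""
    return {
        field: sorted(dataset_labels | labels)
        for field, labels in field_labels.items()
    }


labels_by_field = effective_labels(dataset_labels, field_labels)
for field, labels in labels_by_field.items():
    print(field, labels)
```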

One of the core capabilities of Adobe Experience Platform is to bring data from multiple enterprise systems together to better allow marketers to identify, understand, and engage customers. This data may be subject to usage restrictions defined by your organization or by legal regulations. It is therefore important to ensure that your data operations are compliant with data usage policies.

Adobe Experience Platform Data Governance allows you to manage customer data and ensure compliance with regulations, restrictions, and policies applicable to data use. It plays a key role within Experience Platform at various levels, including cataloging, data lineage, data usage labeling, data usage policies, and controlling usage of data for marketing actions.

Learn more about Data Governance and data usage labels in the Data Governance documentation.

Samples and use cases

Learn how to create a schema and a dataset, then ingest data to add test profiles in Adobe Journey Optimizer, in this end-to-end sample.

Learn more about dataset creation in the Adobe Experience Platform documentation.

Learn how to use the Datasets UI in the Data Ingestion overview documentation.

A list of use cases with query examples is available here.
