Data Distiller capabilities

The Data Distiller capabilities section provides documentation links to more advanced Data Distiller features:

  • Data exploration: Learn how to explore, troubleshoot, and verify batch ingested data using SQL.
  • Derived datasets for Experience Platform applications: Learn how to create derived datasets to support complex and diverse use cases that maximize your data utility.
  • AI/ML pipelines: Learn about important concepts behind your preferred machine learning tools and how to build custom models that support your marketing use cases. This series of guides describes the necessary steps to build feature pipelines that prepare data from Experience Platform to feed custom models in your machine learning environment.
  • SQL insights: Learn about the key capabilities and required steps to develop an insights dashboard from SQL with Data Distiller.

The Query Service workspace with the Data Distiller capabilities section highlighted.

Select a quick link to navigate to the relevant Data Distiller dashboards Templates. Each accelerator provides powerful tools and visualizations to help you analyze audience data, optimize segmentation, and enhance targeting strategies.

  • Advanced audience overlaps: From this dashboard you can analyze audience intersections between multiple audience segments to uncover valuable insights and optimize segmentation strategies. You can also export your insights for further offline analysis or reporting purposes.
  • Audience comparison: From this dashboard, you can compare and contrast key audience metrics side-by-side to analyze two audience groups in detail. These insights help you understand audience size, growth, and other key performance indicators, enabling you to refine segmentation and optimize targeting strategies with data-driven decisions.
  • Audience trends: Use the Audience trends dashboard to visualize how your audiences evolve over time through key metrics like audience growth, identity counts, and single identity profiles. Track trends to uncover valuable insights into audience behavior, empowering you to refine segmentation, enhance engagement, and optimize targeting strategies for more effective campaigns.
    Track audience metrics over time to monitor changes in audience size, identity growth, and overall engagement.
  • Audience identity overlaps: Use the Audience Identity Overlaps dashboard to analyze identity overlaps within selected audiences. Visualizations and tabulated data provide insights to optimize identity stitching, reduce redundancy, and improve segmentation. These insights enable more effective targeting, enhanced personalization, and streamlined customer interactions.

The Query Service workspace with the Data Distiller accelerators section highlighted.

Data Distiller examples

Select a card to open documentation guides and examples to help you make the most of Data Distiller:

  • Decile-based derived datasets: Learn how to create decile-based derived datasets for segmentation and audience creation in Adobe Experience Platform. Using an airline loyalty scenario, it covers schema design, decile calculations, and query examples for ranking and aggregating data.
  • Customer lifetime value: Learn how to track and visualize customer lifetime value with Real-Time CDP and custom dashboards. Use these insights to develop strategies for acquiring new customers, retain existing ones, and maximize profit margins.
  • Propensity score: Learn how to determine propensity scores using machine-learning predictive models. This guide covers sending data for training, applying trained models with SQL, and predicting customer purchase likelihood.
  • Consent analysis: Learn how to analyze and track customer consent using Real-Time CDP, Query Service, and Data Distiller. This guide covers building consent dashboards, refining segmentation, tracking trends, and ensuring compliance, helping you build trust and deliver personalized experiences.
  • Fuzzy match: Learn how to perform a ‘fuzzy’ match on your Experience Platform data to find approximate matches and analyze string similarity across datasets. Follow this guide to save time and make your data more accessible. The example demonstrates how to match hotel room attributes between two travel agency datasets, showing how to efficiently match, compare, and reconcile large, complex datasets for consistency and accuracy.

The Query Service workspace with the Data Distiller examples section highlighted.

Key metrics

The key metrics section displays visualizations of important data that helps you monitor Query Service usage. For each chart, you can select the ellipsis (...) in the top right followed by View more to view either a tabulated form of the results, or download the data as a CSV file to view in a spreadsheet. For more details, refer to the View more guide.

Set a date filter

To apply a global date filter for these visualizations, select the filter icon ( A filter icon. ) and adjust the date range in the Filters dialog. Apply this filter to tailor the displayed metrics for a specific time frame and enhance the relevance of your analysis.

The Filters dialog for the key metrics charts in the Query Service Workspace.

Distiller batch queries

The Distiller batch queries chart provides a breakdown of query activity by day, highlighting the number of processed CTAS and ITAS (interactive and scheduled) queries. The chart highlights patterns, such as spikes in interactive queries on certain days and the infrequent use of scheduled queries. Use these insights to optimize performance by identifying peak activity periods, refining scheduling strategies, and balancing query execution to improve workflow efficiency and resource utilization.

The Distiller batch queries chart.

Compute hours consumed

The Compute hours consumed chart provides a day-by-day visualization of compute hours used to process Query Service operations. Use these compute hour trends to monitor resource consumption, identify high-demand periods, and optimize query execution to ensure efficient resource allocation and performance.

The Compute hours consumed chart.

Data exploratory queries

The Data exploratory queries chart displays the number of SELECT queries processed on demand each day. This visualization highlights query activity trends, such as spikes in usage on specific days, to help you understand when your data exploration efforts are most active. Use these insights to monitor query usage patterns, balance workloads, and optimize resource allocation for exploratory data analysis. This analysis ensures more efficient use of Query Service and improved planning for high-demand periods.

The Data exploratory queries chart.

Query Editor

Use the Query Editor to write and execute queries without using an external client. Select Create Query to open the Query Editor and create a new query. You can also access the Query Editor by selecting a query from the Log or Templates tabs. If you select a previously executed or saved query, the Query Editor opens and displays the SQL for your selected query.

The Queries dashboard with Create Query highlighted.

As you type in the Query Editor, the editor automatically completes SQL reserved words, tables, and field names within tables. When you have finished writing your query, select the play icon ( The play icon. ) to run the query. The Console tab below the editor shows what Query Service is currently doing, and indicates when a query has been returned. The Result tab, next to Console, displays the query results. See the Query Editor guide for more information on using the Query Editor.

The Query Editor workspace.