Set up and manage your Content AI Sources
This guide walks you through setting up Content AI Sources in Cloud Manager - from meeting prerequisites to creating a content source and confirming it is indexed and available.
Prerequisites prerequisites
Before you begin, ensure the following conditions are met:
- You have an active Cloud Manager program with at least one AEM as a Cloud Service environment.
- You hold the System Administrator role in Admin Console for the program.
- The environment product profile has been provisioned in Adobe Admin Console, see Set Up an Adobe Developer Console Project.
Step 1 - Open the Content AI Configuration Tab open-tab
-
Sign in to Cloud Manager and select your program.
-
From the Program Overview, locate the Environments section and select the environment you want to configure.
-
On the environment detail page, select the Content AI Configuration tab.
Step 2 - Create a Content AI Source create-source
A content source defines the website that Content AI crawls and indexes.
-
On the Content AI Configuration tab, select Create Source.
-
In the Create/Add new Content AI Source dialog, fill in the fields:
table 0-row-2 1-row-2 2-row-2 3-row-2 4-row-2 5-row-2 Field Description Content AI Configuration Name A unique identifier for this source (for example, my-site-index). Cannot be changed after creation.Description (Optional) A short description of the content source. Website address The root URL of the website to crawl (for example, https://www.example.com/).Exclude URLs (Optional) URL patterns to skip during crawling. Refresh frequency How often Content AI re-crawls the source: Weekly, Daily, Daily 4×, 60 Min, or 15 Min.
-
Select Create Source.
Step 3 - Trigger Acquisition trigger-acquisition
After the source is created, its status is New. Run an initial acquisition to start indexing.
-
In the source list, select the more actions (…) icon next to your source, then select Trigger acquisition.
-
In the Trigger Acquisition dialog, review the source details - Content source, Last run, and Next scheduled run - and select Trigger.
Step 4 - Monitor Indexing Status monitor-status
After acquisition starts, the source status updates in real time.
Wait for the status to reach Available before searching the index or testing the API.
Step 5 - Search Indexed Content search-content
Once the source status is Available, you can run search queries directly from Cloud Manager to verify that content has been indexed correctly.
-
In the source list, select Search next to your source.
-
Enter a query in the search field. Results show a list of matching items with a match score and content type (for example, PAGE or PDF). Selecting a result opens a preview on the right.
Modify or Delete a Source modify-source
To update a source configuration after it has been created:
-
In the source list, select the more actions (…) icon next to the source, then select Edit.
-
In the Modify Content AI Source dialog, update the Description, Website address, Exclude URLs, or Refresh frequency as needed. The Content AI Configuration Name is read-only and cannot be changed.
-
Select Save to apply the changes, or select Delete in the lower-left of the dialog to remove the source entirely.
note warning WARNING Deleting a source is permanent. All indexed content for that source is removed and can no longer serve search queries.
The source list updates to reflect your changes. If you deleted the source, it no longer appears in the list.
Next Steps next-steps
- Set Up an Adobe Developer Console Project - Create the ADC project and credentials you need to call the API.
- Content AI API reference - Query your indexed content using semantic, fulltext, or hybrid search endpoints.
Troubleshooting troubleshooting
- Source stays in Indexing for an extended period. Retry the acquisition from the (…) menu. If the status does not advance after a second run, verify that the Website address is publicly reachable and that the Exclude URLs patterns do not filter out every page.
- Source moves back to New after a run. The crawler could not fetch any pages from the configured root URL. Confirm the URL responds with
200 OKand that the site is not blocking automated requests. - Search returns no results for an Available source. Indexing succeeded, but no content matched the query. Try a broader query or check that the crawled URLs include the pages you expect.