Common Logs

Description description

Experience Manager

Document will describe common log entries you’ll see and what they mean and how to deal with them.

Table of contents

Resolution resolution

GLOB warning

Sample Log Entry:

`[` Fri Jul 20 03:35:09 2018`]`  `[` W`]`  `[` pid 8300 (tid 139937910880384)`]`  /etc/httpd/conf/publish-filters.any:5: Allowing requests with globs is considered unsafe. Please consult the documentation at '' on how to use attributes method/url/query/protocol/path/selectors/extension/suffix instead.

Since dispatcher model 4.2.x, they started discouraging people from using the following type of matches in their filters file:

/0041 { /type "allow" /glob "* *.css *"   }  ## enable css

Instead, it’s better to use the new syntax like:

/0041 { /type "allow" /url "*.css" } ## enable css

Or even better to not match on a wildcard matcher at all:

/0041 { /type "allow" /extension "css" } ## enable css

Doing either of the suggested methods would get rid of that error message from the logs.

Filter Rejects


These entries don’t always show up even if rejections are happening if the log level is set too low. Set it to Info or debug to ensure you can see if the filters are rejecting the requests.

Sample Log entry:

`[` Fri Jul 20 17:25:48 2018`]`  `[` D`]`  `[` pid 25939 (tid 139937517123328)`]`  Filter rejects: GET /libs/granite/core/content/login.html HTTP/1.1


`[` Fri Jul 20 22:16:55 2018`]`  `[` I`]`  `[` pid 128803`]`  "GET /system/console/" ! - 8ms `[` publishfarm/-`]`


Understand that the dispatcher’s rules were set to filter out that request. In this case, the page attempted to be visited was rejected on purpose, and we wouldn’t want to do anything with this.

If your log looks like this entry:

`[` Fri Jul 20 17:26:47 2018`]`  `[` D`]`  `[` pid 20051 (tid 139937517123328)`]`  Filter rejects: GET /etc/designs/exampleco/fonts/montserrat-regular/montserrat-regular-webfont.eot HTTP/1.1

That’s letting us know that our design file .eot is being blocked, and we will want to fix that.

So we should look at our filters file and add the following line to allow .eot files through

/0011 { /type "allow" /method "GET" /extension 'eot' /path "/etc/designs/*" }

This would allow the file through and stop this from logging.

If you want to see what’s being filtered out you can run this command on your log file:

$ grep "Filter rejects\|\!" /var/log/httpd/dispatcher.log | awk 'match($0, /\/.*\//, m){ print m`[` 0`]`  }' | awk '{ print $1 }'| sort | uniq -c | sort -rn

Timeouts from render

Socket timeout sample log entry:

`[` Fri Jul 20 22:31:15 2018`]`  `[` W`]`  `[` pid 3648`]`  Unable to connect socket to Connection timed out
`[` Fri Jul 20 22:31:15 2018`]`  `[` W`]`  `[` pid 3648`]`  Unable to connect to any backend in farm authorfarm

This occurs when you have the wrong IP address configured in the renders section of your farm. That or the AEM instance stopped responding or listening, and the dispatcher can’t reach it.

Check your firewall rules and that the AEM instance is running and healthy.

Gateway timeout sample log entries:

`[` Fri Jul 20 22:32:42 2018`]`  `[` I`]`  `[` pid 3648`]`  "GET /favicon.ico" 502 - 54034ms `[` authorfarm/-`]`
`[` Fri Jul 20 22:35:45 2018`]`  `[` I`]`  `[` pid 3648`]`  "GET /favicon.ico" 503 - 54234ms `[` authorfarm/-`]`

This means the AEM instance had an open socket it could reach and timed out with the response. This means your AEM instance was too slow or unhealthy, and the dispatcher reached its configured timeout settings in the render section of the farm. Either increase the timeout setting or get your AEM instance healthy.

Caching level

Sample Log entry:

`[` Fri Jul 20 23:00:19 2018`]`  `[` I`]`  `[` pid 16004 (tid 140134145820416)`]`  Current cache hit ratio: 87.94 %

This means your fetching from render level vs. from cache is measured. You want to hit 80+ percent from the cache, and you should follow the help here:

To get this number as high as possible.


Even if you have your cache settings in the farm file to cache everything, you might be flushing too often or too aggressively, which can cause a lower percentage cache hit ratio to occur

Missing Directory

Sample log entry:

`[` Fri Jul 20 14:02:43 2018`]`  `[` E`]`  `[` pid 4728 (tid 140662586435328)`]`  Unable to create parent directory /mnt/var/www/author/libs/dam/content/asseteditors/formitems.overlay.infinity.json/application: Not a directory

This will typically show up when an item is being fetched while a cache clear is occurring at the same time.

That or the document root directory has bad permissions on it or the wrong SELinux file context on the folder that is denying apache from creating the new needed sub-directories.

For permission issues, look at the permissions of the document root and make sure they look similar to the following:

dispatcher$ ls -la

Vanity URL not found

Sample log entry:

`[` Thu Sep 27 17:35:11 2018`]`  `[` D`]`  `[` pid 18936`]`  Checking vanity URLs
`[` Thu Sep 27 17:35:11 2018`]`  `[` D`]`  `[` pid 18936`]`  Vanity URL file (/tmp/vanity_urls) not found, fetching...
`[` Thu Sep 27 17:35:11 2018`]`  `[` W`]`  `[` pid 18936`]`  Unable to fetch vanity URLs from remote server returned: HTTP/1.1 404 Not Found

This error occurs when you’ve configured your dispatcher to use the dynamic auto-filter to allow vanity URLs but have not finished the setup by installing the package on the AEM renderer.

To fix this, please install the vanity URL feature pack on the AEM instance and allow it to be ready by the anonymous user. Details here

A working vanity URL setup looks like this:

`[` Thu Sep 27 17:40:29 2018`]`  `[` D`]`  `[` pid 21844`]`  Checking vanity URLs
`[` Thu Sep 27 17:40:29 2018`]`  `[` D`]`  `[` pid 21844`]`  Vanity URL file (/tmp/vanity_urls) not found, fetching...
`[` Thu Sep 27 17:40:29 2018`]`  `[` D`]`  `[` pid 21844`]`  Loaded 18 vanity URLs from file /tmp/vanity_urls

Missing Farm

Sample log entry:

`[` Wed Nov 13 17:17:26 2019`]`  `[` W`]`  `[` pid 19173:tid 140542738364160`]`  No farm matches host '', selected last farm 'publishfarm'

This error indicates that from all of the farm files available in /etc/httpd/conf.dispatcher.d/enabled_farms/, they weren’t able to find a matching entry from the /virtualhost section.

The farm files match traffic based on the domain name or path with which the request came in with. It uses glob matching, and if it doesn’t match, then you’ve either not configured your farm properly, typed the entry into the farm, or have the entry missing entirely. When the farm doesn’t match any entries, it defaults to the last farm included in the stack of farm files. In this example, it was 999_ams_publish_farm.any, which is named the generic name of publishfarm.

Here is an example farm file /etc/httpd/conf.dispatcher.d/enabled_farms/300_weretail_publish_farm.any that’s been reduced to highlight the relevant parts.

Item Served from

Sample log entry:

`[` Tue Nov 26 16:41:34 2019`]`  `[` I`]`  `[` pid 9208 (tid 140112092391168)`]`  "GET /content/we-retail/us/en.html" - + 24034ms `[` publishfarm/0`]`

The page was fetched via the GET http method for the content /content/we-retail/us/en.html, and it took 24034 milliseconds. The part we wanted to pay attention to is at the very end [ publishfarm/0] . You’ll see that it targeted and matched the publishfarm. The request was fetched from render 0, which means that this page had to be requested from AEM and then cached. Now, let’s request this page again and see what happens to the log