Difference between Country and geo_country column in Data Feed
This article explains in detail the difference between the columns “Country” and “geo_country” in the Customer Journey Analytics data feed with an example.
Description description
Environment
- Customer Journey Analytics
- Analytics
Issue/Symptoms
Below is Sample Raw Data for such event:
Resolution resolution
The country column is the top-level domain. At the beginning of the internet, the top-level domain and the country were basically the same things, which is probably why the column is called country. There was no IP to geo info mappings at that time. If the country column in the data were to be named now, it would be called the top-level domain (or something like that, but not country).
With IP to geo info mappings that are used now, the IP of the hit is used to determine the geo_country
, geo_region
, geo_city
, etc., as well as the domain. The top-level domain (the country column) is determined from the domain. The domain, however, has become less significant in the IP geo info mappings.
In fact, the domain is no longer required to be provided when IPs are purchased/allocated. It is up to the IP block owner to determine if they provide domain info and, if so, how detailed/accurate information they provide. With many international companies now, even when domain information is provided, they will often provide the same domain for all IP blocks used worldwide (which means the top-level domain won’t match the geo_country
or the country where the IP is actually used).
So, in Adobe Analytics, domains such as au-net.ne.jp
, ocn.ne.jp
, spmode.ne.jp
, transix.jp
, etc. (those where the domain ends in .jp
) will show as Japan in the country column. While domains such as bbtec.net
, jptransit.net
, ntt.net
, etc. (those where the domain ends in .net
) will show as *304 (Network)*in the country column, those domains such as aetosjapan.com
, zscaler.com
, etc. (those where the domain ends in .com
) will show as *300 (Commercial)*in the country column, etc.
As the domain has become less significant in IP mappings, and as domains with .net
, .com
, etc. have become much more common outside of the US, using the top-level domain (country column) is no longer as valuable for determining actual country information for the origins of the hits for a visitor. The geo_country
column is much more accurate for determining the country of origin of the hits for a visitor.
One additional thing to remember, although this only applied to one of the hits with geo_country = "jpn"
in a particular instance, it can be significant, especially if traffic is from mobile devices and other situations that may change IPs mid-visit. In Adobe Analytics, to provide consistent reporting throughout a visit, for certain technology, etc. fields (country
, domain
, geo_country
, geo_city
, browser
, os
, and others), the value from the first hit of a visit is used for every hit in a visit. This can sometimes cause issues in debugging when looking only at the IP for a hit and wondering why in Analytics, a different domain, geo_city
, etc., is used than the IP would resolve to. The values from the first hit of a visit are used for all hits in the visit.