Having correct data to base marketing decisions on is essential, and Google Analytics provides that data. Unfortunately, some marketers operate dishonestly, and their referrer spam pollutes your analytics data. Referrer spam is essentially bot traffic sent to your website. These bots have different purposes, but the result is the same: incorrect data. These bots will:
- increase your bounce rate (how many users come to your site and leave almost immediately)
- inflate your overall sessions
- decrease your goal conversion rate
- and more.
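To make the distortion concrete, here is a minimal arithmetic sketch. All traffic numbers are hypothetical and chosen only for illustration; it assumes every spam session bounces and none completes a goal.

```python
def rates(sessions, bounces, conversions):
    """Return (bounce_rate, conversion_rate) as fractions of total sessions."""
    return bounces / sessions, conversions / sessions

# Hypothetical real traffic: 1,000 sessions, 400 bounces, 50 goal completions.
real = rates(1000, 400, 50)

# Hypothetical spam: 250 bot sessions that all bounce and never convert.
with_spam = rates(1000 + 250, 400 + 250, 50)

print(f"bounce rate:     {real[0]:.1%} -> {with_spam[0]:.1%}")  # 40.0% -> 52.0%
print(f"conversion rate: {real[1]:.1%} -> {with_spam[1]:.1%}")  # 5.0% -> 4.0%
```

Nothing about the real visitors changed, yet both metrics look worse once the bot sessions are counted.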
To combat spam, there are two simple filters you can put in place: a crawler spam filter and a ghost spam filter. We’ll go over how to implement each of them in turn.
Crawler Spam Filter
- Go to your Admin tab
- Under the View column, select Filters
- Click on New Filter
- Enter Crawler Spam Filter as name
- Configure the filter:
  - Filter Type: Custom, with Exclude selected
  - Filter Field: find and select Campaign Source
  - Filter Pattern text box: enter the Crawler Spam expression
- Hit Save
- Repeat for each Crawler Spam Expression, giving each filter a name such as “Crawler Spam Filter 2” and so on
Crawler Spam Expressions:
The following link should take you to the section that lists the crawler spam expressions. As of July 6, 2018, there are 5 expressions listed on their website.
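The effect of such a filter can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the expression below is a made-up placeholder, not one of the published crawler spam expressions, and the sketch assumes the filter pattern is matched anywhere in the Campaign Source value, ignoring case.

```python
import re

# Hypothetical placeholder expression, NOT one of the published ones.
CRAWLER_PATTERN = re.compile(
    r"spam-referrer\.example|free-traffic\.example", re.IGNORECASE
)

def keep_hit(campaign_source):
    """Exclude filter: drop the hit if the pattern matches anywhere in the source."""
    return CRAWLER_PATTERN.search(campaign_source) is None

# Hypothetical Campaign Source values.
sources = ["google", "newsletter", "spam-referrer.example", "www.free-traffic.example"]
kept = [s for s in sources if keep_hit(s)]
print(kept)  # ['google', 'newsletter']
```

Each published expression would get its own filter, which is why the steps above are repeated once per expression.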
Ghost Spam Filter
If the client has an established Analytics property:
- Go to the Reporting section in Analytics and set the time-frame calendar to 1 year (if the property has less than a year of data, select everything).
- On the left-hand sidebar select Audience.
- Expand Technology and select Network.
- At the top of the report, make sure you select Hostname (by default, Service Provider is selected).
- You should have at least one valid hostname, which is your main domain. The rest will depend on the configuration of your site and all the services where you added your tracking code (UA-XXXXXX-1).
- Make a list of your valid hostnames. If you want to exclude any non-spam hostnames from your reports DON’T add them to the list.
Note: Spammers will often use known services such as amazon.com as a hostname to trick you. If you don’t have any of your services or GA code on a site, it shouldn’t be a hostname.
- Create a regular expression (REGEX) that matches all of them. Make sure you understand REGEX before you build it.
It may look something like this: sebomarketing|translatingservice|webcaceservice
It is crucial that you add all the relevant hostnames. Otherwise, you may lose out on valid data.
- Go to the Admin tab and select the View where you want to apply your filters
- Select Filters in the last column, “View”.
- Select New Filter
- Select Create New Filter and enter Include Valid hostnames as a name.
- In Filter Type select Custom
- Make sure to choose INCLUDE and select Hostname from the drop-down.
- Paste the Hostname REGEX that you built previously into the Filter Pattern field.
- Select Verify this filter. Make sure you are not filtering out any valid hostnames and save.
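Before saving, it can help to check the expression against your hostname list offline. The sketch below uses the example expression from above; the hostname list is hypothetical, and the sketch assumes the pattern is matched anywhere in the hostname, ignoring case.

```python
import re

# Example expression from the steps above.
HOSTNAME_PATTERN = re.compile(
    r"sebomarketing|translatingservice|webcaceservice", re.IGNORECASE
)

def is_valid_hostname(hostname):
    """Include filter: keep the hit only if the pattern matches the hostname."""
    return HOSTNAME_PATTERN.search(hostname) is not None

# Hypothetical hostnames as they might appear in the Network report.
report_hostnames = [
    "www.sebomarketing.com",
    "sebomarketing.com",
    "translatingservice.com",
    "amazon.com",
    "(not set)",
]

kept = [h for h in report_hostnames if is_valid_hostname(h)]
dropped = [h for h in report_hostnames if not is_valid_hostname(h)]
print("kept:", kept)
print("dropped:", dropped)
```

Because the pattern is not anchored, any hostname that merely contains one of the names will also pass, so it is still worth reviewing the results of Verify this filter.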
If you are setting up a new client without Analytics:
- Skip going to the Network Hostname section
- Set up the Valid Hostname Filter exactly as described above
- In the Filter Pattern field add a very simple expression that contains the domain without the “www” or “.com” and googleusercontent. If you were doing it for sebomarketing.com it would look like this: sebomarketing|googleusercontent
- Set a calendar event for a couple of months in the future to review the non-filtered view and add any applicable hostnames to the valid hostname filter in the filtered view.
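If you set up many new clients, the starter expression can be generated from the domain. This is a rough sketch: `starter_pattern` is a hypothetical helper, and it assumes that stripping a leading “www.” and the last dot-separated label (such as “.com”) is enough, which will not hold for every domain.

```python
import re

def starter_pattern(domain):
    """Build a simple include expression from a client's domain (hypothetical helper)."""
    name = domain.lower()
    if name.startswith("www."):
        name = name[len("www."):]
    # Drop the last label, e.g. ".com"; multi-part endings need manual cleanup.
    name = name.rsplit(".", 1)[0]
    return f"{re.escape(name)}|googleusercontent"

print(starter_pattern("www.sebomarketing.com"))  # sebomarketing|googleusercontent
```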