Paths in JS variables affecting SEO
Google's Webmaster Tools, run against xfm.co.uk, recently reported 14,000+ 404 errors on the domain.
90% of these were puzzling URLs: half with a subdirectory that doesn't exist on the site, and half with paths to the (third-party) ad server appended to the site domain.
It appears that Googlebot is indexing paths that it finds as JavaScript variables in the host document.
Adverts and tracking on the site are handled by two third-party services and copious amounts of JS.
Omniture's SiteCatalyst (formerly WebSideStory's HBX) script includes a "category" variable in the HTTP request to their server, with multiple levels of categories separated by forward slashes. For example:
//MULTI-LEVEL CONTENT CATEGORY
hbx.mlc="/XFM/Yo+Want+Rock+You+Got+It++The+Darkness+Cometh";
Googlebot is then trying to retrieve the URL:
http://www.xfm.co.uk/XFM/Yo+Want+Rock+You+Got+It++The+Darkness+Cometh
which returns a 404 error and, presumably, affects the search ranking of the site. This mlc value is unique to every page on the site, which presumably means that for every valid page in Google's index there's a corresponding 404 error :(
Likewise with the ad code, which likes to generate script includes client-side using document.write (*sigh*). In amongst the JavaScript is the part of the path that's common to all the script includes on the page:
var sitearea = "/SITE=xfm/AREA=Home/GUID=5911643/pageid=5911643/LOGIN=0/CODE1=/CODE2=/CODE3=/";
Resulting in more bad URLs:
http://www.xfm.co.uk/SITE=xfm/AREA=Home/GUID=5911643/pageid=5911643/LOGIN=0/CODE1=/CODE2=/CODE3=/
Given that the GUID value is unique not just to every page but to every HTTP request, that's even more 404 errors being indexed by Google :(
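To make the suspected mechanism concrete, here is a minimal sketch of how a crawler might turn those two variables into the bad URLs above. This is guesswork, not Googlebot's actual algorithm: the function name guessUrlsFromScript and the regular expression are hypothetical, and the only assumption is that any quoted string starting with "/" gets treated as a site-relative path.

```javascript
// Hypothetical sketch only: we don't know how Googlebot really does this.
// Assumption: any quoted string literal that starts with "/" is treated
// as a site-relative path and resolved against the page's URL.
function guessUrlsFromScript(scriptText, pageUrl) {
  // Matches "..." or '...' where the content starts with a slash
  // and contains no quotes or whitespace.
  const pathLiteral = /(["'])(\/[^"'\s]*)\1/g;
  const urls = [];
  let match;
  while ((match = pathLiteral.exec(scriptText)) !== null) {
    // new URL(relative, base) resolves the path the way a browser would.
    urls.push(new URL(match[2], pageUrl).href);
  }
  return urls;
}

// The two inline snippets quoted in this post.
const inlineScript = [
  'hbx.mlc="/XFM/Yo+Want+Rock+You+Got+It++The+Darkness+Cometh";',
  'var sitearea = "/SITE=xfm/AREA=Home/GUID=5911643/pageid=5911643/LOGIN=0/CODE1=/CODE2=/CODE3=/";'
].join("\n");

const guessed = guessUrlsFromScript(inlineScript, "http://www.xfm.co.uk/");
console.log(guessed.join("\n"));
```

Run against the two snippets, this produces exactly the two 404 URLs listed earlier, which at least fits the theory that string literals with a leading slash are what's being picked up.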