In Google Webmaster Tools, under crawling errors, my number one 404 error is something called
www.domain.com/a
There is no such thing in my site and there is no link to it. What is wrong, what does it mean? In the site log, there is no error about 404 error pointing to /a. W3C checker says no error. What can I do?
It's not you, it has to be the Googlebot. (none of this has to do with php;
perhaps somebody can remove the php tag.)I have eight sites in my Google Webmaster Tools and two totally unrelated sites are showing this same thing. Neither site showing this issue uses PHP.
The only thing I can think of is that Google's bot is incorrectly parsing the HTML and somehow appending closing "a" tags onto the root URL. Sounds stupid but it's the only thing that can explain multiple unrelated instances of this problem.Has anyone started a thread on this in the Google Forum yet?I'm not sure what effect this will have on the root problem but I put this in my htaccess file to hopefully clear out the false 404 errors. (Edit: this is the recommended solution by Google.)
EDIT:
Reported to Google Help Forum:
http://www.google.com/support/forum
EDIT 2:
Both sites validate at the W3C and neither use PHP.
EDIT 3:
The "/a" is coming from inside jQuery v1.5.2. This is the root cause.
Within jQuery itself is this line...
EDIT 4:
Block it with the robots.txt file. (According to Google, this will only change the Webmaster Tools Crawl Error from a "404" to a "Blocked by robots.txt")
EDIT 5:
The following is a response from Google employee JohnMu on this issue in the thread I started at Google Groups. Looks like I'm going to stick with the 301 after all.
Check your Apache error logs. Or install a small PHP file which saves the
$_SERVER["HTTP_REFERER"]
like:Then create a .htaccess file to redirect calls to /a to this PHP:
Alternatively you can save your PHP on the name
/a/index.php
, that way you do not need redirect.Do 301 redirection or otherwise do URL removal request form your web master tools account.
GWT--> Site configuration--> crawl access--> URL removal request