Google crawl errors for non-existant URLs
-
I have 53 x 404 crawl errors for URLs that do not exist on my site.
I am guessing crawl errors arrise from the site map sent to Google by WP, so why are there URLs in my site map that do not exist on my site?
I have forwarded the list to support, but wonder if anyone else is having the same problem. A search of the forums did not show up anything related recently.
I know that Google does not like 404 errors and penalises sited for such, so I would like this to be rectified ASAP.
Together with my other current issue of “the tag” not being on all my pages for Quantcast to find (https://en.forums.wordpress.com/topic/tag-not-implemented-on-every-page-for-quantcast-to-read?replies=8), I am feeling a little too much like an unloved lab rat at the moment.
The blog I need help with is: (visible only to logged in users)
-
I do wonder if this is related to the introduction of the Country Statistics, as when I look at my list on the Stats page, I get a lot of “Permission denied” errors at the bottom of the list, which also makes the map scroll down out of view.
-
This is getting worse the more stats are collected and now I cannot even get to see my map as the “Permission denied” list grows and scrolls the map out of view.
Can anyone else confirm they are having this problem?
This is the Summary by Country on the new Home Page stats, not our Admin page stats.
I am still awating a response from Support, but if others are experiencing the same problem, perhaps the priority may raise a little!
-
when I look at my list on the Stats page, I get a lot of “Permission denied” errors at the bottom of the list, which also makes the map scroll down out of view. … This is the Summary by Country on the new Home Page stats, not our Admin page stats.
I don’t experience any “permissions denied” at all. I’m using Firefox 11
-
Hi TimeThief – this is you and not the “other” one? LOL
Yes, Chrome seems OK too, but the fact those permission denieds display at all links suspiciously (to me, being in IT) to the non existant URLs creating my Google crawl errors. There were all of this type:
http://teamoyeniyi.com/wiki/Serbia_and_Montenegro
http://teamoyeniyi.com/wiki/New_ZealandNow, those look suspicially linked to the new country stats…….
-
-
I checked my crawl errors at google WT also after reading this:
I have 108 errors on my blog, of URL that don’t exist.
example:
h t t p: // echopen.wordpress.com/category/zen/zazen/page/3/I also have 49 errors on my blog attached to my profile.
Hopefully someone has an explanation?
or is there away to submit a new google map if necessary? -
Google are also crawling less pages per day on my site – don’t know about yours.
That won’t help page rank much.
-
I have this problem with crawl errors too, and URL’s that don’t exist, but also on the Google Webmaster Tools page everything is in another language. Either German or Dutch for some odd reason then it will suddenly go to English. Has Google lost it’s mind? All that seems to crawl my site from Google is Google Web Preview (not Google Bot) and it uses different IP’s from different locations (eg. London and America), making it seem like a ‘real’ person, but the browser is always Google Web Preview. Does anyone know what that is all about?
-
Has your site been properly authenticated with Google Webmaster Tools? I assume it has, because you have access to the Tools page for your site.
http://en.support.wordpress.com/webmaster-tools/
Sometimes I see chinese characters flash up, but always mine is in English.
This worries me because of the possible impact on site reputation with the search engines.
-
Yes it’s been authenticated. It was showing my entire blog as a 404 earlier but Google Bot has since crawled 5 times tonight and ‘ismyblogworking’ doesn’t detect any problems I don’t know what’s going on but something is definitely odd with Google at the moment, and the ‘foreign’ language that then switches to English in WebMaster tools is a mystery!
-
@teamoyeniyi
@dlager
OMG! I’m afraid to even look at my Google Webmasters Account crawl errors now. :( -
Sorry I made the leap on cause without being clear earlier TimeThief. Just the links and the timing were connected. Being qualified in the area of systems development, I see those things.
-
Did either of your sites have infinite scroll on them at any point?
I have been checking my crawl errors three or four times a week since I moved onto WordPress.COM 18 months ago – most of the crawl errors I see so far make sense (Google refuses to drop a test blog I had on the site and some other problems) – the number of pages etc. that are crawled bounce around but the bounce has been consistent.
Please don’t get me started on the over 1,000 crawl errors I have had to fight over the last 18 months – if WordPress.COM allowed us to exclude old sub directories things would have been easier.
-
No Auxclass. Thankfully, Traction does not have infinite scrolling.
My crawl errors started with the introduction of the country stats and all of them have the /wiki/country_name after my domain name. Seems a little too co-incidental to me!
I checked last night and I still have the same number of crawl errors, although the number of countries I am read from has grown. So it seems as if MAYBE they spotted the problem and fixed it going forward, but the crawl errors generated during the initial deployment are still active.
-
No I have not had infinite scroll.
I did remove a couple tags that were duplicate names of categories which would certainly cause the errors.
But that was months ago, should have only been a few errors, and it does not include the link I gave above.I just hope staff is aware of it.
I may contact support next week, if there is no answer in this thread. -
I already contact Support, the day I started this thread. So far, no response. I think they must have quite a backlog, with everything that is going on.
Actually, the Quantcast tag issue I raised (link in first post) is actually a bigger problem, I believe, as far as measuring traffic stats is concerned.
-
- The topic ‘Google crawl errors for non-existant URLs’ is closed to new replies.