Page is a not externally linkable
- Search Engines
-- Sitemaps, Meta Data, and robots.txt
---- Crawl Errors - How to manage them effectively?


shaunm - 6:59 am on Nov 16, 2012 (gmt 0)


@phranque
Thanks and sorry for delayed reply.

"you see?" or "google reports?"

This is what happens.
I am logged into my GWT account. I clicked on one of my websites profile and got to its dashboards.

When I go to 'health' -> 'Crawl errors', there are these tabs 'sever error' 'soft 404' 'Access denied' 'Not found' 'Not followed' 'Others'

I know about all other tabs except this 'Not followed' thing, so when I mouse over to that tab I am seeing around 35k URLs reported as 'Not followed'. As you know, Google only shows 1000 of them in that page.

There are 3 columns in that page: URL, Response Code, Detected.

All the URLs(1000) are appearing with a response code of 301.

Now when i click on each URL, another window opens with tabs such as 'Error details' 'In sitemaps' 'Linked from'. Above all, the complete URL.

It also shows a message as follows
There was a problem with active content or redirects. More info.
Google couldn't follow your URL because it redirected too many times.


But when I check personally, there isn't multiple redirects but only a single redirect.

Among those 1000 URLs reported as 301, there are URLs that doesn't exist in my site. I mean it looks more of a URL parameters.

If the actual URL is example.com/service/review.aspx, the reported URL is example.com/service/review.aspx?~99566565/page2/page3/page4=

When I copy it and paste it in any web browser, it redirects 2 times and finally land in a clean short URL. At the same time when I do 'fetch as google' it gets a 'Success' status. How come Google reports a URL as 301, at the same time when I fetch as Google it shows 'Success'?!? But the same URL is redirect to another URL when I copied it and paste in a Web browser?

Thanks for helping me out.


Thread source:: http://www.webmasterworld.com/robots_txt/4517482.htm
Brought to you by WebmasterWorld: http://www.webmasterworld.com