Msg#: 3607975 posted 1:18 pm on Mar 22, 2008 (gmt 0)
I have a well-indexed forum, but have noticed that G is picking up all the links to "report this thread", which are useless and also lead to some 404 errors.
I want to block all urls that start /foros/?func=report
But allow any other pages that are under /foros/
If I get this wrong, my forums will drop out of G's index, so I need to make sure I do this carefully! Could anyone advise me on the correct disallow statement that would SPECIFICALLY block the "func=report" urls ONLY?
Msg#: 3607975 posted 5:50 pm on May 12, 2008 (gmt 0)
One more option is to redirect the 404 URLs that you consider unnecessary to pages you do want visitors to reach, because using the robots.txt file to stop 404 errors is not a good idea.
I advise you to use a 301 redirect on all URLs that return 404 errors.
Msg#: 3607975 posted 1:00 pm on May 13, 2008 (gmt 0)
To block Googlebot from accessing those pages, add this line to the robots.txt file: Disallow: /*func=report$
bilalseo does make a good point. It is better to identify 404 errors and, if possible, 301 redirect them to the new file. If there is no new file location and the information does not exist on your site, you should serve a 404 error page with helpful links for the users. This will help make it clear to the users what is going on and what options are available to them.
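To see which URLs a wildcard rule actually catches before relying on it, a small matcher can be sketched in Python. This is only a rough sketch, assuming Google's documented robots.txt extensions ("*" matches any run of characters, a trailing "$" anchors the end of the URL). The function name rule_matches and the sample URLs are made up for illustration, and it ignores Allow lines and rule precedence.

```python
import re

def rule_matches(pattern: str, path: str) -> bool:
    """Return True if a robots.txt path pattern matches the URL path.

    Assumes Google-style wildcards: '*' is any sequence of characters,
    and a trailing '$' means the URL must end there.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape each literal piece, then join the pieces with "match anything".
    regex = ".*".join(re.escape(part) for part in pattern.split("*"))
    if anchored:
        regex += "$"
    # Rules match from the start of the path, so re.match is the right call.
    return re.match(regex, path) is not None

# Hypothetical sample URLs, not taken from the real forum.
samples = [
    "/foros/?func=report&id=42",
    "/foros/?func=report",
    "/foros/?func=view&id=42",
    "/foros/",
]

for rule in ("/*func=report$", "/*func=report", "/foros/?func=report"):
    blocked = [url for url in samples if rule_matches(rule, url)]
    print(rule, "->", blocked)
```

With these samples, the rule ending in "$" only catches the URL that ends exactly in func=report, while the other two also catch the URL with extra parameters after it. None of the three touches the other /foros/ pages.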
Msg#: 3607975 posted 5:43 pm on May 16, 2008 (gmt 0)
Goodroi, I want to ask one thing: if the 404 pages remain cached and have been there for a long time, what is the right method to remove them from the cache, and what steps should I take? I read earlier in the Google webmaster area about removal or permanent removal of directories, pages and URLs from Google, but that did not help me as much as I had hoped.
Msg#: 3607975 posted 5:59 pm on May 16, 2008 (gmt 0)
Looking at the OP, the request was to block "urls that start /foros/?func=report" so I assume we need to drop the $ at least. And doesn't Google allow question marks in disallow? So, the simplest would be: