homepage Welcome to WebmasterWorld Guest from
register, free tools, login, search, pro membership, help, library, announcements, recent posts, open posts,
Pubcon Gold Sponsor 2015!
Home / Forums Index / Google / Google SEO News and Discussion
Forum Library, Charter, Moderators: Robert Charlton & aakk9999 & brotherhood of lan & goodroi

Google SEO News and Discussion Forum

How to remove junk pages from google index?

5+ Year Member

Msg#: 3052482 posted 6:13 am on Aug 19, 2006 (gmt 0)

I recently changed the structure of my site.

I changed directories and page names.

Google has many old pages indexed.

I want those junk pages to be removed.

What should i do?

I have already added those directories and pages in robots.txt file.

What else i can do?

Or should i just wait for another week or two?



WebmasterWorld Senior Member tedster us a WebmasterWorld Top Contributor of All Time 10+ Year Member

Msg#: 3052482 posted 4:13 pm on Aug 19, 2006 (gmt 0)

If the urls are now 404, and you've added them to robots.txt, those urls still may show as suppemental in site: search for many, many months -- and I would suggest letting Google handle them however they want to. Trying to force things can sometimes work against you.

If you are still seeing traffic coming from Google to those old urls, you might consider what they were looking for and actually serve them some relevant content. But I suggest not worrying about seeing a "perfect" site: search result.

And going forward from here, do remember that old saying -- Cool URI's Don't Change [w3.org]. So I'd say take some time right now to consider your new url naming scheme and if it can handle future growth and change for you without going through another upheaval.


5+ Year Member

Msg#: 3052482 posted 8:46 pm on Aug 20, 2006 (gmt 0)

I had the same problem ... I took over responsibility for a site that was a mess. I rue the day that I re-organised it to be more logical and structured!

In Sitemaps, Google was reporting 404 errors on 29 pages that didn't exist anymore. I was struggling to think why Google was trying to crawl those legacy pages, so I used their "URL Removal" tool.

I've got a few of our obsolete pages listed there. It says:

2006-02-03 01:54:07 GMT :
removal of ......html

But it doesn't stop Google trying to crawl those pages though!

Just to check that I haven't got any links lurking in our code, I downloaded the entire site to my PC and did a text search for those URLs ... nothing.

However, now our complete cache in Google has gone missing!

My conclusion is that although we have no cache showing on Google results, they do have an old cache lying in some murky corner which they are using to try to index our current site.


10+ Year Member

Msg#: 3052482 posted 8:52 pm on Aug 20, 2006 (gmt 0)


Global Options:
 top home search open messages active posts  

Home / Forums Index / Google / Google SEO News and Discussion
rss feed

All trademarks and copyrights held by respective owners. Member comments are owned by the poster.
Home ¦ Free Tools ¦ Terms of Service ¦ Privacy Policy ¦ Report Problem ¦ About ¦ Library ¦ Newsletter
WebmasterWorld is a Developer Shed Community owned by Jim Boykin.
© Webmaster World 1996-2014 all rights reserved