Forum Library, Charter, Moderators: Robert Charlton & aakk9999 & brotherhood of lan & goodroi

Google SEO News and Discussion Forum

    
Can I drop multiple URLs from Google's index using a wildcard?
tkchinese




msg:4581565
 6:28 am on Jun 6, 2013 (gmt 0)

I made a mistake with one of my websites, and a whole bunch of data got picked up by Google before I caught it.

I need to fix this relatively fast by asking Google to drop the indexed pages.

The URL would be something like:

domain.com/index.html?action=Abcd

Does anyone know if I can submit:

domain.com/index.html?action=*

as a single request through webmaster tools? If not, what are my other alternatives?

 

lucy24




msg:4581593
 8:31 am on Jun 6, 2013 (gmt 0)

What exactly do you need to get rid of?

page name
parameter name
some value of parameter

?

tkchinese




msg:4581609
 9:45 am on Jun 6, 2013 (gmt 0)

I want to deindex all the parameter URLs, of which there are more than 1 million.

I want to do this very carefully so that it doesn't affect the site home page, domain.com/index.html.

Can I kill only the parameter URLs through Webmaster Tools?

jakebohall




msg:4581635
 12:59 pm on Jun 6, 2013 (gmt 0)

Have you read over this page? [support.google.com...]

lucy24




msg:4581852
 6:41 pm on Jun 6, 2013 (gmt 0)

ALL parameters? Go to the "parameters" section of wmt and tell google to ignore the parameters. They will probably already be listed.

Unless those same parameters do have meaning on other pages. Then you have to proceed more carefully.

seoskunk




msg:4581855
 6:52 pm on Jun 6, 2013 (gmt 0)

You can indeed use a wildcard in robots.txt; something like what you describe could be tested in WMT. But I would suggest dropping all parameters...

user-agent: *
disallow: /*?
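
To see which URLs a rule like this would catch, here is a rough Python sketch of wildcard matching as Google documents it (`*` matches any run of characters, a trailing `$` anchors the end of the URL). The helper names `rule_to_regex` and `is_blocked` are made up for illustration, and this is not Google's actual matcher; Python's built-in `urllib.robotparser` is not used because it does not handle `*` wildcards.

```python
import re

def rule_to_regex(rule):
    # Hypothetical helper: translate a robots.txt path rule into a regex.
    # '*' becomes '.*'; a trailing '$' anchors the match at the end.
    anchored = rule.endswith("$")
    body = rule[:-1] if anchored else rule
    pattern = "".join(".*" if ch == "*" else re.escape(ch) for ch in body)
    return re.compile("^" + pattern + ("$" if anchored else ""))

def is_blocked(path_and_query, disallow_rules):
    # A URL is treated as blocked if any Disallow rule matches from the start
    # of its path (Allow rules and rule precedence are ignored in this sketch).
    return any(rule_to_regex(r).match(path_and_query) for r in disallow_rules)

rules = ["/*?"]
print(is_blocked("/index.html?action=Abcd", rules))  # parameter URL
print(is_blocked("/index.html", rules))              # home page, no query string
```

Under these assumptions the rule matches any path containing a `?`, so the plain `/index.html` is left alone. A narrower rule such as `/index.html?action=` would limit the block to that one parameter.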

tkchinese




msg:4581947
 3:53 am on Jun 7, 2013 (gmt 0)

Thanks Lucy & Jakebohall. I have already set (No URLs) in the URL parameters settings.

Now my question is:
The (No URLs) setting means Google won't crawl these URLs anymore, but will it deindex them also? Or do I have to use the URL removal tool to drop these URLs from Google?

If I have to use the URL removal tool, what URL should I provide, and will it affect my home page as well?

domain.com/index.html?action=*

lucy24




msg:4581961
 5:27 am on Jun 7, 2013 (gmt 0)

(No URL) setting means Google won't crawl these urls anymore but will it deindex also?

Other way around. It may still crawl them if it finds them, but results will be indexed together with the parameterless version.

McMohan




msg:4581964
 5:56 am on Jun 7, 2013 (gmt 0)

Go to the "parameters" section of wmt and tell google to ignore the parameters

From my experience, using Parameters handling at WMT isn't of much use if you want to ensure Google doesn't count those URLs that carry specific parameters. It is just a directive, and Google may choose to act of its own volition. More often than not, I have seen Google neglecting the directive given.

You can indeed use a wildcard in robots.txt; something like what you describe could be tested in WMT

It will only help in avoiding new URLs getting indexed, but the ones already in the index will continue to be in the index.

tkchinese, as far as I know, finding a way to add a robots noindex tag is the only workable solution, if that can be managed. Else, try your luck with parameters.
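
As a rough illustration of the noindex route, here is a minimal Python sketch that decides, per URL, whether the page template should emit a robots noindex meta tag. It assumes the unwanted URLs are exactly those carrying the `action` parameter from the original question; the function name `robots_meta_for` is hypothetical, and how the tag gets into the page depends on the site's own templating.

```python
from urllib.parse import urlsplit, parse_qs

NOINDEX_TAG = '<meta name="robots" content="noindex">'

def robots_meta_for(url):
    # Hypothetical helper: return the tag to place in <head> for this URL.
    # Assumption: only URLs with an "action" query parameter should be dropped.
    query = parse_qs(urlsplit(url).query, keep_blank_values=True)
    if "action" in query:
        return NOINDEX_TAG
    return ""  # parameterless pages such as the home page stay indexable

print(robots_meta_for("http://domain.com/index.html?action=Abcd"))
print(repr(robots_meta_for("http://domain.com/index.html")))
```

Note that Google has to be able to crawl a page to see the tag, so these URLs should not also be disallowed in robots.txt while waiting for them to drop out.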

lucy24




msg:4581981
 7:41 am on Jun 7, 2013 (gmt 0)

It will only help in avoiding new URLs getting indexed

Not even that: It will only help in avoiding new URLs getting crawled. It took me at least a year to wrap my brain around this fact, so I'm not letting go of it.

McMohan




msg:4582009
 10:23 am on Jun 7, 2013 (gmt 0)

It will only help in avoiding new URLs getting crawled


:-) I will give it to you.

tkchinese




msg:4582011
 10:31 am on Jun 7, 2013 (gmt 0)

Friends, I'm going to apply noindex. I didn't find any other solution.


All trademarks and copyrights held by respective owners. Member comments are owned by the poster.
WebmasterWorld is a Developer Shed Community owned by Jim Boykin.
© Webmaster World 1996-2014 all rights reserved