Forum Moderators: Robert Charlton & goodroi

Message Too Old, No Replies

Webmaster tools reporting googlebot access to wordpress login pages.

         

Sgt_Kickaxe

4:37 pm on Aug 5, 2011 (gmt 0)



In Google webmaster tools, under parameters in the Site Config section, Google is reporting a lot of 'redirect_to' parameters. Clicking on edit and then on show example urls confirms that googlebot is actively seeing my sites login pages.

I'm not worried that Google is actually getting into my admin panel, without being logged in that's not possible, but I don't want these pages on my link graph or as part of any metrics used to evaluate my site, they are strictly admin pages not designed for users.

This is what GWT reports...

Sample of recently crawled URLs
http://www.example.com/wp-login.php?redirect_to=


So how would I best go about removing these from Googles history and avoiding any more 'recent crawls' to them?

netmeg

6:40 pm on Aug 5, 2011 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Block them in robots.txt and then remove them with the URL removal tool?

I think I have everything with a question mark in the string blocked in robots.txt for WordPress, because I couldn't think of an instance where I'd want it in Google or Bing.

Sgt_Kickaxe

2:16 am on Aug 6, 2011 (gmt 0)



Google's been good about not including them in search results but I don't even want the pages included in my 'link graph' as existing, they don't exist for regular users. Is robots.txt enough to drop them right out of your link graph? You can add any page to robots.txt but that doesn't mean they don't aquire pagerank or get some from other pages. I have NO idea where Google would have found a link to follow to get to these pages...

tedster

3:49 am on Aug 6, 2011 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



If Google now has the URLs, then they will never forget them. I think you just need to relax about the situation because it's now a done deal. Settle for keeping those URLs out of the index, because you cannot control things any further than that.

indyank

4:32 am on Aug 6, 2011 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



If Google now has the URLs, then they will never forget them


Unless their server having that db crashes and they loose data.But who knows google might also have a back up of the whole internet :)

Tedster, that is precisely what I wanted to say. Yes, one should just be happy if google doesn't show them in their index.

leadegroot

9:23 am on Aug 7, 2011 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



A quick search showed me there is an existing wordpress plugin to no index the login page.
Its called Noindex Login (I'm unaffiliated)
Don't know if it works on the current versions, but worth a shot.
Short of bot-detection and not displaying the login option for them, I think no indexing is probably the best bet :)