homepage Welcome to WebmasterWorld Guest from 54.167.96.124
register, free tools, login, search, pro membership, help, library, announcements, recent posts, open posts,
Become a Pro Member

Home / Forums Index / Google / Google SEO News and Discussion
Forum Library, Charter, Moderators: Robert Charlton & aakk9999 & brotherhood of lan & goodroi

Google SEO News and Discussion Forum

    
Site crawled without submitting to Google, no links, nothing.
darshanaasodekar




msg:3540948
 8:17 am on Jan 5, 2008 (gmt 0)

How is it possible that without indexing a site it is crawled by Google? My site is not submitted to any of the search engines. This site is not having any incoming links, still it i crawled by Google and hence cached...
Why does this happen?
What should I do now, if I don't want Google to crawl this site again?

 

Robert Charlton




msg:3540990
 11:19 am on Jan 5, 2008 (gmt 0)

Hi darshanaasodekar - This thread might be of help...

Why is Google indexing my entire web server?
google indexing
[webmasterworld.com...]

The consensus is that you need to block your site on the server if it's under development or if you don't want it indexed. Lack of inbound links is not enough, as publicly available server logs are likely to get spidered by Googlebot. Use password protection or the no-index robots meta tag on pages you want to block.

ecmedia




msg:3541053
 2:43 pm on Jan 5, 2008 (gmt 0)

"This site is not having any incoming links" is not always possible to say with certainty. Maybe you didn't put any links but others might without your knowledge and it is not always possible to find these links through a search engine index. The biggest mistake you committed is not used the robots.txt file to block Google. No damage done, though. You can still tell G to go away and eventually it will drop all pages.

dan404




msg:3541144
 5:39 pm on Jan 5, 2008 (gmt 0)

Do you have the Google Tool Bar installed showing you page rank?
If so that's one way to surely get Googlebot's attention.
You need to use your robots.txt to block bots for sure, or throw a redirect to an error page until you are ready to open for business. I wouldn't recommend the error page though unless you want to throw Google for a loop, it may never come back!

To block all (good) bots just make a robots.txt file and enter:

User-agent: *
Disallow: /

drewls




msg:3541317
 9:48 pm on Jan 5, 2008 (gmt 0)

Didn't Google become a registrar a while back? This would give them access to lists of new domain registrations, so they could go out and check them out regardless of links or submissions.

darshanaasodekar




msg:3541890
 5:58 am on Jan 7, 2008 (gmt 0)

Thanks all. Thanks for robots.txt. I have included meta tags in my site for no follow and have done the needful. Thanks for the help...:)

Global Options:
 top home search open messages active posts  
 

Home / Forums Index / Google / Google SEO News and Discussion
rss feed

All trademarks and copyrights held by respective owners. Member comments are owned by the poster.
Home ¦ Free Tools ¦ Terms of Service ¦ Privacy Policy ¦ Report Problem ¦ About ¦ Library ¦ Newsletter
WebmasterWorld is a Developer Shed Community owned by Jim Boykin.
© Webmaster World 1996-2014 all rights reserved