Welcome to WebmasterWorld Guest from 50.17.5.36

Message Too Old, No Replies

I register a domain. within a couple of days Google is visiting.

     
8:35 pm on Jan 22, 2011 (gmt 0)

Junior Member

10+ Year Member

joined:Feb 27, 2004
posts:88
votes: 0


I register a domain... within a couple of days Google is visiting... I haven't told anybody... posted a link anywhere etc etc...

I know I can put up a robots file saying don't follow etc etc..

But once it is in Google... than all the million of other robots will be visiting.

This is a private domain for a very specific purpose..
nothing illegal.

But I wonder about this synergy with my domain registration and Google.

For example I have another domain... very private to me only... and the same thing happened. The problem with that domain is it is blog... and run off PHP... and as you all know.. the hackers continually are trying to exploit PHP exploits. I gather dozens of them weekly trying to hack...

I find it rather intrusive that when I register a domain...
Google is alerted...

Trust me when I am saying I am doing nothing remotely illegal... no way, no how.

There was nothing in the terms of service agreement when I register my domains about them submitting my registration.
My registration company is a rather large one.

Just because I register a domain... what gives anybody the permission to index it ?

Once again.. these are private domains... just to me personally.

edited: for below.

The blog is run off my own servers/domains not one of the mass blog sites on the net.

And... both domains are run off the same situation... that is my hosting company.

Maybe... my hosting company is parlaying information to Google... that when I create a domain they send the information to Goog... but I find that unlikely.. but maybe probable.. and still I haven't ever seen a terms of service for that.
9:12 pm on Jan 22, 2011 (gmt 0)

Senior Member

WebmasterWorld Senior Member aristotle is a WebmasterWorld Top Contributor of All Time 5+ Year Member Top Contributors Of The Month

joined:Aug 4, 2008
posts:2684
votes: 95


There are some sites like quantcast that publish domain registration information. I think googlebot may find the link to your site on those sites.
9:29 pm on Jan 22, 2011 (gmt 0)

Junior Member

10+ Year Member

joined:Feb 27, 2004
posts:88
votes: 0


Not affiliated with quantcast. Never heard of them... and I am pretty sure my company doesn't use them.. or never seen that they would have.

<snip>

I am not sure if I am allowed to post the name... so moderators edit this if necessary.

Thanks group. and mods.

(Mod note: Edited hosting company name. Thanks for your understanding, kahuna.)

[edited by: Robert_Charlton at 11:12 pm (utc) on Jan 22, 2011]

9:32 pm on Jan 22, 2011 (gmt 0)

Full Member

5+ Year Member

joined:Aug 16, 2010
posts:214
votes: 11


i think google has access to the .com zone file. As soon your domain has ns records google will find it.
9:45 pm on Jan 22, 2011 (gmt 0)

Senior Member

WebmasterWorld Senior Member jimbeetle is a WebmasterWorld Top Contributor of All Time 10+ Year Member

joined:Oct 26, 2002
posts:3292
votes: 6


Not much you can do to hide from Google as it has been a registrar for a number of years and has access to domain data.
10:42 pm on Jan 22, 2011 (gmt 0)

Senior Member

WebmasterWorld Senior Member 5+ Year Member Top Contributors Of The Month

joined:Aug 29, 2006
posts:1312
votes: 0


Just because I register a domain... what gives anybody the permission to index it?

You give indexing permission by default (unless you take steps to prevent it).

This is not a Google-specific question - several other bots are likely to have turned up before GoogleBot, within hours (not days) of the domain going live.

It has nothing to do with your hosting company or registrar, the WHOIS information is in a public database - you can use privacy services to hide your personal details, but not the fact that the domain exists.

The lesson to learn is that you need to control bot access from the very start, usually with a combination of robots.txt or meta tags (for honourable bots) and .htaccess or other exclusion methods for the really creepy ones.

...
1:21 am on Jan 23, 2011 (gmt 0)

Junior Member

10+ Year Member

joined:Feb 27, 2004
posts:88
votes: 0


Thanks once again group... I haven't been very active around here for years because I have been rather busy on other matters... case in point.......

My blog pertains to my last nine years of taking care of my alzheimers father... which is my rather personal and experiences dealing with many situations. Seeing so many attempts to hack in to the PHP side of my site(s) pisses me off. But this is not personal to me... I am sure most of you are aware of these "comment spammers" etc... the best a person can do is keep up with updates to php and their other scripts, so these malicious people can do no harm.

My thought is that if the domain had an ability to be private it should be... well I guess it "ain't" so.
Yes.. we can hide the "whois" info... but I am not worried about that. Because hiding that information is available.

Maybe this is in one of the categories where we don't want regulation to many parts of the internet... so "I" have to live with these paradoxes.

Before I set up the newest domain... I put in place scripts to catch all who enter.. well the log files are there but my scripts give me minute by minute "ease" of evaluation... and "G" was the first to make the visit.
And so I posted here...

I think to myself.. that the competition factor of indexing a site... before another search engine, gives that search engine a competitive edge. Without consideration to the owner of the site for this competitive motive.

I am probably in the smallest percentile possible... in that I don't want the site immediately index and placed in a search engine.

Considering all the work "we" try to get for placement.

I think that... well.. I shouldn't have even gotten a domain name... and that would have solved the problem in both of my situations. The blog I mentioned above... the domain registered this week.

The blog with it's domain I started years ago... and noticing the php exploit attempts... I changed the directory name so only I can see and work on it. But I still see the attempts almost daily to find exploits under the actual domain name.

The domain I set up this week... and with dropping in my snooping scripts... saw "G" find it in no time at all.

Thanks a bunch group :-)) you have helped me better understand what is going on... whether I like it or not...

I guess I can only sum it up to a famous quote by the revered writer to Surfing Magazine Don Redondo (water surfing.. you know with salt and waves)...
"Well cornflake... if you wanted to be in the water today by yourself... you should have stayed in bed"

Thanks again group and this website.
K.
3:20 am on Jan 23, 2011 (gmt 0)

Senior Member from US 

WebmasterWorld Senior Member themadscientist is a WebmasterWorld Top Contributor of All Time 5+ Year Member Top Contributors Of The Month

joined:Apr 14, 2008
posts:2910
votes: 62


I've had this happen too when I've registered some 'obscure' domain names and they weren't only .com either... One thing you can do for an 'easy login' page is use the FireFox user-agent switcher to create a unique user-agent for your browser, then you can stick a php switch in...

if($_SERVER['HTTP_USER_AGENT']!=='YourUserAgentHere') {
header('HTTP/1.1 403 Forbidden');
echo '
<html><head>
<title>403 Forbidden</title>
<meta name="robots" content="noindex,nofollow,noarchive">
</head>
<body>You Do NOT Have Permission to Access This Page</body>';
} else {
What you see goes here
}
10:08 am on Jan 23, 2011 (gmt 0)

Senior Member

WebmasterWorld Senior Member 10+ Year Member

joined:Nov 27, 2003
posts: 1642
votes: 0


I've seen Google hit newly registered domains within minutes more times than I can count - its just what they do. But - don't think that discovery crawl will be enough to be "placed in a search engine" - it takes a lot more work than just registering the domain to get traffic :)

If you really don't want your site being indexed, just put in a robots.txt entry to forbid all crawling. This won't keep out the spam crawlers, but it will keep the legit crawlers out 99.9% of the time :)
12:29 am on Jan 30, 2011 (gmt 0)

Junior Member

10+ Year Member

joined:Feb 27, 2004
posts:88
votes: 0


Sorry to revive this topic... as all has been explained.

But today I got hit by a bot called copilot.thunderstone.com

I hope I am not violating terms of service here by mentioning there name, it is a bot.
It appears to be a device driven bot... that is from their website. Maybe there is a "software" version they are running.

As mentioned this is "private" domain... complete whois anonymous .
G comes back many times a day... and now Yawhooo has made it's appearance .

There is nothing illegal going on from my end... and I know I can do robots.txt file to exclude...

But it still causes me wonder how just registering a domain brings the "world" to know it is out there.

Especially this thunderstone bot... oh well... guess I need to go out and shovel more snow from the house in place of these trival matters.

Thanks again group.. and my apologies for reviving this thread.

K.
8:45 am on Jan 30, 2011 (gmt 0)

Moderator This Forum from US 

WebmasterWorld Administrator robert_charlton is a WebmasterWorld Top Contributor of All Time 10+ Year Member Top Contributors Of The Month

joined:Nov 11, 2000
posts:11314
votes: 165


Why is Google indexing my entire web server?
http://www.webmasterworld.com/google/3396393.htm [webmasterworld.com]

Note... the Google "secret server" article is now offline, but the quoted references about public server logs and referrer logs should suffice.
2:14 pm on Jan 30, 2011 (gmt 0)

Senior Member from US 

WebmasterWorld Senior Member netmeg is a WebmasterWorld Top Contributor of All Time 10+ Year Member Top Contributors Of The Month

joined:Mar 30, 2005
posts:12677
votes: 143


There are plenty of sites out there that release lists every day of new domains that are registered, and new domains that are dropped. And those sites are picked up by Google. You can't control it. You can't stop it. It is what it is. You can put a lock on the front door (by making it password protected) You can NOINDEX it. Robots.txt only works for well-behaved bots (and there are plenty who ignore it) And the domain name will *still* be on those lists, so the fact that it exists will still be out there, even people can't see the site behind the domain. Like I said, it is what it is.
7:51 pm on Jan 30, 2011 (gmt 0)

Junior Member

5+ Year Member

joined:July 24, 2010
posts:58
votes: 0


Did you sign up for Google apps? If yes, that's the reason. I have seen the Gbot hitting new domains when I signed up for G apps, almost instantly.