homepage Welcome to WebmasterWorld Guest from
register, free tools, login, search, pro membership, help, library, announcements, recent posts, open posts,
Become a Pro Member

Visit PubCon.com
Home / Forums Index / Google / Google News Archive
Forum Library, Charter, Moderator: open

Google News Archive Forum

Restricting Ports and Goolge
Will TCP/IP Filtering Affect Goolge Bot?

 5:54 pm on Mar 2, 2004 (gmt 0)

My server was compromised via SQL ports. Almost 4 terabytes stolen in a very short period.

I am now using TCP/IP filtering on all my I.P.s. I am only permitting TCP Ports: 80 & 3389.

What port does Google use and will filtering I.P.s affect my crawls.



 8:21 pm on Mar 2, 2004 (gmt 0)

"Goolge" Yes, I know typing and spelling are not my strong points!


 8:45 pm on Mar 2, 2004 (gmt 0)

Googlebot would use the default port for the protocol. In this case that would be port 80 for HTTP.
It wouldn't have any reason to access using another port unless that port was possibly part of a link it might try and follow. I'm not certain that googlebot as a user agent is even to allowed to access ports other than 80 and 443 (for https). In other words, Googlebot isn't doing a port scan, it's following links.


 3:33 pm on Mar 3, 2004 (gmt 0)

This does bring up an interesting point though.

I have administrative pages that allow a user to view stats on their site. All stats from my stat server are ported through port 8080. (ie []

Will a bot follow these? I have all ports blocked except for the ones I need, and one of them is 8080.

Unfortunately, my stat server keeps no logs on itself so there is no way to determine whether the bot is actually penatrating to the program.

Any thoughts?

Just wondering.


 3:40 pm on Mar 3, 2004 (gmt 0)

I'm pretty sure Googlebot can access ports other than the defaults for HTTP and HTTPS. Like nanocet said, it follows links. When ports other than the default are used, Google usually shows them on the SERPs.


 3:44 pm on Mar 3, 2004 (gmt 0)


That is frightening. I guess I will get busy with th robots.txt.

Global Options:
 top home search open messages active posts  

Home / Forums Index / Google / Google News Archive
rss feed

All trademarks and copyrights held by respective owners. Member comments are owned by the poster.
Home ¦ Free Tools ¦ Terms of Service ¦ Privacy Policy ¦ Report Problem ¦ About ¦ Library ¦ Newsletter
WebmasterWorld is a Developer Shed Community owned by Jim Boykin.
© Webmaster World 1996-2014 all rights reserved