
Google News Archive Forum

    
Google crawling Internet chat?
punta

10+ Year Member



 
Msg#: 18094 posted 11:43 am on Nov 5, 2003 (gmt 0)

A number of people on the web have reported seeing a Google bot of sorts on their Internet Relay Chat servers, joining and parting channels.

It seems that Google might be archiving the contents of public channels with the aim of making them searchable, in the same way that you can search newsgroup posts via Google Groups.

 

dcheney

WebmasterWorld Senior Member 10+ Year Member



 
Msg#: 18094 posted 12:29 pm on Nov 5, 2003 (gmt 0)

I'm a regular op on a couple of EFnet channels - haven't seen or heard of anything like that.

punta

10+ Year Member



 
Msg#: 18094 posted 12:34 pm on Nov 5, 2003 (gmt 0)

It seems that they're testing things out on smaller servers before launching it on a larger scale. If they have bugs in the software, it's better to cause a nuisance on a small network than something as large as EFnet.

There's more information here:

[manero.org...]

abates

10+ Year Member



 
Msg#: 18094 posted 7:59 pm on Nov 5, 2003 (gmt 0)

A chat room I hang out in had a similar problem a while back with a site which sent a bot in and was echoing everything said to a web page without telling any of us. Once it was rumbled, it started morphing nicks to evade our bans... (wasn't Googlebot, I hasten to add)

I hope Google isn't going to start archiving IRC chat sessions, personally...

punta

10+ Year Member



 
Msg#: 18094 posted 9:24 am on Nov 6, 2003 (gmt 0)

Google wouldn't try to evade any bans. It'd always stick to the same IP range. It follows robots.txt directives on web sites faithfully, and it also honors the no-archive header (X-No-Archive) on Usenet, so there's no reason to think that it wouldn't obey bans politely.
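For anyone who hasn't seen how a polite crawler consults robots.txt, here is a minimal sketch using Python's standard `urllib.robotparser`. The robots.txt content, the `example.com` URLs, and the `is_allowed` helper are all made up for illustration; this says nothing about how Googlebot itself is implemented, and IRC has no direct equivalent of robots.txt.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content, invented for this example.
ROBOTS_TXT = """\
User-agent: Googlebot
Disallow: /private/

User-agent: *
Disallow: /
"""


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if robots_txt permits user_agent to fetch url."""
    parser = RobotFileParser()
    # parse() takes an iterable of lines, so no network access is needed.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    for agent, url in [
        ("Googlebot", "http://example.com/index.html"),
        ("Googlebot", "http://example.com/private/log.html"),
        ("OtherBot", "http://example.com/index.html"),
    ]:
        print(agent, url, is_allowed(ROBOTS_TXT, agent, url))
```

A well-behaved bot runs a check like this before every fetch and simply skips anything that comes back disallowed.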

Does anyone think that IRC chat logs are useful sources of information? I know interesting things are sometimes said, and I've seen computer problems solved quite frequently in IRC chats, but the vast majority of conversations are just babble, not to mention the l33t speak!

abates

10+ Year Member



 
Msg#: 18094 posted 9:58 am on Nov 6, 2003 (gmt 0)

I've seen some interesting IRC chats with celebrities... but they've all been put on the web anyway. :)

punta

10+ Year Member



 
Msg#: 18094 posted 10:11 am on Nov 6, 2003 (gmt 0)

There's an interesting article here [dir.salon.com] about a start-up a few years ago that tried to archive chat rooms, albeit temporarily. They failed miserably, but maybe a company of Google's stature will be able to pull it off. There are certainly some interesting ideas in that article.

abates

10+ Year Member



 
Msg#: 18094 posted 11:46 pm on Nov 6, 2003 (gmt 0)

Punta: that was the site we had trouble with. We weren't well pleased.

pleeker

WebmasterWorld Senior Member 10+ Year Member



 
Msg#: 18094 posted 12:47 am on Nov 7, 2003 (gmt 0)

Does anyone think that IRC chat logs are useful sources of information? I know interesting things are sometimes said, and I've seen computer problems solved quite frequently in IRC chats, but the vast majority of conversations are just babble, not to mention the l33t speak!

I'd think IRC chats are at about the same level as guestbooks in terms of sources of valuable information. Maybe that's just me....

punta

10+ Year Member



 
Msg#: 18094 posted 11:11 am on Nov 7, 2003 (gmt 0)

Maybe they won't be logging chats. Maybe they're doing something else. Has anyone got any ideas about what Google might have up its sleeve?

Perhaps they're looking for links mentioned in IRC?

BlueSky

10+ Year Member



 
Msg#: 18094 posted 11:57 am on Nov 7, 2003 (gmt 0)

I think it depends on the topic. Many IRC channels/chats in certain areas (tech-oriented, celeb interviews, etc.) have a very low noise-to-signal ratio. They are far more info-rich than quite a number of websites out there. However, I think people would get very mad if their conversations were indexed and placed in a public search engine without their knowledge. This would lead to many requests for such pages to be removed.

Perhaps this is something they are considering offering as a commercial service? I could see a large company wanting this type of info captured and indexed, say, for their tech support, exec chats, etc.

BlueSky

10+ Year Member



 
Msg#: 18094 posted 7:01 pm on Nov 7, 2003 (gmt 0)

Coincidence or do these guys hang around here?
[theregister.co.uk...]

Maybe Google is doing this as a project for the US government. I wouldn't put anything past Bush and Ashcroft.
