homepage Welcome to WebmasterWorld Guest from 54.205.254.108
register, free tools, login, search, pro membership, help, library, announcements, recent posts, open posts,
Become a Pro Member
Home / Forums Index / Google / Google News Archive
Forum Library, Charter, Moderator: open

Google News Archive Forum

    
Google crawling Internet chat?
punta




msg:200084
 11:43 am on Nov 5, 2003 (gmt 0)

A number of people on the web have reported seing a google bot of sorts on their Internet Relay Chat servers, joining and parting channels.

Is seems that google might be archiving the contents of public channels with the aim of making them searchable, in the same way that you can search newsgroup posts via Google groups.

 

dcheney




msg:200085
 12:29 pm on Nov 5, 2003 (gmt 0)

I'm a regular op on a couple of EFnet channels - haven't seen or heard of anything like that.

punta




msg:200086
 12:34 pm on Nov 5, 2003 (gmt 0)

It seems that they're testing things out on smaller servers before launching it on a larger scale. If they have bugs in the software, it's better to cause a nuisance on a small network than something as large as EFnet.

There's more information here:

[manero.org...]

abates




msg:200087
 7:59 pm on Nov 5, 2003 (gmt 0)

A chat room I hang out in had a similar problem a while back with a site which sent a bot in and was echoing everything said to a web page without telling any of us. Once it was rumbled, it started morphing nicks to evade our bans... (wasn't Googlebot, I hasten to add)

I hope Google isn't going to start archiving IRC chat sessions, personally...

punta




msg:200088
 9:24 am on Nov 6, 2003 (gmt 0)

Google wouldn't try and evade any bans. It'd always stick to the same IP range. It follows robots.txt directives in web pages faithfully and also the no archive header in USENET, there's no reason to think that it wouldn't obey bans politely.

Does anyone think that IRC chat logs are useful sources of Information? I know sometimes interesting things are said, I've seen computer problems solved quite frequently in IRC chats, but a vast majority of conversations are just babble, not to mention the l33t speak!

abates




msg:200089
 9:58 am on Nov 6, 2003 (gmt 0)

I've seen some interesting IRC chats with celebrities... but they've all been put on the web anyway. :)

punta




msg:200090
 10:11 am on Nov 6, 2003 (gmt 0)

There's an interesting article here [dir.salon.com] about a start-up a few years ago that tried to archive chat rooms, albeit temporarily. They failed miserably, but maybe a company of Google's stature will be able to pull it off. There's certainly some interesting ideas in that article.

abates




msg:200091
 11:46 pm on Nov 6, 2003 (gmt 0)

Punta: that was the site we had trouble with. We weren't well pleased.

pleeker




msg:200092
 12:47 am on Nov 7, 2003 (gmt 0)

Does anyone think that IRC chat logs are useful sources of Information? I know sometimes interesting things are said, I've seen computer problems solved quite frequently in IRC chats, but a vast majority of conversations are just babble, not to mention the l33t speak!

I'd think IRC chats are at about the same level as guestbooks in terms of sources of valuable information. Maybe that's just me....

punta




msg:200093
 11:11 am on Nov 7, 2003 (gmt 0)

Maybe they won't be logging chats. Maybe they're doing something else. Has anyone got any ideas of what might be up google's sleeve?

Perhaps they're looking for links mentioned in IRC?

BlueSky




msg:200094
 11:57 am on Nov 7, 2003 (gmt 0)

I think it depends on the topic. Many IRC channels/chats in certain areas (tech oriented, celeb interviews, etc) have very low noise to signal ratio. They are far more info rich than quite a number of websites out there. However, I think people would get very mad if their conversations are indexed and placed in a public SE without their knowledge. This would lead to many requesting such pages be removed.

Perhaps this is something they may be considering to offer as a commercial service? I could see a large company wanting this type of info captured and indexed say for their tech support, exec chats, etc.

BlueSky




msg:200095
 7:01 pm on Nov 7, 2003 (gmt 0)

Coincidence or do these guys hang around here?
[theregister.co.uk...]

Maybe Google is doing this as a project for the US government. I wouldn't put anything past Bush and Ashcroft.

Global Options:
 top home search open messages active posts  
 

Home / Forums Index / Google / Google News Archive
rss feed

All trademarks and copyrights held by respective owners. Member comments are owned by the poster.
Home ¦ Free Tools ¦ Terms of Service ¦ Privacy Policy ¦ Report Problem ¦ About ¦ Library ¦ Newsletter
WebmasterWorld is a Developer Shed Community owned by Jim Boykin.
© Webmaster World 1996-2014 all rights reserved