Forum Moderators: open

Message Too Old, No Replies

new bot: http://www.almaden.ibm.com/cs/crawler [c01]

IBM Web Fountain

         

claus

3:34 pm on Aug 15, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



The very hyped "hot shot" search technology from IBM just visited, i'm honored :)

1) UA-url yields info: [almaden.ibm.com...]

2) And this as well: [almaden.ibm.com...]

Reads "robots.txt" and has not violated. Link 1 claims support. Very well-behaved, spent five (5) hours GETting 89 files.

/claus



Method: GET
Protocol: HTTP/1.0
Files: HTML + "robots.txt"
UA-string: [almaden.ibm.com...] [c01]

IP: 66.147.154.3
Focal Communications FOCC-SPRBLK-4 (NET-66-147-128-0-1)
66.147.128.0 - 66.147.223.255
IBM Almaden Research Center FOCC-IBM-SJC-1 (NET-66-147-154-0-1)
66.147.154.0 - 66.147.154.63



edit: added: it does not seem to understand a 301-redirect - all files were redirected and it only got the headers (or perhaps it insists on domains starting with "www."?)

moltar

3:48 pm on Aug 15, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



I got something too today. Was just looking here at WW if anyone knows anything about it and found this post.

AwStats identified it as "IBM_Planetwide", is that the one? I can't find that record in my logs though. What name does it go under in logs? It grabed 42 pages at once. Very good.

lazerzubb

3:50 pm on Aug 15, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Hehe do a few searches, the almaden crawler have been online as long as i can remember ;)

moltar

3:52 pm on Aug 15, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



What about "IBM_Planetwide"? I searched WW and found nothing.

lazerzubb

3:54 pm on Aug 15, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



moltar: [robotstxt.org...]

And from webmasterworld:
[webmasterworld.com...]

claus

4:00 pm on Aug 15, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



>> What name does it go under in logs?

This: "http://www.almaden.ibm.com/cs/crawler [c01]"

>> IBM planetwide

Here: [ibm.com...] -no bot there though...

>> have been online as long as i can remember

damn, then i'm not honored at all... 'twas about time they paid me a visit then..


added:

After seeing lazerzubbs post and following the link i think that IBM Planetwide is not the same bot. Or perhaps it is an old name for it..

moltar

4:11 pm on Aug 15, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Thanks lazerzubb, it says "Restricted to IBM owned or related domains." on the description page of that robot.

My domain is not owned nor it's anyhow related to IBM. Why would it come to my site?

wilderness

4:28 pm on Aug 15, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Why would it come to my site?

To make YOUR resources available to its customers :(

wilderness

11:46 pm on Sep 2, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



from today's IAR Newsletter

4. IBM to Update Information Integration Tool
Big Blue embarks on a big project to add rich media, speed, flexibility and to its DB2
Information Integrator product; for now, call it 'Project Masala.'
http ://www.internetnews.com/ent-news/article.php/3070461