homepage Welcome to WebmasterWorld Guest from 54.204.128.190
register, free tools, login, search, pro membership, help, library, announcements, recent posts, open posts,
Pubcon Platinum Sponsor 2014
Home / Forums Index / Search Engines / Alternative Search Engines
Forum Library, Charter, Moderators: bakedjake

Alternative Search Engines Forum

    
Microsoft to release new search technology
news report about a "new search proto-type"
Namaste




msg:469181
 12:38 pm on Apr 19, 2003 (gmt 0)

[netscape.com.com...]

article talks about Microsoft's search reasearch dept. This is their 3rd search engine related press release this month...they seem to be getting ready to release their own engine VERY SOON.

 

Oaf357




msg:469182
 6:53 pm on Apr 19, 2003 (gmt 0)

Hmm... maybe that's why I saw this in my logs today:

131.107.163.47 - - [18/Apr/2003:23:29:22 -0500] "GET /robots.txt HTTP/1.1" 200 409 "-" "MicrosoftPrototypeCrawler (please report obnoxious behavior to newbiecrawler@hotmail.com)"
131.107.163.47 - - [18/Apr/2003:23:29:24 -0500] "GET / HTTP/1.1" 200 11203 "-" "MicrosoftPrototypeCrawler (please report obnoxious behavior to newbiecrawler@hotmail.com)"

I thought it was spam but the IP address belongs to Microsoft.

Hmm...

daroz




msg:469183
 8:06 pm on Apr 19, 2003 (gmt 0)

Still active... Sorta. (Only pulled my Robots file...)

131.107.163.49 - - [19/Apr/2003:08:18:13 -0400] "GET /robots.txt HTTP/1.1" 200 147 "-" "MicrosoftPrototypeCrawler (please report obnoxious behavior to newbiecrawler@hotmail.com)"

jeremy goodrich




msg:469184
 1:09 am on Apr 20, 2003 (gmt 0)

Wonder if they will pull an IBM and dump the project for some unknown reason after spidering everything under the sun, twice?

And I still find it hard to believe that they are using a 'real' hotmail address for the crawler - they should make it something more professional.

Though with the amount of hype going into it, the number of press releases, and the news about their search department having 70 staffers now, it does seem like they are serious about doing something.

jomaxx




msg:469185
 6:53 pm on Apr 21, 2003 (gmt 0)

I 403'ed "MicrosoftPrototypeCrawler" last week after it slammed my site and ignored my robots.txt exclusions. I assume they're still getting the bugs out at this point, but maybe I'll give the robot limited access just in case MSN goes live with this sooner than I expect.

jomaxx




msg:469186
 6:58 pm on Apr 21, 2003 (gmt 0)

P.S. I loved this new Windows feature mentioned in the article...
>> A prototype application called "Stuff I've Seen," for instance, will store
>> every screen that has popped up on a given computer monitor for a year.

That really really sounds like something I'd want to disable.

Oaf357




msg:469187
 11:04 pm on Apr 21, 2003 (gmt 0)

I'll be turning that off too.

craig1972




msg:469188
 2:42 pm on Apr 27, 2003 (gmt 0)

How can I "403" this thingie? Can I do it somehow in my HTACCESS file? I was hit by this thingie too!

mcavic




msg:469189
 5:29 am on May 2, 2003 (gmt 0)

And I still find it hard to believe that they are using a 'real' hotmail address for the crawler - they should make it something more professional.

I think the email address is fine, but running the crawler on an IP that doesn't resolve is a big no-no for me. :)

Global Options:
 top home search open messages active posts  
 

Home / Forums Index / Search Engines / Alternative Search Engines
rss feed

All trademarks and copyrights held by respective owners. Member comments are owned by the poster.
Home ¦ Free Tools ¦ Terms of Service ¦ Privacy Policy ¦ Report Problem ¦ About ¦ Library ¦ Newsletter
WebmasterWorld is a Developer Shed Community owned by Jim Boykin.
© Webmaster World 1996-2014 all rights reserved