Welcome to WebmasterWorld Guest from 54.196.104.66

Forum Moderators: bakedjake

Message Too Old, No Replies

Microsoft to release new search technology

news report about a "new search proto-type"

     
12:38 pm on Apr 19, 2003 (gmt 0)

Senior Member

WebmasterWorld Senior Member 10+ Year Member

joined:Dec 2, 2002
posts:792
votes: 0


[netscape.com.com...]

article talks about Microsoft's search reasearch dept. This is their 3rd search engine related press release this month...they seem to be getting ready to release their own engine VERY SOON.

6:53 pm on Apr 19, 2003 (gmt 0)

Preferred Member

10+ Year Member

joined:Mar 5, 2003
posts:636
votes: 0


Hmm... maybe that's why I saw this in my logs today:

131.107.163.47 - - [18/Apr/2003:23:29:22 -0500] "GET /robots.txt HTTP/1.1" 200 409 "-" "MicrosoftPrototypeCrawler (please report obnoxious behavior to newbiecrawler@hotmail.com)"
131.107.163.47 - - [18/Apr/2003:23:29:24 -0500] "GET / HTTP/1.1" 200 11203 "-" "MicrosoftPrototypeCrawler (please report obnoxious behavior to newbiecrawler@hotmail.com)"

I thought it was spam but the IP address belongs to Microsoft.

Hmm...

8:06 pm on Apr 19, 2003 (gmt 0)

Junior Member

10+ Year Member

joined:Jan 20, 2003
posts:151
votes: 0


Still active... Sorta. (Only pulled my Robots file...)

131.107.163.49 - - [19/Apr/2003:08:18:13 -0400] "GET /robots.txt HTTP/1.1" 200 147 "-" "MicrosoftPrototypeCrawler (please report obnoxious behavior to newbiecrawler@hotmail.com)"

1:09 am on Apr 20, 2003 (gmt 0)

Senior Member

WebmasterWorld Senior Member jeremy_goodrich is a WebmasterWorld Top Contributor of All Time 10+ Year Member

joined:Aug 4, 2000
posts:3468
votes: 0


Wonder if they will pull an IBM and dump the project for some unknown reason after spidering everything under the sun, twice?

And I still find it hard to believe that they are using a 'real' hotmail address for the crawler - they should make it something more professional.

Though with the amount of hype going into it, the number of press releases, and the news about their search department having 70 staffers now, it does seem like they are serious about doing something.

6:53 pm on Apr 21, 2003 (gmt 0)

Senior Member

WebmasterWorld Senior Member jomaxx is a WebmasterWorld Top Contributor of All Time 10+ Year Member

joined:Nov 6, 2002
posts:4768
votes: 0


I 403'ed "MicrosoftPrototypeCrawler" last week after it slammed my site and ignored my robots.txt exclusions. I assume they're still getting the bugs out at this point, but maybe I'll give the robot limited access just in case MSN goes live with this sooner than I expect.
6:58 pm on Apr 21, 2003 (gmt 0)

Senior Member

WebmasterWorld Senior Member jomaxx is a WebmasterWorld Top Contributor of All Time 10+ Year Member

joined:Nov 6, 2002
posts:4768
votes: 0


P.S. I loved this new Windows feature mentioned in the article...
>> A prototype application called "Stuff I've Seen," for instance, will store
>> every screen that has popped up on a given computer monitor for a year.

That really really sounds like something I'd want to disable.

11:04 pm on Apr 21, 2003 (gmt 0)

Preferred Member

10+ Year Member

joined:Mar 5, 2003
posts:636
votes: 0


I'll be turning that off too.
2:42 pm on Apr 27, 2003 (gmt 0)

Full Member

10+ Year Member

joined:Mar 29, 2003
posts:226
votes: 0


How can I "403" this thingie? Can I do it somehow in my HTACCESS file? I was hit by this thingie too!
5:29 am on May 2, 2003 (gmt 0)

Senior Member

WebmasterWorld Senior Member 10+ Year Member

joined:Mar 31, 2003
posts:1316
votes: 0


And I still find it hard to believe that they are using a 'real' hotmail address for the crawler - they should make it something more professional.

I think the email address is fine, but running the crawler on an IP that doesn't resolve is a big no-no for me. :)