Forum Moderators: bakedjake

Message Too Old, No Replies

LookSmart has acquired this Grub Inc

Open Source crawling

         

Jillibert

2:41 am on Mar 15, 2003 (gmt 0)



LOOK have purchased this company

[grub.org...]

It appears this will be able to not only refresh the hosts index but "crawl" the web and refresh the web index on a daily basis.

Very interesting process much like Napster and Seti.

Certainly will give apowerful and cost affective edge to Wisenut.

born2drv

4:16 am on Mar 16, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Technically speaking you don't have to contribute anything to get your site refreshed.

You can enable local crawling (your sites) and dissable remote crawling (other sites) in the preferences in the software. Obviously they don't want you to do this, but you're still doing them a favor....

By spidering your own sites, compressing it up and sending it to them, you are sparing them the bandwidth to crawl your site. You'll also be more inclined to make sure all your content is properly spidered and fresh. So they still win.

Maybe if this were to catch on and everyone spidered their own content, there would not be much other work to pass on for other members to spider anyways.

Kind of like all of us calculating/sending in our tax refunds like good citizens vs. getting the IRS to go on a hunt for our heads and do it the slow and painful way :) Spidering your own content and sending it in could end up becoming the norm.

cchooper

5:03 am on Mar 16, 2003 (gmt 0)

10+ Year Member



Those features don't work yet :) Not for me at least, hmm, and why hasn't anything been posted on the Grub site itself about the purchase? Very interesting ... you'd think that for such a community-based (almost) site it would've been mentioned by now.

Jillibert

6:08 am on Mar 16, 2003 (gmt 0)



cchooper,its in the forums section of the site.

I spoilt the party by breaking the news from getting the information from the LOOK annual report.Its amazing what you can find what comapnies are doing in annual reports/worth the look sometimes.

Judging from what Kordless has said i would think the 18th will be a nice date for others then on here to find out,just say hard to hide from us.

And from what i gather,what we are seeing now is not what they will be rolling out as the full version of this,as this is in beta.This will really heat things up in the search engine freshness area and if Looksmart can pull this off with Wisenut technology supporting it,could be a winner.

Born2 brought up some excellents points re the future of how this could be implemented too.

Fischerlaender

12:58 pm on Mar 16, 2003 (gmt 0)

10+ Year Member



porkyoz wrote:
To have your clients sites refreshed daily I would have thought would be in yours and your clients best interests. I see it as a two way street. True, you are providing bandwidth according to your limits and not "Grubs" appetite. It is entirely up to you.

You are completely right that this would make sense for an SEO. But I did not think of SEOs; I thought of the normal users - and they are the ones who have the sheer number that a project like Grub needs.

And for the technical aspect of misleading the Grub crawler: This program runs on my computer. It's very easy to manipulate domain name resolution on my local system. (using my own DNServer or just editing my hosts file) No checksum can help here. I'm not saying that it is easy to spam Grub this way but people are doing crazy things just to get a better PageRank. So Grub should be aware of its weakness.

cchooper

10:36 pm on Mar 16, 2003 (gmt 0)

10+ Year Member



Ah, good thinking =] Why not point every domain at my server's IP? Or even just my competitors (On a more personal note I'm targeting a not-so-competitive area, but yanno)

IMHO I think Grub really shoulda been something targeted to only run and monitor a server's (or networks, like a blog ring or some such) web documents, and then webmasters would be able to write-up a fancy front-end showing their latest updates (such as new products) and other updates wherever and however they please. A local system, and not so much a system that says "run this, and don't benefit," which, in its current stage, is exactly what it's screaming.

keeper

11:05 pm on Mar 16, 2003 (gmt 0)

10+ Year Member



Has anyone considered that this move also has the ability to defeat cloaking? I mean distributed spidering would make IP delivery a little prickly wouldn't it?

cchooper

2:07 am on Mar 17, 2003 (gmt 0)

10+ Year Member



Then you could always cloak by user agent O_o

Hollywood

5:05 am on Mar 17, 2003 (gmt 0)

10+ Year Member Top Contributors Of The Month



I'm in the top 10 for checked urls on Grubs site, and we are all spiking the chart off the scale.

Very good to see, let's push this baby to the max! Google would be quite interested in how far this chart can go I would think.

This Grub.org thing is interesting, what say you all?

~Hollywood

P.S. Keep pushi'n

papamaku

4:38 pm on Mar 17, 2003 (gmt 0)

10+ Year Member



does anyone know how much looksmart paid for grub?

+ what about their developers? will they be working for looksmart now?

Jillibert

9:39 pm on Mar 17, 2003 (gmt 0)



Papamaku........since LOOK acquired it in January and they are still developing this product one would have to think the integration of the 3 developers into LOOK is complete.

Note the latest version of Grub was released today with an auto update feature for the host.

On how much LOOK paid....it was a scrip deal and its in the annual report too.

jrobbio

10:56 pm on Mar 17, 2003 (gmt 0)

10+ Year Member



Check out the grub website for the announcement. Looks like people were right about Wisenut. Read on!

Hollywood

12:44 am on Mar 18, 2003 (gmt 0)

10+ Year Member Top Contributors Of The Month



"Looks like people were right about Wisenut"

Please explain

jrobbio

2:53 am on Mar 19, 2003 (gmt 0)

10+ Year Member



Hollywood it is intended for use with the Wisenut index with on the fly refreshing.
On another note a newer version of 1.11 has been posted because of some corrupted OCX's so the recommendation is to uninstall and then reinstall the newer version.

Camster

10:56 pm on Mar 20, 2003 (gmt 0)

10+ Year Member



now _that_'s interesting. and they said to keep an eye on wisenut...

jrobbio

9:53 pm on Mar 21, 2003 (gmt 0)

10+ Year Member



Found on the Grub front page:
CNET is running a headline about Grub and Looksmart.

http://news.com.com/2100-1032-993591.html [news.com.com]

rubble88

12:40 am on Mar 22, 2003 (gmt 0)

10+ Year Member



Here's the mention from the annual report,
In January 2003, we acquired substantially all of the assets of Grub, Inc., a developer of
distributed computing software which allows community participants to assist in the development and
updating of a web search index. We believe that by incorporating a distributed computing solution
into our systems and processes for updating our search index, we may be able to achieve substantial
gains in the freshness of the index and cost savings over the long term.

JonB

9:54 am on Apr 18, 2003 (gmt 0)

10+ Year Member



jsut saw this on some slovenian forum. they say google crawls around 150 million pages per day - grub is now up to 60 million.impressive.Also this slovenian news says that sites will be daily refresh or something - compared to googles once per month.

i didnt follow this discusion before - any changes since last post (month ago)? is it the breakthrough like they say on grub page?

This 47 message thread spans 2 pages: 47