Forum Library, Charter, Moderators: bakedjake

Alternative Search Engines Forum

    
Nutch vs. Sphinxsearch
wheel




msg:3376220
 9:05 pm on Jun 22, 2007 (gmt 0)

I'm running Nutch on a number of projects. It's fast and can index a huge volume of data, which makes it a great search engine. But it's difficult to set up (it's Java-based rather than the typical LAMP stack I'm used to) and not easy to customize without a skilled developer.

I just ran across Sphinx, another GPL'ed search engine, apparently based on MySQL and PHP.

Does anyone have experience with both who can compare or contrast the two? (Sphinx now claims to handle terabytes of data; index size was one of the reasons I didn't go with a PHP/MySQL setup before.)

 

runarb




msg:3377306
 1:07 pm on Jun 24, 2007 (gmt 0)

Sphinx is not based on PHP and MySQL.

Sphinx is, as I understand it, written in C++ and uses its own index format, stored in files.

Metaphorically




msg:3381667
 1:19 am on Jun 29, 2007 (gmt 0)

I just built and installed Sphinx. It definitely uses MySQL as well as some of its own files for indexing. I think it uses MySQL only as a data source (as opposed to the HTML files you'd get from crawling your web site).
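To illustrate the split described here (a database as the data source, with the search index kept in separate files), here is a minimal sketch in Python. It is not how Sphinx works internally: the standard-library sqlite3 module stands in for MySQL, the index is a plain JSON file rather than Sphinx's own format, and the table name `documents`, the sample rows, and every function name are made up for the example.

```python
import json
import re
import sqlite3
from collections import defaultdict


def load_rows(conn):
    """Pull (id, body) rows from the database, which acts only as the data source."""
    return conn.execute("SELECT id, body FROM documents ORDER BY id").fetchall()


def build_index(rows):
    """Build an inverted index: each lowercase word maps to the sorted ids containing it."""
    index = defaultdict(set)
    for doc_id, body in rows:
        for word in re.findall(r"[a-z0-9]+", body.lower()):
            index[word].add(doc_id)
    return {word: sorted(ids) for word, ids in index.items()}


def save_index(index, path):
    """Write the index to its own file, separate from the database."""
    with open(path, "w", encoding="utf-8") as fh:
        json.dump(index, fh)


def search(path, word):
    """Answer a one-word query from the index file alone; the database is not touched."""
    with open(path, encoding="utf-8") as fh:
        index = json.load(fh)
    return index.get(word.lower(), [])


def demo(index_path):
    """Index three hypothetical rows, close the database, then query the file."""
    conn = sqlite3.connect(":memory:")  # stand-in for a MySQL connection
    conn.execute("CREATE TABLE documents (id INTEGER PRIMARY KEY, body TEXT)")
    conn.executemany(
        "INSERT INTO documents (id, body) VALUES (?, ?)",
        [
            (1, "notes about nutch setup"),
            (2, "notes about sphinx setup"),
            (3, "comparing nutch and sphinx"),
        ],
    )
    rows = load_rows(conn)
    conn.close()
    save_index(build_index(rows), index_path)
    return search(index_path, "sphinx")
```

Calling `demo("demo_index.json")` (the file name is arbitrary) indexes the sample rows and then answers the query after the database connection is closed, which is the point of the sketch: once the index files exist, searching does not need the data source.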

I ran through the "Quick Sphinx usage tour" in the instructions and it seems to work, but it doesn't feel very robust. I had to create a couple of the directories it expected, and it wasn't trivial to run it as a non-root user.

I also found that the instructions mean /usr/local/etc when they say /usr/local/sphinx/etc.

I haven't configured Nutch, only been a user on it, but I think I might install that now for comparison.


All trademarks and copyrights held by respective owners. Member comments are owned by the poster.
WebmasterWorld is a Developer Shed Community owned by Jim Boykin.
© Webmaster World 1996-2014 all rights reserved