Welcome to WebmasterWorld Guest from

Forum Moderators: bakedjake

Message Too Old, No Replies

Nutch vs. Sphinxsearch

9:05 pm on Jun 22, 2007 (gmt 0)

Senior Member

WebmasterWorld Senior Member wheel is a WebmasterWorld Top Contributor of All Time 10+ Year Member

joined:Feb 11, 2003
votes: 12

I'm running nutch on a number of projects. It's fast, and lets one index a huge volume of data - a great engine for search. But it's difficult to set up (it's java based instead of the typical LAMP stuff I'm used to) and not so easy to customize without a skilled developer.

I just ran across sphinx search, another GPL'ed engine based on mysql and php apparently.

Has anyone experience with both of these to compare or contrast these two methods? (Sphinx search is claiming terrabytes of data now, index size being one of the reasons I didn't go with a php/mysql setup before).

1:07 pm on June 24, 2007 (gmt 0)

New User

10+ Year Member

joined:May 31, 2003
votes: 0

Sphinx is not based on php and MySQL.

Sphinx is as I understand it written in c++, and uses its own index format, stored in files.

1:19 am on June 29, 2007 (gmt 0)

Full Member

10+ Year Member

joined:Dec 22, 2004
votes: 0

I just built and installed Sphinx. It definitely uses MySQL as well as some of it's own files for indexing. I think the way it uses MySQL is just as a data source (as opposed to html files you'd get from your web site).

I ran through the "Quick Sphinx usage tour" in the instructions and it seems to work but it doesn't feel very robust. I had to create a couple of the directories it expected and it wasn't trivial to run it as a non-root user.

I also found that the instructions mean /usr/local/etc when they say /usr/local/sphinx/etc.

I haven't configured Nutch, only been a user on it, but I think I might install that now for comparison.


Join The Conversation

Moderators and Top Contributors

Hot Threads This Week

Featured Threads

Free SEO Tools

Hire Expert Members