homepage Welcome to WebmasterWorld Guest from 184.72.82.126
register, free tools, login, search, pro membership, help, library, announcements, recent posts, open posts,
Pubcon Platinum Sponsor 2014
Visit PubCon.com
Home / Forums Index / Code, Content, and Presentation / PHP Server Side Scripting
Forum Library, Charter, Moderators: coopster & jatar k

PHP Server Side Scripting Forum

    
internal site search engine
any good "boxed" php scripts out there?
louponne




msg:1292396
 4:05 pm on Sep 13, 2003 (gmt 0)

I need to install an internal se on a site. I have an "old" perl script that works pretty well, but it works off of a flat file and this site is too big for that. What's more, I'm *much* more comfortable with php than with perl. There must be fully developped scripts to do this, but I can't find any via google, etc that "look" professional.

I'd like:
- separate indexing script (I mean the indexing takes place in one step, storing the results in, say a mysql db, and the searching takes place off that db) [added] hm - I think that this should work off a mysql db, but maybe flat files would be sufficient? The site's nearing 200 pages or so[/added]
- results ordered by relevance, depending where words found
- highlighting matched words
- support for pdf
- "no index" word skip list
- "no index" page skip list
- "no index" tags for skipping parts of pages

Anyone use anything in this area?

 

jatar_k




msg:1292397
 6:41 pm on Sep 15, 2003 (gmt 0)

hard to say louponne, I have used a few custom search scripts in the past but I haven't used an out of the box solution.

I also haven't had sites where I was really worried about the site search too much.

any have any ideas or have you maybe found something louponne?

louponne




msg:1292398
 7:32 pm on Sep 15, 2003 (gmt 0)

Thanks for responding, jatar_k - my post was really beginning to feel very lonely!

Mind you, I'm not looking necessarily for a complete out-of-the-box solution, but at least one I can work from. The only thing I could find that seemed to come relatively close is Zoom by wrensoft.com - the only main thing it doesn't do is stock the data in a mysql db (it works off of flat files). The rest looks pretty good.

I'm actually quite surprised not to find more scripts for this. I guess everyone does their own!

jatar_k




msg:1292399
 8:25 pm on Sep 15, 2003 (gmt 0)

seems to be that way. It always depneds on the site. A lot of sites I've worked with don't even have a search.

It's a bit strange because it seems to be something people ask for a lot. I guess that's because there a re no good ones.

ProfMoriarty




msg:1292400
 9:19 pm on Sep 15, 2003 (gmt 0)

Try PHPDig ([phpdig.toiletoine.net ]), it´s GPL and nice clean code you can change easily to match your needs.

Best regards,

ProfMoriarty

louponne




msg:1292401
 9:31 pm on Sep 15, 2003 (gmt 0)

Thanks, ProfMoriarty - I had found that one, but it seemed to be missing so many of the features I need that it didn't seem worthwhile. But I've downloaded it and I'll give it a better look!

ProfMoriarty




msg:1292402
 10:02 pm on Sep 15, 2003 (gmt 0)

louponne,

give it a try. So did I and after having invested some time, it was worth the effort because PHPDig´s rudiments are quite good and really easily to modify.

First take a look at the basic tables created in your database. That should make you an easy start and the search algorithm is yet rather clear.

So you need to understand how indexing works. It´s not as complicated as one may think as long you have the table structure and its meaning in mind - and now the (okay, _your_) work begins. Make some use of regular expressions to locate words, skip passages etc. Use a wrapper or whatever to read and parse PDFs - anyway, these are the main lines of code you have to extend.

Good luck,

ProfMoriarty

Global Options:
 top home search open messages active posts  
 

Home / Forums Index / Code, Content, and Presentation / PHP Server Side Scripting
rss feed

All trademarks and copyrights held by respective owners. Member comments are owned by the poster.
Home ¦ Free Tools ¦ Terms of Service ¦ Privacy Policy ¦ Report Problem ¦ About ¦ Library ¦ Newsletter
WebmasterWorld is a Developer Shed Community owned by Jim Boykin.
© Webmaster World 1996-2014 all rights reserved