Forum Moderators: phranque
<!--No Search-->This won't be indexed<!--End No Search--> That last one is what's held up the show so far. Perlfect and other free site searches will do most of the rest pretty well, but so far I haven't been able to find anything that keeps decent search stats. There's nothing worse than a site search that returns no results or no good results, so I'd like to be able to keep tabs at least on the searches that return no results or don't get a clicked result. Obviously, knowing which results have been clicked would also be very useful information.
Thanks in advance for any suggestions.
Matthew
The site is running on Linux and Apache 1.3.33, with MySQL 4.1.13.
I've been messing about with corzoogler but it does not (really) have relevance and it also works on raw files (not an index) (which is good and bad - but probably not serve your purpose Matthew).
The site is mostly dynamic etc etc
a) Does this mean you have a regular, predictable update regime (of the database itself) or you are just prepared to wear some inaccuracies / misses / losses? A loss would be a bit annoying or does the site just keep growing?
b) Isn't redirecting from the static name to the dynamic page a bit (too) fiddly or would this all be stored in the index? If so, I'm pretty sure you'd have to hack the code or it would have to be specifically written to do this (and not a bad selling point either).
Nice topic!
a)Does this mean you have a regular, predictable update regime (of the database itself) or you are just prepared to wear some inaccuracies / misses / losses? A loss would be a bit annoying or does the site just keep growing?
Articles are added weekly, forum posts and other content is added daily by our visitors. Several times per day, I can have an automated script go through the database and build static HTML pages of all the actual content, storing them in a directory of my choosing. I would seldom if ever remove content, so losses shouldn't be an issue. The search index wouldn't be right up to the minute, but it would be close enough to suit me.
b) Isn't redirecting from the static name to the dynamic page a bit (too) fiddly or would this all be stored in the index? If so, I'm pretty sure you'd have to hack the code or it would have to be specifically written to do this (and not a bad selling point either).
Actually, the index would only be indexing the static pages. When someone performed a search, the results would link to the static HTML files - no redirection at all. This is okay because the script that builds the static pages still preserves all the links on the page so they point back to the dynamic site. In other words, although the static page is where the searcher will be taken after clicking a result, everything will still behave exactly as though they were on a dynamic page. To put it another way, imagine viewing a dynamic page in your browser that uses absolute URL's. Now view the page source and save it as a new, static file in a new location. You'll be able to view the new page and it will look and act just like the original, dynamic version, and clicking a link on the new page will take you to one of the normal, dynamic pages. That's about the same thing as what's going on here.
Although this approach would work well for this site, it would also be fine if I could only find a site search that would crawl and index the dynamic pages themselves. Actually this might be much better and simpler in the long run. I'd have to configure either the indexer or robots.txt to disallow access to certain non-content files, such as the login and new forum post scripts. But once that was done, indexing right from the dynamic pages would probably be best after all.
[isearchthenet.com...]
There is a Pro and a free version, and is PHP. It does most, if not all of what you require and is well supported.