Forum Moderators: phranque
I am tearing my hair out at the moment trying to find a good and effective in site search engine which will list HTML, .ASP, PDF's and Word docs - can anyone help me with a recommendation via sticky if need be.
The solution would need to work on a Windows server and I would like the SE to be hosted within the web space, ie. it won't be remotely hosted. Ideally, I would like the solution to spider the site at regular intervals to index new pages/additions - am I asking to much here? It would need to be able to index .asp dynamic pages generated by a MsSQL database.
I only have about $600 - 1000 to spend.
Thanks for your help
Mark
What's the downside of a remotely hosted search?
I implemented it for a client, and using includes it calls out for the information (search results), then populates it in a web page template that matches the rest of the web site. The url never changes to anything outside of my client's domain. The url always shows www.myclientsdomain.com.
If there's a downside to this I'm interested in knowing it.
Thanks!
:)
[edited by: martinibuster at 10:57 pm (utc) on Feb. 5, 2003]
Martin - the main downside for this particular project is that I am bound by a very tight 'Data protection' clause in the clients contract with me.
Because the web site will actually contain contact from third party companies (it's a bit like a 'business club' project) I am not allowed under this contract to pass this data to a 'third party' - such as a provider of a remotely hosted search. Obviously, a visitor to the site could simply copy the data direct from the web page, but if they do this I am not actually knowingly passing the data to someone else!
I even had to get clearance from the client regarding the web hosting of the project, simply because this data would reside on a web site hosted by a company other than my own.
It's a bit of a restrictive arrangement, but hence the reason why I cannot really have data collected from the site and held elsewhere.
Mark
This one looked really promisng but has the following probs - needs cgi/perl, many windows hosts just wont install it.
Also I cant get it to work on any reseller accounts, error could not verify that web address [mydomain.com...] maps to ftp://www.mydomain.com
I guess because it a shared/virtual server?
I have looked for a good search a while, for windows it seems to be either fp-extensions (crap search), site server (expensive) , write your own using filesystem (brute force search) or remote hosted.
Yeah, I am running Unix, so I don't know about windowz.
>Also I cant get it to work on any reseller accounts, error >could not verify that web address [mydomain.com...] >maps to ftp://www.mydomain.com
I use it on several child accounts (reseller). I assume you are talking about the automatic install. I had no probs w/ auto install on shared server.
Try the support forums over there, the author and his followers are totally cool, and usually very helpful w/ all kinds of requests. They can probably help w/ windows application.
Did a couple of searches and found the service had finally (for them) gotten their "search sponsors" listings act together. What had in the past been three very general blurbs before my results that I didn't mind are now three very highly-targeted (based on search keywords) results. Wonder what that's been costing me by way of traffic and sales. (If you use a free service, when was the last time you searched your site?)
Immediately checked out linkshark's recommendation. Couldn't do an automatic install on a site that shares an IP (where I was just going to test it), but installed it automatically on a site with a dedicated IP in about 30 seconds.
Very impressive. Lot's of configuration options. Only glitch was my robots.txt keeping it out (a few general mozilla compatible entries) but simple to change the user agent (watch out for Calamity Jane galloping by).
Just a few more tweaks here and there and it's all set.
Thanks linkshark,
Jim
Also supports it's own robots noindex meta tag you can add to pages you do NOT want indexed when crawling.
* I think Brett uses a modified version of that script for this site.
I am running on Linux but they also claim to have a windows version.
daisho