I have looked at [java-source.net...], but these are spiders only. I am prepared to pair a spider with Lucene, but would prefer to maintain only one codebase.
I would like to run this on my server.
It must be a spider, as some content is dynamic and served by an application server running on another machine (so its filesystem is not available).
I would prefer to use Java as the client has a problem with Perl interpreters running on vulnerable machines.
I know about fdse, but it's Perl :(
It needs to be reasonably current (unlike [me.lv...], which is not open source either).
I've been looking for ages, and the best offering so far is [htdig.org...]
Any ideas?