[KinoSearch] kinosearch 0.20 usable?
Gert Brinkmann
g111 at netcologne.de
Sat Feb 7 03:00:14 PST 2009
Marvin Humphrey wrote:
> It's frequent enough to be unacceptable: 1 in 20, 1 in 30. What happens is
> that when the Highlighter trims an excerpt, the keyword that made the excerpt
> relevant gets trimmed sometimes, yielding an excerpt that is not obviously
> relevant.
This reminds me of a bug in Solr 1.3, the release we are currently using.
There, prefix queries like "house*" do not return a highlight snippet.
That was why I looked for another solution such as KinoSearch. But since
we have found a workaround for this bug, we will try to continue with
Solr first. I will keep KinoSearch in mind and probably give it a try
later, once at least 0.20 is released.
> I've put a ton of effort into modularizing the OO design and the file format
> so that supporting back compat will be easier going forward. There will be a
> big push sometime soon to write up a draft Apache Lucy File Format
Is KinoSearch using Lucy somehow? Is the Lucy format different from the
Lucene format? Isn't there already a Lucy/Lucene file format
specification, given that the project has existed for some time?
> Specification version 0.1, which KS will conform to. If you'd like to
> contribute by reviewing that document, I'd appreciate it; the better a job we
> do, the sooner we'll "get the engine stable".
Well, this sounds to me as if Linus Torvalds had asked me to review
Linux kernel documentation. What type of documentation will this be?
Won't it mostly cover internal IR-theoretical things (that I do not know
much about)?
Gert