[KinoSearch] Re: Wildcards
Father Chrysostomos
sprout at cpan.org
Thu Feb 14 20:36:21 PST 2008
On Feb 13, 2008, at 9:03 PM, Marvin Humphrey wrote:
>
> On Feb 13, 2008, at 8:26 PM, Nathan Kurz wrote:
>
>> Father C (and lurkers), I think it would be great if you could write
>> up your overview as well. Even if you haven't poked around all the
>> innards in depth, you're much closer to the way it works than most
>> users will ever be. So without reference to how it actually works,
>> write up something describing how it should work.
I’m actually quite clueless in this regard. Yes, I’ve looked at the
code (at least what’s in Perl), but I still don’t know what’s going
on. As for how it should work, I have no idea...just as long as I can
use it. :-)
> Quickly...
>
> I'm pretty close to a coherent API for Weight.
>
> [... blah blah blah...]
>
> What's left to refactor ....
Could you include a way for a set of terms to be treated as a single
term with regard to scoring, i.e., as if ‘fool’ and ‘food’ (in a
wildcard foo* match, for instance) were simple stored as ‘foo’ in the
index (the way word stemming works)? (If I’m not making myself clear,
please let me know.) If you don’t want to include this in core
KinoSearch, could you at least bear this in mind? This would, I
believe, affect the way doc_freq is calculated.
More information about the kinosearch
mailing list