[KinoSearch] Re: Wildcards

Father Chrysostomos sprout at cpan.org
Thu Feb 14 20:36:21 PST 2008


On Feb 13, 2008, at 9:03 PM, Marvin Humphrey wrote:

>
> On Feb 13, 2008, at 8:26 PM, Nathan Kurz wrote:
>
>> Father C (and lurkers), I think it would be great if you could write
>> up your overview as well.  Even if you haven't poked around all the
>> innards in depth, you're much closer to the way it works than most
>> users will ever be.  So without reference to how it actually works,
>> write up something describing how it should work.

I’m actually quite clueless in this regard. Yes, I’ve looked at the  
code (at least what’s in Perl), but I still don’t know what’s going  
on. As for how it should work, I have no idea...just as long as I can  
use it. :-)

> Quickly...
>
> I'm pretty close to a coherent API for Weight.
>
> [... blah blah blah...]
>
> What's left to refactor ....

Could you include a way for a set of terms to be treated as a single  
term with regard to scoring, i.e., as if ‘fool’ and ‘food’ (in a  
wildcard foo* match, for instance) were simple stored as ‘foo’ in the  
index (the way word stemming works)? (If I’m not making myself clear,  
please let me know.) If you don’t want to include this in core  
KinoSearch, could you at least bear this in mind? This would, I  
believe, affect the way doc_freq is calculated.




More information about the kinosearch mailing list