[KinoSearch] Roadmap .30 and Scorers

Andrew Bramble bramble.andrew at gmail.com
Tue Jul 22 17:28:28 PDT 2008


Hello,

After getting useful results and fast with KinoSearch .20 I began looking at
ways to narrow results further using field specific refinements. EG having
CPAN metadata indexed and being able to slice into it by a license field.
Might it be possible for a Scorer (I think it's a scorer) to compute from
within the set of matched results, the total frequency of tokens from a
given field. To use the CPAN example again, rather than choosing to search
for "date parser" and  license:artistic , might the initial search for
"date parser" return the matching results AND a structure describing that of
100 matched documents, the field 'license' breaks down to perl=50,
artistic=30, gpl=10, bsd=5, apache=5.
 One could then repeat the original search , adding  'license:perl'  to
narrow the search to only the 50 matching documents.

Since this would required reading/examining each matched record I would
guess this belongs in the XS/C rather than perl.

Is it wishful thinking ? or might this be possible with subclassable
scorers/hit collectors.

++KinoSearch

Andrew
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://rectangular.com/pipermail/kinosearch/attachments/20080723/4fb0c507/attachment-0001.htm 
-------------- next part --------------
_______________________________________________
KinoSearch mailing list
KinoSearch at rectangular.com
http://www.rectangular.com/mailman/listinfo/kinosearch


More information about the kinosearch mailing list