[KinoSearch] Roadmap .30 and Scorers

Justin DeVuyst justin at devuyst.com
Tue Jul 22 17:57:56 PDT 2008



Hello,

I was playing around with indexing and searching CPAN with KinoSearch
recently myself.  Could you elaborate on what your plans are?  I'd
like to move on to something else if someone else is already doing
what I would like to see happen.

Basically my goal is to make everything known about modules on the CPAN
searchable in one place.  Whether KinoSearch can fit the whole bill or
just part of it, I'm still not sure.

Thanks,
jdv

Andrew Bramble wrote:
> Hello,
>
> After getting useful results, and fast, with KinoSearch .20 I began
> looking at ways to narrow results further using field-specific
> refinements, e.g. having CPAN metadata indexed and being able to slice
> into it by a license field.
> Might it be possible for a Scorer (I think it's a scorer) to compute,
> from within the set of matched results, the total frequency of tokens
> from a given field? To use the CPAN example again: rather than choosing
> to search for "date parser" and license:artistic, might the initial
> search for "date parser" return the matching results AND a structure
> describing that, of 100 matched documents, the field 'license' breaks
> down to perl=50, artistic=30, gpl=10, bsd=5, apache=5?
> One could then repeat the original search, adding 'license:perl' to
> narrow the search to only the 50 matching documents.
>
> Since this would require reading/examining each matched record, I
> would guess this belongs in the XS/C rather than Perl.
>
> Is it wishful thinking? Or might this be possible with subclassable
> scorers/hit collectors?
>
> ++KinoSearch
>
> Andrew
> _______________________________________________
> KinoSearch mailing list
> KinoSearch at rectangular.com
> http://www.rectangular.com/mailman/listinfo/kinosearch
>
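
The per-field breakdown described in the quoted message can be sketched
roughly as follows. This is a minimal illustration in plain Python, not
KinoSearch code: it assumes the matched documents are already available as
dictionaries, and the names facet_counts, narrow and hits are hypothetical.
The sample data simply reproduces the license figures from the example
above.

```python
from collections import Counter


def facet_counts(matched_docs, field):
    """Count how often each value of `field` occurs among the matched docs."""
    counts = Counter()
    for doc in matched_docs:
        value = doc.get(field)
        if value is not None:  # skip documents that lack the field
            counts[value] += 1
    return counts


def narrow(matched_docs, field, value):
    """Keep only the matched docs whose `field` equals `value`."""
    return [doc for doc in matched_docs if doc.get(field) == value]


# Hypothetical result set for "date parser": 100 matched documents,
# using the license distribution given in the quoted message.
hits = (
    [{"license": "perl"}] * 50
    + [{"license": "artistic"}] * 30
    + [{"license": "gpl"}] * 10
    + [{"license": "bsd"}] * 5
    + [{"license": "apache"}] * 5
)

license_counts = facet_counts(hits, "license")
perl_only = narrow(hits, "license", "perl")
```

In a real engine the counting would presumably happen while hits are
collected, which is why the quoted message expects it to live in XS/C;
the sketch only shows the shape of the returned structure and the
follow-up narrowing step.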


