[KinoSearch] get doc/query similarity

jack_tanner at yahoo.com jack_tanner at yahoo.com
Tue Apr 15 07:13:39 PDT 2008



Ping? Still trying to compute similarity of two indexed docs... A weighted cosine or some such.

Thanks in advance.

----- Original Message ----
> From: "jack_tanner at yahoo.com" <jack_tanner at yahoo.com>
> To: KinoSearch discussion forum <kinosearch at rectangular.com>
> Sent: Friday, April 11, 2008 3:07:05 PM
> Subject: Re: [KinoSearch] get doc/query similarity
> 
> From: Marvin Humphrey 
> >
> > Let's assume you mean a term, for the sake of getting things started.   
> > Let's also assume that you don't really mean "one specific document",  
> > even though that's exactly what you said. :)
> 
> Thanks for that example. Let me be more clear about what is desired: I need to 
> compute the similarity of two indexed documents. I'd like it if the metric was 
> more sophisticated than mere term overlap. At a minimum, it could be Jaccard 
> (i.e., doc length-normalized term overlap). It would be preferable to have 
> something that takes corpus statistics into account. For example, if in my 
> corpus some term T has high TF and low IDF (occurs often and in many docs), then 
> such a term could be downweighted. Could you suggest a way of doing this? 
> Ideally with KS 0.162?
> 
> > Interesting.  I received your private email and wrote back.  Maybe  
> > hotmail is blocking rectangular.com or something.  AOL blockaded me  
> > once because the previous tenants on the Comcast IP block  
> > rectangular.com got assigned to weren't good netizens.
> 
> I never got your response at my hotmail address, not even in the spam folder. If 
> you like, I could forward a complaint to Hotmail's postmaster. Please send a 
> test e-mail to my @yahoo and cc my @hotmail, and I'll forward that along.
> 
> 
> 
> 
> __________________________________________________
> Do You Yahoo!?
> Tired of spam?  Yahoo! Mail has the best spam protection around 
> http://mail.yahoo.com 
> 
> 
> _______________________________________________
> KinoSearch mailing list
> KinoSearch at rectangular.com
> http://www.rectangular.com/mailman/listinfo/kinosearch
> 




      ____________________________________________________________________________________
Be a better friend, newshound, and 
know-it-all with Yahoo! Mobile.  Try it now.  http://mobile.yahoo.com/;_ylt=Ahu06i62sR8HDtDypao8Wcj9tAcJ


_______________________________________________
KinoSearch mailing list
KinoSearch at rectangular.com
http://www.rectangular.com/mailman/listinfo/kinosearch




More information about the kinosearch mailing list