[KinoSearch] lucene indexes

Marvin Humphrey marvin at rectangular.com
Sat Jan 26 07:41:03 PST 2008




On Jan 26, 2008, at 7:22 AM, Eric Lease Morgan wrote:

> Can KinoSearch (version 0.162) read Lucene (version 2.3.0) indexes?  
> At first glance, it seems the answer is no.

The only release of KS that could read a Lucene (version 1.4.3) index  
was 0.05, and that was only for pure ASCII source material.

The Lucene file format is gnarly -- it uses the illegal aberration  
"modified UTF-8" for text encoding, it's compromised by exceedingly  
complex optimizations, etc.  The format wasn't originally designed to  
be public; the spec was published as an afterthought.  Developments  
since 1.4.3 have made it even harder to work with.
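
To illustrate the encoding complaint, here is a minimal Python sketch. It
assumes that "modified UTF-8" means the Java variant (the one produced by
Java's DataOutput.writeUTF), which differs from standard UTF-8 in two ways:
U+0000 is written as the two bytes C0 80, and characters above U+FFFF are
written as a UTF-16 surrogate pair with each half encoded as three bytes.
The function name encode_modified_utf8 is made up for this example and is
not part of KinoSearch or Lucene, and the sketch does not cover the length
prefix that a real index file would put in front of each string.

```python
def encode_modified_utf8(text: str) -> bytes:
    """Encode text the way Java-style "modified UTF-8" would (illustrative only)."""
    out = bytearray()
    for ch in text:
        cp = ord(ch)
        if cp == 0:
            # Standard UTF-8 writes a single 00 byte; modified UTF-8 uses
            # the overlong two-byte form so the output never contains 00.
            out += b"\xc0\x80"
        elif cp >= 0x10000:
            # Standard UTF-8 uses one four-byte sequence here. Modified
            # UTF-8 splits the code point into a UTF-16 surrogate pair and
            # encodes each surrogate as its own three-byte sequence.
            cp -= 0x10000
            high = 0xD800 | (cp >> 10)
            low = 0xDC00 | (cp & 0x3FF)
            for unit in (high, low):
                out += bytes([
                    0xE0 | (unit >> 12),
                    0x80 | ((unit >> 6) & 0x3F),
                    0x80 | (unit & 0x3F),
                ])
        else:
            # Everything else matches standard UTF-8. "surrogatepass" lets a
            # lone surrogate already present in the str through unchanged.
            out += ch.encode("utf-8", "surrogatepass")
    return bytes(out)


if __name__ == "__main__":
    for sample in ("abc", "\x00", "\U0001F600"):
        print(repr(sample),
              "standard:", sample.encode("utf-8").hex(),
              "modified:", encode_modified_utf8(sample).hex())
```

For pure ASCII text without NUL characters the two encodings produce
identical bytes, which fits with the ASCII-only restriction mentioned above.
A strict UTF-8 decoder will reject both the C0 80 form and the encoded
surrogates.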

Marvin Humphrey
Rectangular Research
http://www.rectangular.com/



_______________________________________________
KinoSearch mailing list
KinoSearch at rectangular.com
http://www.rectangular.com/mailman/listinfo/kinosearch