[KinoSearch] lucene indexes
Marvin Humphrey
marvin at rectangular.com
Sat Jan 26 07:41:03 PST 2008
On Jan 26, 2008, at 7:22 AM, Eric Lease Morgan wrote:
> Can KinoSearch (version 0.162) read Lucene (version 2.3.0) indexes?
> At first glance, it seems the answer is no.
No. The only release of KS that could read a Lucene index was 0.05; it
read the version 1.4.3 format, and only for pure ASCII source material.
The Lucene file format is gnarly -- it uses the illegal aberration
"modified UTF-8" for text encoding, it's compromised by exceedingly
complex optimizations, etc. The format wasn't originally designed to
be public; the spec was published as an afterthought. Developments
since 1.4.3 have made it even harder to work with.
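
To make the "modified UTF-8" point concrete, here is a minimal Python
sketch. It assumes the encoding in question is the one Java's
DataOutput.writeUTF produces: U+0000 is written as the two bytes C0 80,
and characters outside the Basic Multilingual Plane are written as two
3-byte surrogate halves rather than one 4-byte sequence. The function
name encode_modified_utf8 is hypothetical and is not part of KinoSearch
or Lucene.

```python
def encode_modified_utf8(text: str) -> bytes:
    """Encode text the way Java's modified UTF-8 does (illustrative sketch)."""
    out = bytearray()
    # Java strings are sequences of UTF-16 code units, so encode each unit
    # separately. "surrogatepass" also lets lone surrogates through.
    utf16 = text.encode("utf-16-be", "surrogatepass")
    for i in range(0, len(utf16), 2):
        unit = (utf16[i] << 8) | utf16[i + 1]
        if 0x0001 <= unit <= 0x007F:
            # Plain ASCII except NUL: one byte, same as standard UTF-8.
            out.append(unit)
        elif unit <= 0x07FF:
            # Two bytes. NUL lands here too and becomes C0 80, an
            # overlong form that standard UTF-8 forbids.
            out.append(0xC0 | (unit >> 6))
            out.append(0x80 | (unit & 0x3F))
        else:
            # Three bytes. Surrogate halves are encoded here one at a
            # time, so a supplementary character takes six bytes.
            out.append(0xE0 | (unit >> 12))
            out.append(0x80 | ((unit >> 6) & 0x3F))
            out.append(0x80 | (unit & 0x3F))
    return bytes(out)


if __name__ == "__main__":
    for sample in ("A", "\x00", "\U0001F600"):
        print(repr(sample),
              "standard:", sample.encode("utf-8").hex(),
              "modified:", encode_modified_utf8(sample).hex())
```

For ASCII text without NUL bytes the two encodings agree byte for byte,
which is consistent with the pure ASCII restriction above. The other two
samples produce byte sequences that a strict UTF-8 decoder rejects.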
Marvin Humphrey
Rectangular Research
http://www.rectangular.com/
_______________________________________________
KinoSearch mailing list
KinoSearch at rectangular.com
http://www.rectangular.com/mailman/listinfo/kinosearch