[KinoSearch] How do you index ms office (.doc, .xls, .ppt) files with kinosearch

Ben Aurel ben.aurel at gmail.com
Mon Aug 25 04:12:25 PDT 2008



hi
I've read through most of the documentation trying to understand which
file types KS supports. There is an interesting OSCON presentation at
http://www.rectangular.com/downloads/KinoSearch_OSCON2006.pdf, where
you can find this statement on page 13:

What is KinoSearch not?
...
- Not a file parser
...

So if I get this right, KinoSearch doesn't care about your .doc, .xls,
.ppt files. As much as I personally try to avoid these formats, I think
it's realistic to assume that you have to index such files when
creating something like an intranet search.

My question is, what would you suggest for indexing Office formats?
How do you extract text without OLE and an Office installation on
the server?

thanks in advance
ben

_______________________________________________
KinoSearch mailing list
KinoSearch at rectangular.com
http://www.rectangular.com/mailman/listinfo/kinosearch



