[KinoSearch] utf8 warnings/error

Marvin Humphrey marvin at rectangular.com
Thu Aug 23 02:23:06 PDT 2007


On Aug 19, 2007, at 12:30 PM, Marvin Humphrey wrote:

>> I get the same errors and warnings with either of these inserted just
>> before the add_doc().
>
> OK, we'll have to figure out at what point what was valid UTF-8  
> became invalid UTF-8.  Can you please send me some spam?

Scott,

Thanks for sending me spam off-list.  Unfortunately/fortunately it  
seems to parse OK on my machine (stock Perl 5.8.6 on Mac OS X  
10.4.10).  An example script is attached (minus the spam).  Can you  
please fill in the blanks and try it out on your machine?

What version of Perl are you using?

Marvin Humphrey
Rectangular Research
http://www.rectangular.com/

-------------- next part --------------
A non-text attachment was scrubbed...
Name: analyze_spam.pl
Type: text/x-perl-script
Size: 852 bytes
Desc: not available
Url : http://rectangular.com/pipermail/kinosearch/attachments/20070823/6248450e/attachment-0001.bin 
-------------- next part --------------


-------------- next part --------------
_______________________________________________
KinoSearch mailing list
KinoSearch at rectangular.com
http://www.rectangular.com/mailman/listinfo/kinosearch


More information about the kinosearch mailing list