[KinoSearch] utf8 warnings/error

Scott Beck scottbeck at gmail.com
Fri Aug 24 14:34:29 PDT 2007



Hi,

I have a reproduceable test case that seems to corrupt up the index.
You can download the test case here:
  http://devmagic.org/kinotest.tar.gz

Or you can browse it here:
  http://devmagic.org/kinotest

test.pl is the code. The data file is a data dump of a bunch of
emails, test.pl inserts that data.

The kino file is the command line tool I've been using to do simple
test queries with, it has a db path hardcoded at the top, so if you
want to play with that you will need to modify it (it's unrelated to
the test case).

Running test.pl on my system gives these results:
Hits for flag_deleted:"0": 58
Hits for path:"./cur/1182974671.2581.1.vmware.nmsrv.com,S=11093": 1
Removed ./cur/1182974671.2581.1.vmware.nmsrv.com,S=11093
Hits for path:"./cur/1182974671.2581.1.vmware.nmsrv.com,S=11093": 0
Hits for flag_deleted:"0": 44
Adding ./cur/1182974671.2581.1.vmware.nmsrv.com,S=11093
Hits for path:"./cur/1182974671.2581.1.vmware.nmsrv.com,S=11093": 1
Hits for flag_deleted:"0": 45
Hits for path:"./cur/1182974671.2581.1.vmware.nmsrv.com,S=11093": 1
Removed ./cur/1182974671.2581.1.vmware.nmsrv.com,S=11093
Hits for path:"./cur/1182974671.2581.1.vmware.nmsrv.com,S=11093": 0
Hits for flag_deleted:"0": 44
Adding ./cur/1182974671.2581.1.vmware.nmsrv.com,S=11093
Hits for path:"./cur/1182974671.2581.1.vmware.nmsrv.com,S=11093": 1
Hits for flag_deleted:"0": 45
Hits for path:"./cur/1182974671.2581.1.vmware.nmsrv.com,S=11093": 1
Removed ./cur/1182974671.2581.1.vmware.nmsrv.com,S=11093
Hits for path:"./cur/1182974671.2581.1.vmware.nmsrv.com,S=11093": 0
Hits for flag_deleted:"0": 1
Adding ./cur/1182974671.2581.1.vmware.nmsrv.com,S=11093
Hits for path:"./cur/1182974671.2581.1.vmware.nmsrv.com,S=11093": 1
Hits for flag_deleted:"0": 2

I insert the data, then run the query flag_deleted:"0", hits is 58.
Then I remove one item from the index and run that query again, hits
is now 44. I repeat these additions/deletions 3 times. As you can see
the third time we end up with only 2 results.

Please let me know if you see a bug in my code/logic.

Thanks,

Scott

_______________________________________________
KinoSearch mailing list
KinoSearch at rectangular.com
http://www.rectangular.com/mailman/listinfo/kinosearch




More information about the kinosearch mailing list