[KinoSearch] Index cleared when using add_doc, delete_by_term, et al.

Darian Anthony Patrick darian at criticode.com
Tue Mar 31 17:59:40 PDT 2009


Hello all,

I'm using KinoSearch-0.20_051 and I have an application that loops 
through an RSS feed and indexes each entry, md5summing the title and 
body and storing that in the index as well.  On subsequent invocations, 
my indexing script does a search on the existing index for the unique id 
associated with each entry in the RSS feed, retrieves the entry from the 
index if it has been indexed previously, then checks the stored md5sum 
for the title and body against the md5sum for the title and body of the 
current entry to determine if it has changed.  If it has changed, then I 
call

$invindexer->delete_by_term(
	'listing_id' => $listing{'listing_id'}
);

to delete the entry, and later

$invindexer->add_doc(\%listing);

to create the new entry.  I do this in a loop over all entries in the 
RSS feed.  I'm seeing behavior where occasionally my entire index gets 
totally blown away with only entries created during the current 
invocation of my indexing script existing in the index.

Is this a known issue?  Is this what "Fix delete_by_term behavior after 
add_doc is called in InvIndexer" means on 
http://www.marvinhumphrey.com/kinosearch/wiki/ToDoList?

Any insight would be much appreciated.

Thanks,

Darian

-- 
Darian Anthony Patrick, Criticode LLC
Office:     (215) 789-9956
Facsimile:  (866) 789-2992
XMPP/SMTP:  darian at criticode.com
Web:        http://criticode.com
=================================================
BCF1 E7AD 15AD 8A99 F613 AF5F 2A9C C45C F580 E087
=================================================



More information about the kinosearch mailing list