This is related to http://www.namazu.org/trac-namazu/trac.cgi/ticket/9 - agreed it is not a Namazu bug but we would appreciate some help to make Namazu work in our huge archive.
We're currently missing a script to keep the indexes in sync regularly. Where can we find an example of how to use namazu to only index content added/changed since the last run (or how to remove content that had been removed). We would like to get Namazu hooked up to scan for recent changes (perhaps every 3-4hrs, to 'find' in the archives and feed that into Namazu to update the indexes).
It's probably not a particularly difficult script to write, but if it's already available that would be great.
More comments from the sysdamin that installed Namazu:
"It looks like when I set up namazu back in April 2004, I ran the initial indexing script, but never hooked up anything to keep it indexed. I've checked the scripts that I found in ~mailman/namazu into 'namazu' module in sysadmin CVS, but they only seem to handle generating the initial indexes. As yet there doesn't appear to be a script capable of periodically checking for recent content, and indexing just that. I have re-run the index generation (so all content up to 21-Aug-2005 should get indexed), and I have started a script for this, and will check it into namazu module when I've had a chance to test it. Until then, the indexes will likely remain static."
Thank you.