Hi All,
I am implementing my own java client to crawl file system resources via the ManifoldCF JSON based API. I have been able to define and run a job to crawl a file system repository and output to a file system destination. The trouble I'm having currently is to be able to know which documents have been crawled via the ManifoldCF API. I have looked through the API documentation on the ManifoldCF release pages, but I'm unable to find this information. Could someone point me in the right direction? When I try to use java API for file system monitoring (to check on the contents of the output folder), I'm having issues with the files being locked during the execution of the job. Therefore, I need to query ManifoldCF engine to understand what documents have been changed in the output area so that I can run my file system monitoring code on a different schedule. Please let me know if I didn't explain myself well here. Thanks, Pranesh Pranesh Vadhirajan
