Hi All,

I am implementing my own Java client to crawl file system resources via the
ManifoldCF JSON-based API.  So far I have been able to define and run a job
that crawls a file system repository and outputs to a file system
destination.

My current problem is finding out, via the ManifoldCF API, which documents
have been crawled.  I have looked through the API documentation on the
ManifoldCF release pages, but I cannot find this information.  Could someone
point me in the right direction?

When I use the Java API for file system monitoring to check the contents of
the output folder, I run into problems because the files are locked while
the job is executing.  I therefore need to query the ManifoldCF engine to
find out which documents have changed in the output area, so that I can run
my file system monitoring code on a different schedule.
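
To make the locking issue concrete, below is a minimal sketch of a check
for whether a file in the output folder can currently be locked.  It is an
illustration only: the class name OutputFileProbe and the method isFree are
hypothetical, and it assumes the writer holds an OS-level file lock or an
exclusive handle while writing, which is not confirmed for ManifoldCF.

```java
import java.io.IOException;
import java.nio.channels.FileChannel;
import java.nio.channels.FileLock;
import java.nio.channels.OverlappingFileLockException;
import java.nio.file.Path;
import java.nio.file.StandardOpenOption;

// Hypothetical helper, not part of ManifoldCF or of any existing client code.
public class OutputFileProbe {

    /**
     * Returns true if an exclusive lock on the file can be acquired right now,
     * i.e. no other writer appears to be holding it.
     */
    public static boolean isFree(Path file) {
        try (FileChannel channel = FileChannel.open(file, StandardOpenOption.WRITE)) {
            // tryLock() returns null when another process holds an overlapping lock.
            FileLock lock = channel.tryLock();
            if (lock == null) {
                return false;
            }
            lock.release();
            return true;
        } catch (OverlappingFileLockException | IOException e) {
            // Lock already held inside this JVM, or the file could not be opened.
            return false;
        }
    }
}
```

A probe like this only reports whether a file is busy at that moment; it
does not say which documents a job has touched, which is why the question
above is about getting that information from ManifoldCF itself.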

 

Please let me know if anything here is unclear.

Thanks,
Pranesh

 

Pranesh Vadhirajan

 
