Hi Karl, thanks for your clarification.
I’m not changing any document specification information. I just set “Scheduled time” and “Job invocation” on “Scheduling” tab, “Start method” on “Connection” tab and click “Save” button. That’s all. I tried to set all the scheduling information directly in Postres database to be sure I didn’t change any document specification information and the result was the same, all documents were recrawled. One more thing I tried was to update “seedingversion” in “jobs” table but again all documents were recrawled. Thanks, Radko From: Karl Wright <[email protected]<mailto:[email protected]>> Reply-To: "[email protected]<mailto:[email protected]>" <[email protected]<mailto:[email protected]>> Date: Friday 1 April 2016 at 14:30 To: "[email protected]<mailto:[email protected]>" <[email protected]<mailto:[email protected]>> Subject: Re: Scheduled ManifoldCF jobs Sorry, that response was *almost* incoherent. :-) Trying again: As far as how MCF computes incremental changes, it does not matter whether a job is run on schedule, or manually. But if you change certain aspects of the job, namely the document specification information, MCF "starts over" at the beginning of time. It needs to do that because you might well have made changes to the document specification that could change the way documents are indexed. Thanks, Karl On Fri, Apr 1, 2016 at 6:36 AM, Karl Wright <[email protected]<mailto:[email protected]>> wrote: Hi Radko, For computing how MCF does job crawling, it does not care whether the job is run manually or by schedule. The issue is likely to be that you changed some other detail about the job definition that might have affected how documents are indexed. In that case, MCF would cause all documents to be recrawled because of that. Changes to a job's document specification information will cause that to be the case. Thanks, Karl On Fri, Apr 1, 2016 at 3:40 AM, Najman, Radko wrote: Hello, I have a few jobs crawling documents from Documentum. Some of these jobs are quite big and the first run of the job takes a few hours or a day to finish. Then, when I do a “minimal run” for updates, the job is usually done in a few minutes. I want to schedule these jobs for daily runs. I’m experiencing that the first scheduled run takes the same time as I ran the job for the first time manually. It seems it is recrawling all documents. Next scheduled runs are fast, a few minutes. Is it expected behaviour? I would expect the first scheduled run to be fast too because the job was already finished before by manual start. Is there a way how to don’t recrawl all documents in this case, it’s really time consuming operation. My settings: Schedule type: Scan every document once Job invocation: Minimal Scheduled time: once a day Start method: Start when schedule window starts Thank you, Radko Notice: This e-mail message, together with any attachments, contains information of Merck & Co., Inc. (2000 Galloping Hill Road, Kenilworth, New Jersey, USA 07033), and/or its affiliates Direct contact information for affiliates is available at http://www.merck.com/contact/contacts.html) that may be confidential, proprietary copyrighted and/or legally privileged. It is intended solely for the use of the individual or entity named on this message. If you are not the intended recipient, and have received this message in error, please notify us immediately by reply e-mail and then delete it from your system.
