[jira] [Commented] (CONNECTORS-1612) Postpone files in SMBException

Julien Massiera (JIRA) Wed, 19 Jun 2019 02:23:53 -0700


    [ 
https://issues.apache.org/jira/browse/CONNECTORS-1612?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16867429#comment-16867429
 ]


Julien Massiera commented on CONNECTORS-1612:
---------------------------------------------

Thanks [[email protected]] for the fix. Can't we make this retry time a 
configurable parameter ? Would be valuable since some jobs may last longer than 
others. And concerning my suggestion to increase the time at each retry, can't 
we apply a formula at each retry ? like next retry = retry time * retry number 
? Would give more flexibility than just a fix value.

> Postpone files in SMBException
> ------------------------------
>
>                 Key: CONNECTORS-1612
>                 URL: https://issues.apache.org/jira/browse/CONNECTORS-1612
>             Project: ManifoldCF
>          Issue Type: Improvement
>          Components: JCIFS connector
>    Affects Versions: ManifoldCF 2.12
>            Reporter: Julien Massiera
>            Assignee: Karl Wright
>            Priority: Critical
>             Fix For: ManifoldCF 2.14
>
>
> When crawling using the jcifs connector, some unexpected errors may trigger a 
> class "SMBException" which is caught by MCF.
> The current behavior for the job is to abort after a few retry.
> Although it is a generic class of SMBException, we consider that it is worth 
> before aborting the job, to postpone the concerned problematic files and try 
> the ones already in the pipe before aborting. This way, the job can move on 
> before developers have to study the particular problems. More precisely, the 
> algorithm could look like the following:
> Whenever a job encounters an error that is not clearly identified :
> 1. It immediately retries one time; 
> 2. If it succeeds, the crawl moves on as usual; 
> 3. If it fails, the job moves this document to the current end of the 
> processing pipeline, and crawls the remaining documents. It increments the 
> counter of tentative for this document to 2.
> 4. When encountering this document again, the job tries again. If it 
> succeeds, the crawl moves on as usual. If it fails, it moves this document to 
> the current end of the processing pipeline, increment the counter of 1, and 
> doubles the delay between two tentatives.
> 5. We iterate until the maximum number of tentatives of the crawl for the 
> problematic document has been reached. If it fails, abort the crawl. With 
> this behavior, a job is finally aborted on critical errors but at least we 
> will be able to crawl a maximum number of non problematic documents till the 
> failure.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

[jira] [Commented] (CONNECTORS-1612) Postpone files in SMBException

Reply via email to