[ 
https://issues.apache.org/jira/browse/CONNECTORS-501?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13430557#comment-13430557
 ] 

Karl Wright commented on CONNECTORS-501:
----------------------------------------

I created a CONNECTORS-501 branch to work on this ticket.  I've checked in code 
that should move documents from "active" into "activerescanneeded" if their 
hopcount situation changes during processing.  The deletion path still needs to 
account for this change, however: documents that are in "activerescanneeded" 
must be put back into "active" rather than actually deleted.  Because the same 
jobManager deletion method is used in many places, I may wind up creating a new 
jobManager method meant to work only in the context of an active document.
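
The state handling described above can be illustrated with a minimal sketch. 
This is not the ManifoldCF code: the class, enum, and method names below are 
hypothetical, and an in-memory map stands in for the real job queue. It only 
shows the intended rule, namely that a hopcount change during processing marks 
an "active" document as "activerescanneeded", and that a delete issued in the 
context of an active document puts such a document back into "active" instead 
of removing it.

```java
import java.util.HashMap;
import java.util.Map;

// Illustrative only: all names here are assumptions, not ManifoldCF APIs.
public class HopcountStateSketch {
    // Hypothetical subset of queue states: just the two named in the comment.
    enum DocState { ACTIVE, ACTIVE_RESCAN_NEEDED }

    // Hypothetical stand-in for the job queue: document id -> current state.
    private final Map<String, DocState> queue = new HashMap<>();

    // A worker picks up a document and begins processing it.
    void startProcessing(String docId) {
        queue.put(docId, DocState.ACTIVE);
    }

    // The document's hopcount situation changed while it was being processed.
    void noteHopcountChanged(String docId) {
        if (queue.get(docId) == DocState.ACTIVE) {
            queue.put(docId, DocState.ACTIVE_RESCAN_NEEDED);
        }
    }

    // Deletion meant only for the context of an active document.
    // Returns true if the document was actually removed.
    boolean deleteActiveDocument(String docId) {
        DocState state = queue.get(docId);
        if (state == DocState.ACTIVE_RESCAN_NEEDED) {
            // Do not delete: put it back into "active" so it is rescanned.
            queue.put(docId, DocState.ACTIVE);
            return false;
        }
        if (state == DocState.ACTIVE) {
            queue.remove(docId);
            return true;
        }
        // Unknown document: nothing to do.
        return false;
    }

    // Returns the current state, or null if the document is not queued.
    DocState stateOf(String docId) {
        return queue.get(docId);
    }
}
```

Keeping this rule in a separate method, rather than changing a shared deletion 
method, leaves the other callers of the general delete unaffected, which is the 
motivation given above for a dedicated jobManager method.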


                
> Medium-scale web crawl with hopcount-based filtering fails to find correct 
> number of documents
> ----------------------------------------------------------------------------------------------
>
>                 Key: CONNECTORS-501
>                 URL: https://issues.apache.org/jira/browse/CONNECTORS-501
>             Project: ManifoldCF
>          Issue Type: Bug
>          Components: Framework agents process, Web connector
>    Affects Versions: ManifoldCF 0.6
>            Reporter: Karl Wright
>            Assignee: Karl Wright
>             Fix For: ManifoldCF 0.7
>
>
> The new web crawler Postgresql load test, which uses hopcount-based 
> filtering, does not discover all 11110 documents it is supposed to.  It only 
> discovered 10603 when I ran it just now.
