[ 
https://issues.apache.org/jira/browse/CONNECTORS-1580?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16766799#comment-16766799
 ] 

Karl Wright commented on CONNECTORS-1580:
-----------------------------------------

You are on your own here.  You are trying to use it as a queuing engine, not an 
incremental indexer.  You have not thought this out properly, clearly, because 
that's not what addSeedDocuments() does.  So you must come up with a version 
string computation that reflects the fact that your documents have changed and 
need to be reconsidered.  It will have to directly reference whatever external 
queue you are using to stuff changed documents in.

You should maybe start by reading the book.  It's free.  Here:  
https://github.com/DaddyWri/manifoldcfinaction/tree/master/pdfs



> Issues in documentum connector
> ------------------------------
>
>                 Key: CONNECTORS-1580
>                 URL: https://issues.apache.org/jira/browse/CONNECTORS-1580
>             Project: ManifoldCF
>          Issue Type: Bug
>            Reporter: Pavithra Dhakshinamurthy
>            Priority: Blocker
>         Attachments: Job_Scheduling.png
>
>
> Hi Team,
>  We are facing below issues in apache manifold documentum connector version 
> 2.9.1.kindly help us. 
>  1.During the first run of the job,documents are getting indexed to 
> ElasticSearch.If the same job is run after the completion,records are getting 
> seeded,processed but not updated to output connector.Once the document id is 
> indexed,same document id is not able to update it again in the same job. 
>    
>  2.We have scheduled incremental crawling for every 15 mins and document 
> count will vary for every 15 mins. But in seeding it is not resetting the 
> document count,once the job is completed.It's getting added to last scheduled 
> job count.
>    eg.1st schedule-10 documents 
>       2nd schedule-5 documents 
> In the 2nd scheduled of the job,the document count should be 5,but it is 
> having document count as 15. so it is keep on adding the dcouments id for 
> every schedule and it is processing



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to