[
https://issues.apache.org/jira/browse/CONNECTORS-1562?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16709688#comment-16709688
]
Karl Wright edited comment on CONNECTORS-1562 at 12/5/18 7:35 AM:
------------------------------------------------------------------
Hi [~SteenTi], I see this is the web connector. Can you tell me what kind of
crawl you are doing? If this is a continuous crawl, or you kicked it off with
"Start minimal", that's expected.
was (Author: [email protected]):
Hi [~SteenTi], can you tell me what repository connector you are using, and
what kind of crawl you are doing? If this is a continuous crawl, or you kicked
it off with "Start minimal", that's expected with most repository connectors.
But in any case t's the repository connector that determines what happens and
how deletions are found.
> Document removal Elastic
> ------------------------
>
> Key: CONNECTORS-1562
> URL: https://issues.apache.org/jira/browse/CONNECTORS-1562
> Project: ManifoldCF
> Issue Type: Bug
> Components: Elastic Search connector, Web connector
> Affects Versions: ManifoldCF 2.11
> Environment: Manifoldcf 2.11
> Elasticsearch 6.3.2
> Web inputconnector
> elastic outputconnecotr
> Job crawls website input and outputs content to elastic
> Reporter: Tim Steenbeke
> Assignee: Karl Wright
> Priority: Critical
> Labels: starter
> Original Estimate: 4h
> Remaining Estimate: 4h
>
> My documents aren't removed from ElasticSearch index after rerunning the
> changed seeds
> I update my job to change the seedmap and rerun it or use the schedualer to
> keep it runneng even after updating it.
> After the rerun the unreachable documents don't get deleted.
> It only adds doucments when they can be reached.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)