[
https://issues.apache.org/jira/browse/CONNECTORS-1472?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Karl Wright updated CONNECTORS-1472:
------------------------------------
Attachment: CONNECTORS-1472.patch
This patch should address the problems seen.
> Confluence connector doesn't call activities.noDocument() properly
> ------------------------------------------------------------------
>
> Key: CONNECTORS-1472
> URL: https://issues.apache.org/jira/browse/CONNECTORS-1472
> Project: ManifoldCF
> Issue Type: Bug
> Components: Confluence connector
> Affects Versions: ManifoldCF 2.8.1
> Reporter: Karl Wright
> Assignee: Karl Wright
> Fix For: ManifoldCF 2.9
>
> Attachments: CONNECTORS-1472.patch
>
>
> During crawling, the Confluence connector in one installation is throwing the
> following exception:
> {code}
> java.lang.IllegalArgumentException: Unrecognized document identifier:
> 'att44634026'
> at
> org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.computePipelineSpecificationWithVersions(WorkerThread.java:2164)
> at
> org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.noDocument(WorkerThread.java:1627)
> at
> org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.noDocument(WorkerThread.java:1605)
> at
> org.apache.manifoldcf.crawler.connectors.confluence.ConfluenceRepositoryConnector.processPageInternal(ConfluenceRepositoryConnector.java:1078)
> at
> org.apache.manifoldcf.crawler.connectors.confluence.ConfluenceRepositoryConnector.processPageAsAttachment(ConfluenceRepositoryConnector.java:1012)
> at
> org.apache.manifoldcf.crawler.connectors.confluence.ConfluenceRepositoryConnector.processDocuments(ConfluenceRepositoryConnector.java:936)
> at
> org.apache.manifoldcf.crawler.system.WorkerThread.run(WorkerThread.java:399)
> WARN 2017-11-21 10:00:14,373 (Worker thread '111') - Exception: Unrecognized
> document identifier: 'att69240163'
> java.lang.IllegalArgumentException: Unrecognized document identifier:
> 'att69240163'
> at
> org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.computePipelineSpecificationWithVersions(WorkerThread.java:2164)
> at
> org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.noDocument(WorkerThread.java:1627)
> at
> org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.noDocument(WorkerThread.java:1605)
> at
> org.apache.manifoldcf.crawler.connectors.confluence.ConfluenceRepositoryConnector.processPageInternal(ConfluenceRepositoryConnector.java:1078)
> at
> org.apache.manifoldcf.crawler.connectors.confluence.ConfluenceRepositoryConnector.processPageAsAttachment(ConfluenceRepositoryConnector.java:1012)
> at
> org.apache.manifoldcf.crawler.connectors.confluence.ConfluenceRepositoryConnector.processDocuments(ConfluenceRepositoryConnector.java:936)
> at
> org.apache.manifoldcf.crawler.system.WorkerThread.run(WorkerThread.java:399)
> WARN 2017-11-21 10:00:14,379 (Worker thread '82') - Exception: Unrecognized
> document identifier: 'att56984899'
> java.lang.IllegalArgumentException: Unrecognized document identifier:
> 'att56984899'
> at
> org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.computePipelineSpecificationWithVersions(WorkerThread.java:2164)
> at
> org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.noDocument(WorkerThread.java:1627)
> at
> org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.noDocument(WorkerThread.java:1605)
> at
> org.apache.manifoldcf.crawler.connectors.confluence.ConfluenceRepositoryConnector.processPageInternal(ConfluenceRepositoryConnector.java:1078)
> at
> org.apache.manifoldcf.crawler.connectors.confluence.ConfluenceRepositoryConnector.processPageAsAttachment(ConfluenceRepositoryConnector.java:1012)
> at
> org.apache.manifoldcf.crawler.connectors.confluence.ConfluenceRepositoryConnector.processDocuments(ConfluenceRepositoryConnector.java:936)
> at
> org.apache.manifoldcf.crawler.system.WorkerThread.run(WorkerThread.java:399)
> WARN 2017-11-21 10:00:14,386 (Worker thread '47') - Exception: Unrecognized
> document identifier: 'att56986313'
> java.lang.IllegalArgumentException: Unrecognized document identifier:
> 'att56986313'
> at
> org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.computePipelineSpecificationWithVersions(WorkerThread.java:2164)
> at
> org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.noDocument(WorkerThread.java:1627)
> at
> org.apache.manifoldcf.crawler.system.WorkerThread$ProcessActivity.noDocument(WorkerThread.java:1605)
> at
> org.apache.manifoldcf.crawler.connectors.confluence.ConfluenceRepositoryConnector.processPageInternal(ConfluenceRepositoryConnector.java:1078)
> at
> org.apache.manifoldcf.crawler.connectors.confluence.ConfluenceRepositoryConnector.processPageAsAttachment(ConfluenceRepositoryConnector.java:1012)
> at
> org.apache.manifoldcf.crawler.connectors.confluence.ConfluenceRepositoryConnector.processDocuments(ConfluenceRepositoryConnector.java:936)
> at
> org.apache.manifoldcf.crawler.system.WorkerThread.run(WorkerThread.java:399)
> FATAL 2017-11-21 10:00:14,386 (Worker thread '132') - Error tossed: null
> java.lang.NullPointerException
> {code}
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)