Hello,

I’m seeing the following issue when crawling SharePoint 2013.

Manifold job gets terminated with an error when trying to fetch files that
are 'blocked' in SharePoint 2013.

This can happen when files of certain types are uploaded into SP and then
the file type (e.g. exe, dll, sp1) is added into the list of blocked file
types.

We tried excluding the blocked file types in the Paths rules, but we got
the same error.

Would it be possible to get Manifold skipping the files that are blocked by
SP setup and just log warnings/errors rather than completely abort the job?

Thanks,

Radek


ERROR 2014-09-08 11:52:50,005 (Worker thread '5') - Exception tossed: Error
fetching document 'http://sp2013/sites/demo/test/blocked%20files/tmp.ps1':
415
org.apache.manifoldcf.core.interfaces.ManifoldCFException: Error fetching
document ' http://sp2013/sites/demo/test/blocked %20files/tmp.ps1': 415
at
org.apache.manifoldcf.crawler.connectors.sharepoint.SharePointRepository.fetchAndIndexFile(SharePointRepository.java:1915)
at
org.apache.manifoldcf.crawler.connectors.sharepoint.SharePointRepository.processDocuments(SharePointRepository.java:1774)
at
org.apache.manifoldcf.crawler.connectors.BaseRepositoryConnector.processDocuments(BaseRepositoryConnector.java:677)
at
org.apache.manifoldcf.crawler.connectors.BaseRepositoryConnector.processDocuments(BaseRepositoryConnector.java:670)
at
org.apache.manifoldcf.crawler.connectors.BaseRepositoryConnector.processDocuments(BaseRepositoryConnector.java:649)
at
org.apache.manifoldcf.crawler.connectors.BaseRepositoryConnector.processDocuments(BaseRepositoryConnector.java:402)
at
org.apache.manifoldcf.crawler.system.WorkerThread.run(WorkerThread.java:380)

Reply via email to