I've checked in a fix for this for trunk. Looks like there is no sitepath prepended to the answers; they are always relative to the library. Please synch up and try again.
Karl On Thu, Sep 6, 2012 at 2:22 PM, Piergiorgio Lucidi <[email protected]> wrote: > I tried now upgrading the plugin on SharePoint and rebuilding the connector > from the trunk but it returns me this exception: > > ERROR 2012-09-06 20:20:00,993 (Worker thread '41') - Exception tossed: > Internal error: Relative path 'Library Custom/actions-article-v2.pdf' was > expected to start with '/my/personal/administrator/demosite' > org.apache.manifoldcf.core.interfaces.ManifoldCFException: Internal error: > Relative path 'Library Custom/actions-article-v2.pdf' was expected to start > with '/my/personal/administrator/demosite' > at > org.apache.manifoldcf.crawler.connectors.sharepoint.SPSProxyHelper.getChildren(SPSProxyHelper.java:655) > at > org.apache.manifoldcf.crawler.connectors.sharepoint.SharePointRepository.processDocuments(SharePointRepository.java:1303) > at > org.apache.manifoldcf.crawler.connectors.BaseRepositoryConnector.processDocuments(BaseRepositoryConnector.java:423) > at > org.apache.manifoldcf.crawler.system.WorkerThread.run(WorkerThread.java:561) > > It seems that there is an issue about the URL again. > > Hope this helps. > Piergiorgio > > 2012/9/6 Karl Wright <[email protected]> > >> I checked in a fix for the pagination issue in >> integration/sharepoint-2010/trunk. Care to build the plugin, deploy >> it, and let me know if it works now? >> >> Karl >> >> >> On Thu, Sep 6, 2012 at 1:26 PM, Karl Wright <[email protected]> wrote: >> > I conclude that the plugin is not handling paging properly - there's >> > no other explanation. So I am canceling the vote and will try to >> > check in a fix. >> > >> > Karl >> > >> > On Thu, Sep 6, 2012 at 1:11 PM, Karl Wright <[email protected]> wrote: >> >> It looks like two problems here. First, it looks like Solr is >> >> throwing a 500 error for at least one of the documents in your set. >> >> >> >> However, the fact that you only get 1000 documents indexed also shows >> >> that the code is still broken in some way. I will check into whether >> >> this looks like a problem in the connector or in the plugin. >> >> >> >> Karl >> >> >> >> On Thu, Sep 6, 2012 at 1:06 PM, Ahmet Arslan <[email protected]> wrote: >> >>> Hi Karl, >> >>> >> >>> With a document library that 7,888 items, I setup a crawl with >> mcf-trunk. Sometimes I get this exception : Error: Repeated service >> interruptions - failure processing document: Ingestion HTTP error code 500 >> >>> >> >>> If i don't get exception only 1000 docs are indexed. >> >>> >> >>> ERROR 2012-09-06 19:55:13,587 (Worker thread '30') - Exception tossed: >> Repeated service interruptions - failure processing document: Ingestion >> HTTP error code 500 >> >>> org.apache.manifoldcf.core.interfaces.ManifoldCFException: Repeated >> service interruptions - failure processing document: Ingestion HTTP error >> code 500 >> >>> at >> org.apache.manifoldcf.crawler.system.WorkerThread.run(WorkerThread.java:585) >> >>> Caused by: org.apache.manifoldcf.core.interfaces.ManifoldCFException: >> Ingestion HTTP error code 500 >> >>> at >> org.apache.manifoldcf.agents.output.solr.HttpPoster$IngestThread.run(HttpPoster.java:1386) >> >>> >> >>> Ahmet >> >>> --- On Wed, 9/5/12, Karl Wright <[email protected]> wrote: >> >>> >> >>>> From: Karl Wright <[email protected]> >> >>>> Subject: [VOTE] Release Apache ManifoldCF SharePoint 2010 plugin 0.1 >> RC0 >> >>>> To: "dev" <[email protected]> >> >>>> Date: Wednesday, September 5, 2012, 11:52 PM >> >>>> Vote +1 if you think the Apache >> >>>> ManifoldCF SharePoint 2010 plugin 0.1 >> >>>> RC0 is ready for release. >> >>>> >> >>>> The release artifact can be found at: >> >>>> >> http://people.apache.org/~kwright/apache-manifoldcf-sharepoint-2010-plugin-0.1 >> >>>> . >> >>>> >> >>>> There is also a release tag at: >> >>>> >> https://svn.apache.org/repos/asf/manifoldcf/integration/sharepoint-2010/tags/release-0.1-RC0 >> >>>> >> >>>> Karl >> >>>> >> >> -- >> Piergiorgio Lucidi >> http://www.open4dev.com >> >>
