Thanks Karl, I¹ll add additional details to the JIRA ticket.
Phil From: <user-return-4236-priethmuller=funnelback....@manifoldcf.apache.org> on behalf of Karl Wright <[email protected]> Reply-To: <[email protected]> Date: Wednesday, 4 May 2016 at 5:34 PM To: "[email protected]" <[email protected]> Subject: Re: Error: Bad list view url without site The ticket is CONNECTORS-1309. Karl On Wed, May 4, 2016 at 3:32 AM, Karl Wright <[email protected]> wrote: > Hi Phil, > > The code is trying to extract the name of the list item from the URL here, and > failing to see what it expects. Here's the code: > >>>>>>> >>>>>> > // If it has no view url, we don't have any idea what to do with it > if (urlPath != null && urlPath.length() > 0) > { > // Normalize conditionally > if (!urlPath.startsWith("/")) > urlPath = prefixPath + urlPath; > // Get rid of what we don't want, unconditionally > if (urlPath.startsWith(prefixPath)) > { > urlPath = urlPath.substring(prefixPath.length()); > // We're at the /Lists/listname part of the name. Figure out > where the end of it is. > int index = urlPath.indexOf("/"); > if (index == -1) > throw new ManifoldCFException("Bad list view url without site: > '"+urlPath+"'"); > String pathpart = urlPath.substring(0,index); > > if("Lists".equals(pathpart)) > { > int k = urlPath.indexOf("/",index+1); > if (k == -1) > throw new ManifoldCFException("Bad list view url without > 'Lists': '"+urlPath+"'"); > pathpart = urlPath.substring(index+1,k); > } > > if ( pathpart.length() != 0 && !pathpart.equals("_catalogs")) > { > if (title == null || title.length() == 0) > title = pathpart; > result.add( new NameValue(pathpart, title) ); > } > <<<<<< > > Basically, the URL field is coming back containing just "default.aspx", which > does not have the expected prefix "/Lists/<listname>/..." on it, and that is > confusing the parser. > > What version of SharePoint are you crawling? Also, if you can turn on > connector debugging, I'd love to see the output of this debug statement: > >>>>>>> >>>>>> > // Leave this in for the moment > if (Logging.connectors.isDebugEnabled()) > Logging.connectors.debug("SharePoint: List: '"+urlPath+"', > '"+title+"'"); > <<<<<< > > Thanks, > Karl > > > On Tue, May 3, 2016 at 8:32 PM, Phil Riethmuller <[email protected]> > wrote: >> Hi, >> >> I'm using Manifold 2.3 using the single-process deployable war, and am trying >> to index a Sharepoint 2010 repository. I¹m receiving the following error >> which is causing the crawl to fail: >> >> ERROR 2016-04-29 10:50:25,985 (Worker thread '13') system.WorkerThread - >> Exception tossed: Bad list view url without site: 'default.aspx' >> >> org.apache.manifoldcf.core.interfaces.ManifoldCFException: Bad list view url >> without site: 'default.aspx' >> >> at >> org.apache.manifoldcf.crawler.connectors.sharepoint.SPSProxyHelper.getLists(S >> PSProxyHelper.java:2524) >> >> at >> org.apache.manifoldcf.crawler.connectors.sharepoint.SharePointRepository.proc >> essDocuments(SharePointRepository.java:1587) >> >> at >> org.apache.manifoldcf.crawler.system.WorkerThread.run(WorkerThread.java:399) >> >> >> Are there any suggestions on the best approach to resolve this? >> >> Thanks, >> Phil >
