Hi Phil,
The code is trying to extract the name of the list item from the URL here,
and failing to see what it expects. Here's the code:
>>>>>>
// If it has no view url, we don't have any idea what to do with
it
if (urlPath != null && urlPath.length() > 0)
{
// Normalize conditionally
if (!urlPath.startsWith("/"))
urlPath = prefixPath + urlPath;
// Get rid of what we don't want, unconditionally
if (urlPath.startsWith(prefixPath))
{
urlPath = urlPath.substring(prefixPath.length());
// We're at the /Lists/listname part of the name. Figure out
where the end of it is.
int index = urlPath.indexOf("/");
if (index == -1)
throw new ManifoldCFException("Bad list view url without
site: '"+urlPath+"'");
String pathpart = urlPath.substring(0,index);
if("Lists".equals(pathpart))
{
int k = urlPath.indexOf("/",index+1);
if (k == -1)
throw new ManifoldCFException("Bad list view url without
'Lists': '"+urlPath+"'");
pathpart = urlPath.substring(index+1,k);
}
if ( pathpart.length() != 0 && !pathpart.equals("_catalogs"))
{
if (title == null || title.length() == 0)
title = pathpart;
result.add( new NameValue(pathpart, title) );
}
<<<<<<
Basically, the URL field is coming back containing just "default.aspx",
which does not have the expected prefix "/Lists/<listname>/..." on it, and
that is confusing the parser.
What version of SharePoint are you crawling? Also, if you can turn on
connector debugging, I'd love to see the output of this debug statement:
>>>>>>
// Leave this in for the moment
if (Logging.connectors.isDebugEnabled())
Logging.connectors.debug("SharePoint: List: '"+urlPath+"',
'"+title+"'");
<<<<<<
Thanks,
Karl
On Tue, May 3, 2016 at 8:32 PM, Phil Riethmuller <
[email protected]> wrote:
> Hi,
>
> I'm using Manifold 2.3 using the single-process deployable war, and am
> trying to index a Sharepoint 2010 repository. I’m receiving the following
> error which is causing the crawl to fail:
>
> ERROR 2016-04-29 10:50:25,985 (Worker thread '13') system.WorkerThread -
> Exception tossed: Bad list view url without site: 'default.aspx'
>
> org.apache.manifoldcf.core.interfaces.ManifoldCFException: Bad list view
> url without site: 'default.aspx'
>
> at
> org.apache.manifoldcf.crawler.connectors.sharepoint.SPSProxyHelper.getLists(SPSProxyHelper.java:2524)
>
> at
> org.apache.manifoldcf.crawler.connectors.sharepoint.SharePointRepository.processDocuments(SharePointRepository.java:1587)
>
> at
> org.apache.manifoldcf.crawler.system.WorkerThread.run(WorkerThread.java:399)
>
> Are there any suggestions on the best approach to resolve this?
>
> Thanks,
> Phil
>