[ https://issues.apache.org/jira/browse/FOR-703?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12632462#action_12632462 ]
Dmitriy Kargapolov commented on FOR-703: ---------------------------------------- I got similar issue which was fixed by patching book-to-cinclude-lucene.xsl: --- book-to-cinclude-lucene.xsl.orig 2007-04-14 22:42:01.000000000 -0400 +++ book-to-cinclude-lucene.xsl 2008-09-18 19:37:51.000000000 -0400 @@ -59,6 +59,10 @@ <!-- Ignore absolute http urls --> <xsl:template match="menu-item[starts-with(@href, 'https:')]"/> <!-- Ignore absolute https urls --> + <xsl:template match="menu-item[starts-with(@href, 'site:')]"/> +<!-- Ignore internal site: urls --> + <xsl:template match="menu-item[starts-with(@href, 'ext:')]"/> +<!-- Ignore external ext: urls --> <!-- For entries whose @href ends in "/", refer to @href/index.lucene --> <xsl:template match="menu-item[substring(@href, string-length(@href) - string-length('/') + 1) = '/']"> <cinclude:include> There is no much sense to try to index site: references (just duplicating info) or external links... Regards. > Our local search with Lucene gets failures at index time when remote files > are declared in site.xml > --------------------------------------------------------------------------------------------------- > > Key: FOR-703 > URL: https://issues.apache.org/jira/browse/FOR-703 > Project: Forrest > Issue Type: Bug > Components: Core operations, Other > Affects Versions: 0.8 > Reporter: David Crossley > > Our Lucene search, when building its index, follows our site.xml (linkmap) to > gather the files to be indexed. If the entries include a link to a remote > site, then Lucene build a wonky reference. > There are some examples in our site-author/content/site.xml for example ... > ---------- > <SVN label="Browse SVN" > href="http://svn.apache.org/viewcvs.cgi/forrest/trunk/"/> > ... causes error because it tries to index this URI ... > /svn/asf/forrest/site-author/./content/xdocs/http:/svn.apache.org/viewcvs.cgi/forrest/trunk/index.xml > ---------- -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.