[ https://issues.apache.org/jira/browse/FOR-677?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Tim Williams updated FOR-677: ----------------------------- Fix Version/s: (was: 0.9-dev) 0.10 Moving to next release. > leading slash in gathered URIs causes double the number of links to be > processed > -------------------------------------------------------------------------------- > > Key: FOR-677 > URL: https://issues.apache.org/jira/browse/FOR-677 > Project: Forrest > Issue Type: Bug > Components: Core operations > Affects Versions: 0.7, 0.8 > Reporter: David Crossley > Fix For: 0.10 > > > Doing 'forrest' starts at the virtual document called linkmap.html where the > Cocoon crawler gathers the initial set of links, then starts crawling and > generating pages. Any new links are pushed onto the linkmap. However, for > some sites, such as our own "seed-sample" and our "site-author", there is a > sudden jump in the number of URIs remaining to be processed. > This is due to a URI with a leading slash (e.g. /samples/faq.html). When that > URI is processed, it gains a whole new set of links all with leading slashes, > and so the list of URIs is potentially doubled. > This issue could be due to a user error, i.e. adding a link that deliberately > begins with a slash. Sometimes, that is unavoidable. > However, we do have a sitemap transformer to "relativize" and "absolutize" > the links. Should it always trim the leading slash? Or are there cases where > that should not happen, so cannot generalise? -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.