[ https://issues.apache.org/jira/browse/NUTCH-269?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Julien Nioche resolved NUTCH-269.
---------------------------------

    Resolution: Fixed
    Fix Version/s: 1.1

Committed revision 897180

> CrawlDbReducer: OOME because no upper-bound on inlinks count
> ------------------------------------------------------------
>
>                 Key: NUTCH-269
>                 URL: https://issues.apache.org/jira/browse/NUTCH-269
>             Project: Nutch
>          Issue Type: Bug
>            Reporter: stack
>            Assignee: Julien Nioche
>            Priority: Trivial
>             Fix For: 1.1
>
>         Attachments: too-many-links.patch, too-many-links2.patch
>
>
> A CrawlDB update repeatedly OOME'd because a URL had hundreds of thousands
> of inlinks (the British Foreign Office likes putting a clear.gif multiple
> times into each page:
> http://www.fco.gov.uk/Xcelerate/graphics/images/fcomain/clear.gif).

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.