[jira] [Commented] (NUTCH-1465) Support sitemaps in Nutch

2013-12-16 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1465?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13848998#comment-13848998 ] Sebastian Nagel commented on NUTCH-1465: Let's add use case C: *(C) inject URLs

[jira] [Updated] (NUTCH-1681) In URLUtil.java, toUNICODE method does not work correctly

2013-12-16 Thread İlhami KALKAN (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] İlhami KALKAN updated NUTCH-1681: - Attachment: (was: toUnicode.patch) In URLUtil.java, toUNICODE method does not work

[jira] [Updated] (NUTCH-1681) In URLUtil.java, toUNICODE method does not work correctly

2013-12-16 Thread İlhami KALKAN (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] İlhami KALKAN updated NUTCH-1681: - Attachment: toUnicode.patch Hi Lewis, I don't understand what you mean, but I explain why I

[jira] [Updated] (NUTCH-1681) In URLUtil.java, toUNICODE method does not work correctly

2013-12-16 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma updated NUTCH-1681: - Fix Version/s: 1.9 In URLUtil.java, toUNICODE method does not work correctly

[jira] [Updated] (NUTCH-1681) In URLUtil.java, toUNICODE method does not work correctly

2013-12-16 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma updated NUTCH-1681: - Attachment: NUTCH-1681-1.8.patch Patch for trunk with simple unit test. This seems fine to me.

[jira] [Commented] (NUTCH-1681) In URLUtil.java, toUNICODE method does not work correctly

2013-12-16 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1681?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13849135#comment-13849135 ] Sebastian Nagel commented on NUTCH-1681: Hi [~markus17], +1 but shouldn't we spend

[jira] [Commented] (NUTCH-1325) HostDB for Nutch

2013-12-16 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13849143#comment-13849143 ] Markus Jelsma commented on NUTCH-1325: -- Hi Tejas, (1): Current mapper is: {code}

[jira] [Updated] (NUTCH-1681) In URLUtil.java, toUNICODE method does not work correctly

2013-12-16 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma updated NUTCH-1681: - Attachment: NUTCH-1681-1.8.patch Ah, I forgot to remove the System.out; I put it there

[jira] [Updated] (NUTCH-1321) IDNNormalizer

2013-12-16 Thread İlhami KALKAN (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1321?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] İlhami KALKAN updated NUTCH-1321: - Attachment: idnNormalizer.patch I added a patch file. Non-ASCII URLs are converted to Punycode by