[jira] [Created] (NUTCH-1530) Umlauts (üäö) garbled when fetch and parse in separate calls (OK when fetcher.parse is true)

2013-02-12 Thread Edward Ackroyd (JIRA)
Edward Ackroyd created NUTCH-1530: - Summary: Umlauts (üäö) garbled when fetch and parse in separate calls (OK when fetcher.parse is true) Key: NUTCH-1530 URL: https://issues.apache.org/jira/browse/NUTCH-1530

[jira] [Commented] (NUTCH-1530) Umlauts (üäö) garbled when fetch and parse in separate calls (OK when fetcher.parse is true)

2013-02-12 Thread Roland (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1530?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13576898#comment-13576898 ] Roland commented on NUTCH-1530: --- Hi Edward, there must be another factor causing this,

[jira] [Commented] (NUTCH-1530) Umlauts (üäö) garbled when fetch and parse in separate calls (OK when fetcher.parse is true)

2013-02-12 Thread Edward Ackroyd (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1530?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13576929#comment-13576929 ] Edward Ackroyd commented on NUTCH-1530: --- Roland,

Nutch JAVA Application

2013-02-12 Thread Shann
Hi, Part of my internship, we must develop a specialized search engine using Nutch, Solr, HBase, Tika. I began to develop a Java application for crawler with Nuth branch 2.x. Functions inject, generate, fetch, parse, updatedb, solrindex based on the actual execution of nutch via a shell

[jira] [Comment Edited] (NUTCH-1530) Umlauts (üäö) garbled when fetch and parse in separate calls (OK when fetcher.parse is true)

2013-02-12 Thread Roland (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1530?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13577094#comment-13577094 ] Roland edited comment on NUTCH-1530 at 2/12/13 10:13 PM: - Ok, here

Re: Nutch JAVA Application

2013-02-12 Thread Mattmann, Chris A (388J)
Hi Shann, Thank you for reaching out! If your goal is to get your project integrated into Apache Nutch, proper, then I would recommend simply: 0. File some JIRA issues in Apache Nutch http://issues.apache.org/jira/browse/NUTCH Small incremental patches and issues are preferred and this will let

[jira] [Updated] (NUTCH-1530) Umlauts (üäö) garbled when fetch and parse in separate calls (OK when fetcher.parse is true)

2013-02-12 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1530?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1530: Fix Version/s: 2.2 Umlauts (üäö) garbled when fetch and parse in separate

[jira] [Commented] (NUTCH-1530) Umlauts (üäö) garbled when fetch and parse in separate calls (OK when fetcher.parse is true)

2013-02-12 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1530?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13577299#comment-13577299 ] Lewis John McGibbney commented on NUTCH-1530: - Do you know which parser is