Edward Ackroyd created NUTCH-1530:
-
Summary: Umlauts (üäö) garbled when fetch and parse in separate
calls (OK when fetcher.parse is true)
Key: NUTCH-1530
URL: https://issues.apache.org/jira/browse/NUTCH-1530
[
https://issues.apache.org/jira/browse/NUTCH-1530?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13576898#comment-13576898
]
Roland commented on NUTCH-1530:
---
Hi Edward,
there must be another factor causing this,
[
https://issues.apache.org/jira/browse/NUTCH-1530?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13576929#comment-13576929
]
Edward Ackroyd commented on NUTCH-1530:
---
Roland,
Hi,
As part of my internship, we must develop a specialized search engine using
Nutch, Solr, HBase, and Tika.
I began developing a Java crawler application with the Nutch 2.x branch.
The inject, generate, fetch, parse, updatedb, and solrindex functions are based on the
actual execution of nutch via a shell
[
https://issues.apache.org/jira/browse/NUTCH-1530?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13577094#comment-13577094
]
Roland edited comment on NUTCH-1530 at 2/12/13 10:13 PM:
-
Ok, here
Hi Shann,
Thank you for reaching out! If your goal is to get your project integrated
into Apache Nutch proper, then I would recommend simply:
0. File some JIRA issues in Apache Nutch
http://issues.apache.org/jira/browse/NUTCH Small incremental patches and
issues are preferred and this will let
[
https://issues.apache.org/jira/browse/NUTCH-1530?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Lewis John McGibbney updated NUTCH-1530:
Fix Version/s: 2.2
Umlauts (üäö) garbled when fetch and parse in separate
[
https://issues.apache.org/jira/browse/NUTCH-1530?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13577299#comment-13577299
]
Lewis John McGibbney commented on NUTCH-1530:
-
Do you know which parser is