[jira] [Created] (NUTCH-1700) Remove deprecated code in src/plugin/creativecommons/build.xml

2014-01-14 Thread Lewis John McGibbney (JIRA)
Lewis John McGibbney created NUTCH-1700: --- Summary: Remove deprecated code in src/plugin/creativecommons/build.xml Key: NUTCH-1700 URL: https://issues.apache.org/jira/browse/NUTCH-1700 Project:

[jira] [Comment Edited] (NUTCH-1699) Tika Parser - Image Parse Bug

2014-01-14 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1699?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13870543#comment-13870543 ] Lewis John McGibbney edited comment on NUTCH-1699 at 1/14/14 9:00 AM:

[jira] [Updated] (NUTCH-1699) Tika Parser - Image Parse Bug

2014-01-14 Thread Sebastian Nagel (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1699?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel updated NUTCH-1699: --- Attachment: NUTCH-1699-trunk.patch +1 problem confirmed, patch tested with trunk. Thanks,

[jira] [Updated] (NUTCH-1699) Tika Parser - Image Parse Bug

2014-01-14 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1699?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma updated NUTCH-1699: - Affects Version/s: 1.7 Fix Version/s: 1.8 Tika Parser - Image Parse Bug

[jira] [Commented] (NUTCH-1699) Tika Parser - Image Parse Bug

2014-01-14 Thread JIRA
[ https://issues.apache.org/jira/browse/NUTCH-1699?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13870621#comment-13870621 ] Mehmet Zahid Yüzügüldü commented on NUTCH-1699: --- Thank you all Tika Parser

[jira] [Commented] (NUTCH-1568) port pluggable indexing architecture to 2.x

2014-01-14 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13870662#comment-13870662 ] Talat UYARER commented on NUTCH-1568: - Hi [~lewismc], Thanks for update. In your

[jira] [Commented] (NUTCH-1568) port pluggable indexing architecture to 2.x

2014-01-14 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13870668#comment-13870668 ] Lewis John McGibbney commented on NUTCH-1568: - I am +1 for this patch and the

[jira] [Comment Edited] (NUTCH-1568) port pluggable indexing architecture to 2.x

2014-01-14 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13870668#comment-13870668 ] Lewis John McGibbney edited comment on NUTCH-1568 at 1/14/14 12:16 PM:

[jira] [Commented] (NUTCH-1568) port pluggable indexing architecture to 2.x

2014-01-14 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13870671#comment-13870671 ] Talat UYARER commented on NUTCH-1568: - Thanks [~lewis], I have an objection :) can you

[jira] [Comment Edited] (NUTCH-1568) port pluggable indexing architecture to 2.x

2014-01-14 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13870671#comment-13870671 ] Talat UYARER edited comment on NUTCH-1568 at 1/14/14 12:33 PM:

[jira] [Commented] (NUTCH-1113) Merging segments causes URLs to vanish from crawldb/index?

2014-01-14 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1113?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13870690#comment-13870690 ] Markus Jelsma commented on NUTCH-1113: -- This works too, but if we ditch most LINKED

Re: Proposal for SolrIndexWriter

2014-01-14 Thread Lajos
I realise I should have made myself clearer on one point. I understand that the current design comes from a Nutch-centric paradigm, in which Solr is used to hold the indexing data from Nutch. In this paradigm, I suppose the Nutch data needs to be fully mapped to Solr. But I'm interested in a

[jira] [Created] (NUTCH-1701) Make Solr Document Boost as an option

2014-01-14 Thread Tien Nguyen Manh (JIRA)
Tien Nguyen Manh created NUTCH-1701: --- Summary: Make Solr Document Boost as an option Key: NUTCH-1701 URL: https://issues.apache.org/jira/browse/NUTCH-1701 Project: Nutch Issue Type:

[jira] [Updated] (NUTCH-1701) Make Solr Document Boost as an option

2014-01-14 Thread Tien Nguyen Manh (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1701?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tien Nguyen Manh updated NUTCH-1701: Fix Version/s: 1.8 2.3 Make Solr Document Boost as an option

[jira] [Updated] (NUTCH-1701) Make Solr Document Boost as an option

2014-01-14 Thread Tien Nguyen Manh (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1701?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tien Nguyen Manh updated NUTCH-1701: Attachment: NUTCH-1701-2x.patch Make Solr Document Boost as an option