[jira] Updated: (NUTCH-715) Subcollection plugin doesn't work with default subcollections.xml file

2009-03-10 Thread Dmitry Lihachev (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-715?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dmitry Lihachev updated NUTCH-715: -- Attachment: NUTCH-715-testcase.patch Subcollection plugin doesn't work with default

[jira] Updated: (NUTCH-715) Subcollection plugin doesn't work with default subcollections.xml file

2009-03-10 Thread Dmitry Lihachev (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-715?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dmitry Lihachev updated NUTCH-715: -- Attachment: NUTCH-715-fix.patch Subcollection plugin doesn't work with default

[jira] Updated: (NUTCH-715) Subcollection plugin doesn't work with default subcollections.xml file

2009-03-10 Thread Dmitry Lihachev (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-715?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dmitry Lihachev updated NUTCH-715: -- Attachment: (was: NUTCH-715-fix.patch) Subcollection plugin doesn't work with default

[jira] Updated: (NUTCH-715) Subcollection plugin doesn't work with default subcollections.xml file

2009-03-10 Thread Dmitry Lihachev (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-715?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dmitry Lihachev updated NUTCH-715: -- Attachment: NUTCH-715_subcollections_fix.patch Subcollection plugin doesn't work with default

Re: [VOTE] Release Apache Nutch 1.0

2009-03-10 Thread Sami Siren
This vote has been cancelled due to some last minute additions. I will post another RC soon. Sami Siren wrote: -- Sami Siren Hello, I have packaged the first release candidate for Apache Nutch 1.0 release at http://people.apache.org/~siren/nutch-1.0/rc0/ See the included CHANGES.txt file

Re: Nutch ML cleanup

2009-03-10 Thread Sami Siren
Like I suspected: I have no power to do or view any admin stuff there. Btw, I am not seeing any spam; perhaps Google takes care of that for me? -- Sami Siren Sami Siren wrote: I'll take a look at this, I am pretty sure we have to ask Doug at the end :) -- Sami Siren Otis Gospodnetic wrote:

[jira] Resolved: (NUTCH-715) Subcollection plugin doesn't work with default subcollections.xml file

2009-03-10 Thread Sami Siren (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-715?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sami Siren resolved NUTCH-715. -- Resolution: Fixed committed, thanks Dmitry! Subcollection plugin doesn't work with default

[VOTE] Release Apache Nutch 1.0

2009-03-10 Thread Sami Siren
Hello, I have packaged the second release candidate for Apache Nutch 1.0 release at http://people.apache.org/~siren/nutch-1.0/rc1/ See the CHANGES.txt[1] file for details on release contents and latest changes. The release was made from tag:

Re: [VOTE] Release Apache Nutch 1.0

2009-03-10 Thread Doğacan Güney
Again, my non-binding +1 :) On 10.Mar.2009, at 09:34, Sami Siren ssi...@gmail.com wrote: Hello, I have packaged the second release candidate for Apache Nutch 1.0 release at http://people.apache.org/~siren/nutch-1.0/rc1/ See the CHANGES.txt[1] file for details on release contents and

Re: [VOTE] Release Apache Nutch 1.0

2009-03-10 Thread Sami Siren
!!!NOTE!!! There was a faulty link in the message I sent earlier; hopefully I get it right this time: Hello, I have packaged the second release candidate for Apache Nutch 1.0 release at http://people.apache.org/~siren/nutch-1.0/rc1/ See the CHANGES.txt[1] file for details on release contents

[jira] Created: (NUTCH-716) Make subcollection index field multivalued

2009-03-10 Thread Dmitry Lihachev (JIRA)
Make subcollection index field multivalued -- Key: NUTCH-716 URL: https://issues.apache.org/jira/browse/NUTCH-716 Project: Nutch Issue Type: Improvement Components: indexer Affects

[jira] Updated: (NUTCH-716) Make subcollection index field multivalued

2009-03-10 Thread Dmitry Lihachev (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-716?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dmitry Lihachev updated NUTCH-716: -- Attachment: NUTCH-716_multivalued_subcollection.patch Make subcollection index field

[jira] Commented: (NUTCH-705) parse-rtf plugin

2009-03-10 Thread Sami Siren (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-705?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12680411#action_12680411 ] Sami Siren commented on NUTCH-705: -- I think we should start looking at Apache Tika for most

Re: Nutch ML cleanup

2009-03-10 Thread Andrzej Bialecki
Otis Gospodnetic wrote: Hi, This has been bugging me for a while now. For some reason Nutch MLs get the most junk emails - both rude/rudeish emails, as well as clear spam (with SPAM in the subject - something must be detecting it). I just looked at the headers of the clearly labeled spam

Moving Nutch parsers to Tika

2009-03-10 Thread Andrzej Bialecki
Hi all, I've been debating this for a while, too, what Sami suggested in another thread: I think we should start looking at Apache Tika for most (or all) of our parsers. This is actually a part of my broader vision for Nutch, that this project should not duplicate functionality of other

[jira] Created: (NUTCH-717) Make Nutch Solr integration easier

2009-03-10 Thread Sami Siren (JIRA)
Make Nutch Solr integration easier -- Key: NUTCH-717 URL: https://issues.apache.org/jira/browse/NUTCH-717 Project: Nutch Issue Type: New Feature Reporter: Sami Siren Fix For: 1.1

Re: Moving Nutch parsers to Tika

2009-03-10 Thread Sami Siren
Andrzej Bialecki wrote: Hi all, I've been debating this for a while, too, what Sami suggested in another thread: I think we should start looking at Apache Tika for most (or all) of our parsers. This is actually a part of my broader vision for Nutch, that this project should not duplicate

Use of gene...@l.a.o for...

2009-03-10 Thread Grant Ingersoll
Apologies for cross-posting, but I wanted to make sure committers for the various subs all saw it (if I missed one, my apologies up front). Please, if you are going to reply, reply to general@ and not to all the CC's. Just wanted to make a couple of notes about the use of gene...@l.a.o:

[no subject]

2009-03-10 Thread Agnieszka Zbrzezny
Hello, I'm new to Nutch programming and also to this mailing list. I'd like to change the search option. It currently uses BooleanQuery, but I need to use WildcardQuery. Is anyone doing something like that? Thanks for your help, Agnieszka

Re: [VOTE] Release Apache Nutch 1.0

2009-03-10 Thread Mattmann, Chris A
My non-binding +1. Thanks, Sami! Cheers, Chris On 3/9/09 11:34 PM, Sami Siren ssi...@gmail.com wrote: Hello, I have packaged the second release candidate for Apache Nutch 1.0 release at http://people.apache.org/~siren/nutch-1.0/rc1/ http://people.apache.org/%7Esiren/nutch-1.0/rc0/

Re: Moving Nutch parsers to Tika

2009-03-10 Thread Otis Gospodnetic
I absolutely agree. Duplicating the work and focusing on non-core when the same functionality can be gotten by using Tika is not wise for Nutch. Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message From: Andrzej Bialecki a...@getopt.org To:

Re: Nutch ML cleanup

2009-03-10 Thread Doug Cutting
ogjunk-nu...@yahoo.com is a member of nutch-...@lists.sourceforge.net and nutch-gene...@lists.sourceforge.net. These lists do not otherwise appear to forward to Apache lists. They used to perhaps forward through nutch.org lists, but that domain no longer forwards any email. Please check the

[jira] Commented: (NUTCH-715) Subcollection plugin doesn't work with default subcollections.xml file

2009-03-10 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-715?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12680749#action_12680749 ] Hudson commented on NUTCH-715: -- Integrated in Nutch-trunk #749 (See