Re: [VOTE] Release Apache Nutch 1.0

2009-03-09 Thread Marko Bauhardt
my non-binding +1 marko On Mar 8, 2009, at 10:07 PM, Dennis Kubes wrote: Non-binding +1 too :) Sami Siren wrote: Hello, I have packaged the first release candidate for Apache Nutch 1.0 release at http://people.apache.org/~siren/nutch-1.0/rc0/ See the included CHANGES.txt file for

NUTCH-684 [was: Re: [VOTE] Release Apache Nutch 1.0]

2009-03-09 Thread Sami Siren
Doğacan Güney wrote: On Sun, Mar 8, 2009 at 20:25, Sami Siren ssi...@gmail.com wrote: Hello, I have packaged the first release candidate for Apache Nutch 1.0 release at http://people.apache.org/~siren/nutch-1.0/rc0/ See the included CHANGES.txt file for details on release contents and

Re: planning for nutch-1.0-rc1

2009-03-09 Thread Bartosz Gadzimski
Hello, It's on 2 Linux boxes, one with CentOS and one with Ubuntu. Both properly run the old bin/nutch crawl. The problem is that it doesn't throw an exception on the command line or in Eclipse; it just writes to the logs, so it's hard to debug. One is running Nutch trunk from 07 March, and one from today's rc1

Re: NUTCH-684 [was: Re: [VOTE] Release Apache Nutch 1.0]

2009-03-09 Thread Doğacan Güney
On 09.Mar.2009, at 11:05, Sami Siren ssi...@gmail.com wrote: Doğacan Güney wrote: On Sun, Mar 8, 2009 at 20:25, Sami Siren ssi...@gmail.com wrote: Hello, I have packaged the first release candidate for Apache Nutch 1.0 release at http://people.apache.org/~siren/nutch-1.0/rc0/ See

[jira] Commented: (NUTCH-684) Dedup support for Solr

2009-03-09 Thread Shalin Shekhar Mangar (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-684?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12680173#action_12680173 ] Shalin Shekhar Mangar commented on NUTCH-684: - Just found this issue from Sami's

[jira] Created: (NUTCH-713) Config options for webgraph Scoring not documented

2009-03-09 Thread Eric J. Christeson (JIRA)
Config options for webgraph Scoring not documented -- Key: NUTCH-713 URL: https://issues.apache.org/jira/browse/NUTCH-713 Project: Nutch Issue Type: Improvement Components: indexer

Re: NUTCH-684 [was: Re: [VOTE] Release Apache Nutch 1.0]

2009-03-09 Thread Sami Siren
Doğacan Güney wrote: On 09.Mar.2009, at 11:05, Sami Siren ssi...@gmail.com wrote: Doğacan Güney wrote: On Sun, Mar 8, 2009 at 20:25, Sami Siren ssi...@gmail.com wrote: Hello, I have packaged the first release candidate for Apache Nutch

[jira] Updated: (NUTCH-713) Config options for webgraph Scoring not documented

2009-03-09 Thread Eric J. Christeson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-713?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eric J. Christeson updated NUTCH-713: - Attachment: webgraph-scoring.diff Patch to add config options to conf/nutch-default.xml

Re: [VOTE] Release Apache Nutch 1.0

2009-03-09 Thread Eric J. Christeson
non-binding +1 -- Eric J. Christeson eric.christe...@ndsu.edu Enterprise Computing and Infrastructure (701) 231-8693 (Voice) North Dakota State University

[jira] Commented: (NUTCH-684) Dedup support for Solr

2009-03-09 Thread Andrzej Bialecki (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-684?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12680194#action_12680194 ] Andrzej Bialecki commented on NUTCH-684: - Yes, I'm aware of this functionality. At

Re: NUTCH-684 [was: Re: [VOTE] Release Apache Nutch 1.0]

2009-03-09 Thread Doğacan Güney
On Mon, Mar 9, 2009 at 17:46, Sami Siren ssi...@gmail.com wrote: Doğacan Güney wrote: On 09.Mar.2009, at 11:05, Sami Siren ssi...@gmail.com wrote: Doğacan Güney wrote: On Sun, Mar 8, 2009 at 20:25, Sami Siren ssi...@gmail.com wrote:

[jira] Closed: (NUTCH-684) Dedup support for Solr

2009-03-09 Thread JIRA
[ https://issues.apache.org/jira/browse/NUTCH-684?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Doğacan Güney closed NUTCH-684. --- Resolution: Fixed Fix Version/s: 1.0.0 Fixed as of rev. 751774. Dedup support for Solr

Nutch ML cleanup

2009-03-09 Thread Otis Gospodnetic
Hi, This has been bugging me for a while now. For some reason Nutch MLs get the most junk emails - both rude/rudeish emails, as well as clear spam (with SPAM in the subject - something must be detecting it). I just looked at the headers of the clearly labeled spam messages and found that

[jira] Created: (NUTCH-714) Need a SFTP and SCP Protocol Handler

2009-03-09 Thread Sanjoy Ghosh (JIRA)
Need a SFTP and SCP Protocol Handler Key: NUTCH-714 URL: https://issues.apache.org/jira/browse/NUTCH-714 Project: Nutch Issue Type: New Feature Components: fetcher Affects Versions: 0.9.0

[jira] Commented: (NUTCH-714) Need a SFTP and SCP Protocol Handler

2009-03-09 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-714?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12680348#action_12680348 ] Chris A. Mattmann commented on NUTCH-714: - Hi Sanjoy, When you get a patch, let me

[jira] Assigned: (NUTCH-714) Need a SFTP and SCP Protocol Handler

2009-03-09 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-714?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann reassigned NUTCH-714: --- Assignee: Chris A. Mattmann Need a SFTP and SCP Protocol Handler

[jira] Commented: (NUTCH-684) Dedup support for Solr

2009-03-09 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-684?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12680374#action_12680374 ] Hudson commented on NUTCH-684: -- Integrated in Nutch-trunk #748 (See

[jira] Created: (NUTCH-715) Subcollection plugin doesn't work with default subcollections.xml file

2009-03-09 Thread Dmitry Lihachev (JIRA)
Subcollection plugin doesn't work with default subcollections.xml file -- Key: NUTCH-715 URL: https://issues.apache.org/jira/browse/NUTCH-715 Project: Nutch Issue Type: Bug