Re: planning for nutch-1.0-rc1

2009-03-12 Thread Bartosz Gadzimski
Hello Dennis, We'v been trying your new framework and indexer and everything looks better now. But we can't understand what should be output of last command (FieldIndexer). We have: u...@kubuntu:~/nutch-1.0$ ls crawl/indexes/part-0/ index.done segments_1 segments.gen

Re: planning for nutch-1.0-rc1

2009-03-09 Thread Bartosz Gadzimski
Hello, It's on 2 linux boxes one with centos and one with ubuntu. Both properly running old bin/nutch crawl. Problem is that it doesn't give exception on command line or in eclipse just writes to logs so it's hard to debug. One is running nutch trunk from 07 march, and one from todays rc1

Re: planning for nutch-1.0-rc1

2009-03-08 Thread Bartosz Gadzimski
Hello, Thanks Dennis for updateing wiki it helped a lot. You gave example with indexing but you didn't said a bit about it. Can you write some more? :) Anyways I have problems at the last step (nutch from 07 march): bin/nutch org.apache.nutch.indexer.field.FieldIndexer It simply stops

Re: planning for nutch-1.0-rc1

2009-03-08 Thread Dennis Kubes
Sorry about the docs being sparse on this. I will write more about the process as time permits. Don't know about the problem below. What platform are you running on, windows, linux? Dennis Bartosz Gadzimski wrote: Hello, Thanks Dennis for updateing wiki it helped a lot. You gave example

Re: planning for nutch-1.0-rc1

2009-03-06 Thread Dennis Kubes
NUTCH-578 was a while back but as I remember it worked fine. No objections to either including or pushing it. Dennis Sami Siren wrote: I am planning to build the first rc for nutch 1.0 at Tue 3.3.2009 morning (EET). There are still some issues marked as fix for 1.0 in Jira. Neither of the

Re: planning for nutch-1.0-rc1

2009-03-06 Thread Dennis Kubes
I don't know if I would make this primary yet. I need to check what is causing this as it worked fine for me, in fact we currently have it in production. Also we would need to update the shell scripts to integrate this more tightly. Dennis Bartosz Gadzimski wrote: Sami Siren pisze:

Re: planning for nutch-1.0-rc1

2009-03-06 Thread Andrzej Bialecki
Dennis Kubes wrote: I don't know if I would make this primary yet. Not before 1.0 ... ;) After that, we need to discuss what to do with the new and the current framework. -- Best regards, Andrzej Bialecki ___. ___ ___ ___ _ _ __ [__ || __|__/|__||\/|

Re: planning for nutch-1.0-rc1

2009-03-05 Thread Sami Siren
I am sure all of you noticed that the release planned to be cut during this week was delayed because of a new discovery right before the deadline (NUTCH-711). That has now been fixed so it's time to move on. I am now going to build the first RC during the weekend. -- Sami Siren Sami Siren

Re: planning for nutch-1.0-rc1

2009-03-02 Thread Sami Siren
Andrzej Bialecki wrote: Sami Siren wrote: I am planning to build the first rc for nutch 1.0 at Tue 3.3.2009 morning (EET). There are still some issues marked as fix for 1.0 in Jira. Neither of the two remaining _bugs_ seems too important to me, actually I only count the issues assigned to

Re: planning for nutch-1.0-rc1

2009-03-02 Thread Bartosz Gadzimski
Sami Siren pisze: Andrzej Bialecki wrote: Sami Siren wrote: I am planning to build the first rc for nutch 1.0 at Tue 3.3.2009 morning (EET). There are still some issues marked as fix for 1.0 in Jira. Neither of the two remaining _bugs_ seems too important to me, actually I only count the

planning for nutch-1.0-rc1

2009-02-28 Thread Sami Siren
I am planning to build the first rc for nutch 1.0 at Tue 3.3.2009 morning (EET). There are still some issues marked as fix for 1.0 in Jira. Neither of the two remaining _bugs_ seems too important to me, actually I only count the issues assigned to developers as real candidates to be included

Re: planning for nutch-1.0-rc1

2009-02-28 Thread Andrzej Bialecki
Sami Siren wrote: NUTCH-477 (ab) I decided to postpone this - the patch brings a lot of complexity, and it seems that it would be useful to few users. -- Best regards, Andrzej Bialecki ___. ___ ___ ___ _ _ __ [__ || __|__/|__||\/| Information

Re: planning for nutch-1.0-rc1

2009-02-28 Thread Andrzej Bialecki
Sami Siren wrote: I am planning to build the first rc for nutch 1.0 at Tue 3.3.2009 morning (EET). There are still some issues marked as fix for 1.0 in Jira. Neither of the two remaining _bugs_ seems too important to me, actually I only count the issues assigned to developers as real