Re: JIRAClient

2008-02-07 Thread Sebastian Steinmetz

Hi,

For Eclipse users a direct integration into the IDE via Mylyn may be  
interesting too.


Have a look here:
MyLyn homepage: http://www.eclipse.org/mylyn/
JIRA plugin: http://confluence.atlassian.com/display/JIRAEXT/Mylyn

best regards,
Sebastian Steinmetz

Am 06.02.2008 um 13:19 schrieb Andrzej Bialecki:


Hi,

(This info mostly concerns committers and frequent contributors to  
Nutch.)


I've been using JIRAClient (http://almworks.com) to clean up our  
Augias stable of JIRA issues, and I find this tool extremely useful.  
I highly recommend it - the UI is much more user-friendly and  
efficient than the standard web UI. It maintains an internal DB,  
periodically synced with the JIRA, so that all operations like  
filters, searches, bulk edits etc  are many times faster than with  
web UI.


They offer free licenses for managing Open Source projects - IIRC  
it's enough to kindly ask.


(Disclaimer: I'm not in any way related to that company, I'm just a  
happy user).


--
Best regards,
Andrzej Bialecki 
___. ___ ___ ___ _ _   __
[__ || __|__/|__||\/|  Information Retrieval, Semantic Web
___|||__||  \|  ||  |  Embedded Unix, System Integration
http://www.sigram.com  Contact: info at sigram dot com





Re: nutch latest build - inject operation failing

2008-02-07 Thread Andrzej Bialecki

DS jha wrote:

Hi -

Looks like latest trunk version of nutch is failing with the following
exception when trying to perform inject operation:

java.io.IOException: Target
file:/tmp/hadoop-user/mapred/temp/inject-temp-1280136828/_reduce_dv90x0/part-0
already exists


Is this really the latest trunk? Can you check the version of 
lib/hadoop*.jar? It should be 0.15.3. And make sure you have no other 
older hadoop libraries on the classpath.


--
Best regards,
Andrzej Bialecki 
 ___. ___ ___ ___ _ _   __
[__ || __|__/|__||\/|  Information Retrieval, Semantic Web
___|||__||  \|  ||  |  Embedded Unix, System Integration
http://www.sigram.com  Contact: info at sigram dot com



Re: JIRAClient

2008-02-07 Thread Sami Siren

Sebastian Steinmetz wrote:

Hi,

For Eclipse users a direct integration into the IDE via Mylyn may be 
interesting too.


IMO that tool has a lot of potential. Last time I tried it it was very 
resource intensive and missed some features in jira integration. I know 
that atleast some missing features have been implemented since then.


--
 Sami Siren


Re: nutch latest build - inject operation failing

2008-02-07 Thread DS jha
Yeah - it is using hadoop v 0.15.3 jar file - strange!


Thanks
Jha


On Feb 7, 2008 8:11 AM, Andrzej Bialecki [EMAIL PROTECTED] wrote:
 DS jha wrote:
  Hi -
 
  Looks like latest trunk version of nutch is failing with the following
  exception when trying to perform inject operation:
 
  java.io.IOException: Target
  file:/tmp/hadoop-user/mapred/temp/inject-temp-1280136828/_reduce_dv90x0/part-0
  already exists

 Is this really the latest trunk? Can you check the version of
 lib/hadoop*.jar? It should be 0.15.3. And make sure you have no other
 older hadoop libraries on the classpath.

 --
 Best regards,
 Andrzej Bialecki 
   ___. ___ ___ ___ _ _   __
 [__ || __|__/|__||\/|  Information Retrieval, Semantic Web
 ___|||__||  \|  ||  |  Embedded Unix, System Integration
 http://www.sigram.com  Contact: info at sigram dot com




Maybe doing a 0.9.1 release

2008-02-07 Thread Dennis Kubes
What are everyones thoughts about doing an intermediate 0.9.1 release? 
There have been some patches lately which might be better in a release 
state for newcomers.


Dennis


Re: Maybe doing a 0.9.1 release

2008-02-07 Thread Andrzej Bialecki

Dennis Kubes wrote:
What are everyones thoughts about doing an intermediate 0.9.1 release? 
There have been some patches lately which might be better in a release 
state for newcomers.


Which ones did you have in mind?


--
Best regards,
Andrzej Bialecki 
 ___. ___ ___ ___ _ _   __
[__ || __|__/|__||\/|  Information Retrieval, Semantic Web
___|||__||  \|  ||  |  Embedded Unix, System Integration
http://www.sigram.com  Contact: info at sigram dot com



Re: Maybe doing a 0.9.1 release

2008-02-07 Thread Dennis Kubes
Mainly the ones that were causing problems with searching due to null 
content.  This is kinda low priority and I know we wanted to release a 
1.0 sometime soon with 0.16.  Maybe its too much of a hassle, just 
thought I would bring it up.


Dennis

Andrzej Bialecki wrote:

Dennis Kubes wrote:
What are everyones thoughts about doing an intermediate 0.9.1 release? 
There have been some patches lately which might be better in a release 
state for newcomers.


Which ones did you have in mind?




[jira] Commented: (NUTCH-602) Allow configurable number of handlers for search servers

2008-02-07 Thread Sami Siren (JIRA)

[ 
https://issues.apache.org/jira/browse/NUTCH-602?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12566779#action_12566779
 ] 

Sami Siren commented on NUTCH-602:
--

+1

 Allow configurable number of handlers for search servers
 

 Key: NUTCH-602
 URL: https://issues.apache.org/jira/browse/NUTCH-602
 Project: Nutch
  Issue Type: Improvement
  Components: searcher
 Environment: All
Reporter: Dennis Kubes
Assignee: Dennis Kubes
 Fix For: 1.0.0

 Attachments: NUTCH-602-1-20080205.patch


 This improvement changes the distributed search server to allow a 
 configurable number of RPC handlers.  Before the number was hardcoded at 10 
 handlers.  For high volume environments that limit will be quickly reached 
 and the overall search will slowdown.  The patch changes nutch-default.xml 
 with the configuration parameter searchers.num.handlers and changes 
 DistributedSearch to pull the number of handlers from the configuration.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



Re: nutch latest build - inject operation failing

2008-02-07 Thread Dennis Kubes
Would need more info on your configuration.  Local or DFS, 
hadoop-site.xml changes.


Dennis

DS jha wrote:

I tried setting it to false but it was still throwing the same error.

Looks like when I am using older version of hadoop (0.14.4) it is working fine.

Thanks



On Feb 7, 2008 10:37 AM, Dennis Kubes [EMAIL PROTECTED] wrote:

Do you have speculative execution turned on.  If so turn it off.

Dennis


DS jha wrote:

This is running on Windows/Cygwin, with username 'user' - and it is
using default hadoop-site.xml

Thanks,
Jha

On Feb 7, 2008 10:03 AM, Andrzej Bialecki [EMAIL PROTECTED] wrote:

DS jha wrote:

Yeah - it is using hadoop v 0.15.3 jar file - strange!


Thanks
Jha


On Feb 7, 2008 8:11 AM, Andrzej Bialecki [EMAIL PROTECTED] wrote:

DS jha wrote:

Hi -

Looks like latest trunk version of nutch is failing with the following
exception when trying to perform inject operation:

java.io.IOException: Target
file:/tmp/hadoop-user/mapred/temp/inject-temp-1280136828/_reduce_dv90x0/part-0
already exists

Hmm, wait - this path is strange in itself, because it starts with
/tmp/hadoop-user ... Are you running on *nix or Windows/Cygwin? Did you
change hadoop-site.xml to redefine hadoop.tmp.dir ? Or perhaps you are
running as a user with username user ?


--

Best regards,
Andrzej Bialecki 
  ___. ___ ___ ___ _ _   __
[__ || __|__/|__||\/|  Information Retrieval, Semantic Web
___|||__||  \|  ||  |  Embedded Unix, System Integration
http://www.sigram.com  Contact: info at sigram dot com




Re: nutch latest build - inject operation failing

2008-02-07 Thread DS jha
I tried setting it to false but it was still throwing the same error.

Looks like when I am using older version of hadoop (0.14.4) it is working fine.

Thanks



On Feb 7, 2008 10:37 AM, Dennis Kubes [EMAIL PROTECTED] wrote:
 Do you have speculative execution turned on.  If so turn it off.

 Dennis


 DS jha wrote:
  This is running on Windows/Cygwin, with username 'user' - and it is
  using default hadoop-site.xml
 
  Thanks,
  Jha
 
  On Feb 7, 2008 10:03 AM, Andrzej Bialecki [EMAIL PROTECTED] wrote:
  DS jha wrote:
  Yeah - it is using hadoop v 0.15.3 jar file - strange!
 
 
  Thanks
  Jha
 
 
  On Feb 7, 2008 8:11 AM, Andrzej Bialecki [EMAIL PROTECTED] wrote:
  DS jha wrote:
  Hi -
 
  Looks like latest trunk version of nutch is failing with the following
  exception when trying to perform inject operation:
 
  java.io.IOException: Target
  file:/tmp/hadoop-user/mapred/temp/inject-temp-1280136828/_reduce_dv90x0/part-0
  already exists
  Hmm, wait - this path is strange in itself, because it starts with
  /tmp/hadoop-user ... Are you running on *nix or Windows/Cygwin? Did you
  change hadoop-site.xml to redefine hadoop.tmp.dir ? Or perhaps you are
  running as a user with username user ?
 
 
  --
 
  Best regards,
  Andrzej Bialecki 
___. ___ ___ ___ _ _   __
  [__ || __|__/|__||\/|  Information Retrieval, Semantic Web
  ___|||__||  \|  ||  |  Embedded Unix, System Integration
  http://www.sigram.com  Contact: info at sigram dot com
 
 



Re: nutch latest build - inject operation failing

2008-02-07 Thread Andrzej Bialecki

DS jha wrote:

Yeah - it is using hadoop v 0.15.3 jar file - strange!


Thanks
Jha


On Feb 7, 2008 8:11 AM, Andrzej Bialecki [EMAIL PROTECTED] wrote:

DS jha wrote:

Hi -

Looks like latest trunk version of nutch is failing with the following
exception when trying to perform inject operation:

java.io.IOException: Target
file:/tmp/hadoop-user/mapred/temp/inject-temp-1280136828/_reduce_dv90x0/part-0
already exists


Hmm, wait - this path is strange in itself, because it starts with 
/tmp/hadoop-user ... Are you running on *nix or Windows/Cygwin? Did you 
change hadoop-site.xml to redefine hadoop.tmp.dir ? Or perhaps you are 
running as a user with username user ?



--
Best regards,
Andrzej Bialecki 
 ___. ___ ___ ___ _ _   __
[__ || __|__/|__||\/|  Information Retrieval, Semantic Web
___|||__||  \|  ||  |  Embedded Unix, System Integration
http://www.sigram.com  Contact: info at sigram dot com



[jira] Commented: (NUTCH-602) Allow configurable number of handlers for search servers

2008-02-07 Thread Dennis Kubes (JIRA)

[ 
https://issues.apache.org/jira/browse/NUTCH-602?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12566772#action_12566772
 ] 

Dennis Kubes commented on NUTCH-602:


If nobody has any objections I will commit this soon.

 Allow configurable number of handlers for search servers
 

 Key: NUTCH-602
 URL: https://issues.apache.org/jira/browse/NUTCH-602
 Project: Nutch
  Issue Type: Improvement
  Components: searcher
 Environment: All
Reporter: Dennis Kubes
Assignee: Dennis Kubes
 Fix For: 1.0.0

 Attachments: NUTCH-602-1-20080205.patch


 This improvement changes the distributed search server to allow a 
 configurable number of RPC handlers.  Before the number was hardcoded at 10 
 handlers.  For high volume environments that limit will be quickly reached 
 and the overall search will slowdown.  The patch changes nutch-default.xml 
 with the configuration parameter searchers.num.handlers and changes 
 DistributedSearch to pull the number of handlers from the configuration.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Resolved: (NUTCH-602) Allow configurable number of handlers for search servers

2008-02-07 Thread Dennis Kubes (JIRA)

 [ 
https://issues.apache.org/jira/browse/NUTCH-602?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Dennis Kubes resolved NUTCH-602.


Resolution: Fixed

Committed.

 Allow configurable number of handlers for search servers
 

 Key: NUTCH-602
 URL: https://issues.apache.org/jira/browse/NUTCH-602
 Project: Nutch
  Issue Type: Improvement
  Components: searcher
 Environment: All
Reporter: Dennis Kubes
Assignee: Dennis Kubes
 Fix For: 1.0.0

 Attachments: NUTCH-602-1-20080205.patch


 This improvement changes the distributed search server to allow a 
 configurable number of RPC handlers.  Before the number was hardcoded at 10 
 handlers.  For high volume environments that limit will be quickly reached 
 and the overall search will slowdown.  The patch changes nutch-default.xml 
 with the configuration parameter searchers.num.handlers and changes 
 DistributedSearch to pull the number of handlers from the configuration.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



Re: nutch latest build - inject operation failing

2008-02-07 Thread DS jha
This is running on Windows/Cygwin, with username 'user' - and it is
using default hadoop-site.xml

Thanks,
Jha

On Feb 7, 2008 10:03 AM, Andrzej Bialecki [EMAIL PROTECTED] wrote:
 DS jha wrote:
  Yeah - it is using hadoop v 0.15.3 jar file - strange!
 
 
  Thanks
  Jha
 
 
  On Feb 7, 2008 8:11 AM, Andrzej Bialecki [EMAIL PROTECTED] wrote:
  DS jha wrote:
  Hi -
 
  Looks like latest trunk version of nutch is failing with the following
  exception when trying to perform inject operation:
 
  java.io.IOException: Target
  file:/tmp/hadoop-user/mapred/temp/inject-temp-1280136828/_reduce_dv90x0/part-0
  already exists

 Hmm, wait - this path is strange in itself, because it starts with
 /tmp/hadoop-user ... Are you running on *nix or Windows/Cygwin? Did you
 change hadoop-site.xml to redefine hadoop.tmp.dir ? Or perhaps you are
 running as a user with username user ?


 --

 Best regards,
 Andrzej Bialecki 
   ___. ___ ___ ___ _ _   __
 [__ || __|__/|__||\/|  Information Retrieval, Semantic Web
 ___|||__||  \|  ||  |  Embedded Unix, System Integration
 http://www.sigram.com  Contact: info at sigram dot com




Re: nutch latest build - inject operation failing

2008-02-07 Thread Dennis Kubes

Do you have speculative execution turned on.  If so turn it off.

Dennis

DS jha wrote:

This is running on Windows/Cygwin, with username 'user' - and it is
using default hadoop-site.xml

Thanks,
Jha

On Feb 7, 2008 10:03 AM, Andrzej Bialecki [EMAIL PROTECTED] wrote:

DS jha wrote:

Yeah - it is using hadoop v 0.15.3 jar file - strange!


Thanks
Jha


On Feb 7, 2008 8:11 AM, Andrzej Bialecki [EMAIL PROTECTED] wrote:

DS jha wrote:

Hi -

Looks like latest trunk version of nutch is failing with the following
exception when trying to perform inject operation:

java.io.IOException: Target
file:/tmp/hadoop-user/mapred/temp/inject-temp-1280136828/_reduce_dv90x0/part-0
already exists

Hmm, wait - this path is strange in itself, because it starts with
/tmp/hadoop-user ... Are you running on *nix or Windows/Cygwin? Did you
change hadoop-site.xml to redefine hadoop.tmp.dir ? Or perhaps you are
running as a user with username user ?


--

Best regards,
Andrzej Bialecki 
  ___. ___ ___ ___ _ _   __
[__ || __|__/|__||\/|  Information Retrieval, Semantic Web
___|||__||  \|  ||  |  Embedded Unix, System Integration
http://www.sigram.com  Contact: info at sigram dot com




[jira] Created: (NUTCH-605) Change deprecated configuration methods for Hadoop

2008-02-07 Thread Dennis Kubes (JIRA)
Change deprecated configuration methods for Hadoop
--

 Key: NUTCH-605
 URL: https://issues.apache.org/jira/browse/NUTCH-605
 Project: Nutch
  Issue Type: Improvement
 Environment: all
Reporter: Dennis Kubes
Assignee: Dennis Kubes
 Fix For: 1.0.0


Changes use of the now deprecated addFinalResource and addDefaultResource 
methods to just use addResouce

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (NUTCH-605) Change deprecated configuration methods for Hadoop

2008-02-07 Thread Dennis Kubes (JIRA)

 [ 
https://issues.apache.org/jira/browse/NUTCH-605?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Dennis Kubes updated NUTCH-605:
---

Attachment: NUTCH-605-1-20080207.patch

Changes deprecated resource methods for hadoop configuration.

 Change deprecated configuration methods for Hadoop
 --

 Key: NUTCH-605
 URL: https://issues.apache.org/jira/browse/NUTCH-605
 Project: Nutch
  Issue Type: Improvement
 Environment: all
Reporter: Dennis Kubes
Assignee: Dennis Kubes
 Fix For: 1.0.0

 Attachments: NUTCH-605-1-20080207.patch


 Changes use of the now deprecated addFinalResource and addDefaultResource 
 methods to just use addResouce

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (NUTCH-602) Allow configurable number of handlers for search servers

2008-02-07 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/NUTCH-602?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12566909#action_12566909
 ] 

Hudson commented on NUTCH-602:
--

Integrated in Nutch-trunk #355 (See 
[http://hudson.zones.apache.org/hudson/job/Nutch-trunk/355/])

 Allow configurable number of handlers for search servers
 

 Key: NUTCH-602
 URL: https://issues.apache.org/jira/browse/NUTCH-602
 Project: Nutch
  Issue Type: Improvement
  Components: searcher
 Environment: All
Reporter: Dennis Kubes
Assignee: Dennis Kubes
 Fix For: 1.0.0

 Attachments: NUTCH-602-1-20080205.patch


 This improvement changes the distributed search server to allow a 
 configurable number of RPC handlers.  Before the number was hardcoded at 10 
 handlers.  For high volume environments that limit will be quickly reached 
 and the overall search will slowdown.  The patch changes nutch-default.xml 
 with the configuration parameter searchers.num.handlers and changes 
 DistributedSearch to pull the number of handlers from the configuration.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.