Re: JIRAClient
Hi, For Eclipse users a direct integration into the IDE via Mylyn may be interesting too. Have a look here: MyLyn homepage: http://www.eclipse.org/mylyn/ JIRA plugin: http://confluence.atlassian.com/display/JIRAEXT/Mylyn best regards, Sebastian Steinmetz Am 06.02.2008 um 13:19 schrieb Andrzej Bialecki: Hi, (This info mostly concerns committers and frequent contributors to Nutch.) I've been using JIRAClient (http://almworks.com) to clean up our Augias stable of JIRA issues, and I find this tool extremely useful. I highly recommend it - the UI is much more user-friendly and efficient than the standard web UI. It maintains an internal DB, periodically synced with the JIRA, so that all operations like filters, searches, bulk edits etc are many times faster than with web UI. They offer free licenses for managing Open Source projects - IIRC it's enough to kindly ask. (Disclaimer: I'm not in any way related to that company, I'm just a happy user). -- Best regards, Andrzej Bialecki ___. ___ ___ ___ _ _ __ [__ || __|__/|__||\/| Information Retrieval, Semantic Web ___|||__|| \| || | Embedded Unix, System Integration http://www.sigram.com Contact: info at sigram dot com
Re: nutch latest build - inject operation failing
DS jha wrote: Hi - Looks like latest trunk version of nutch is failing with the following exception when trying to perform inject operation: java.io.IOException: Target file:/tmp/hadoop-user/mapred/temp/inject-temp-1280136828/_reduce_dv90x0/part-0 already exists Is this really the latest trunk? Can you check the version of lib/hadoop*.jar? It should be 0.15.3. And make sure you have no other older hadoop libraries on the classpath. -- Best regards, Andrzej Bialecki ___. ___ ___ ___ _ _ __ [__ || __|__/|__||\/| Information Retrieval, Semantic Web ___|||__|| \| || | Embedded Unix, System Integration http://www.sigram.com Contact: info at sigram dot com
Re: JIRAClient
Sebastian Steinmetz wrote: Hi, For Eclipse users a direct integration into the IDE via Mylyn may be interesting too. IMO that tool has a lot of potential. Last time I tried it it was very resource intensive and missed some features in jira integration. I know that atleast some missing features have been implemented since then. -- Sami Siren
Re: nutch latest build - inject operation failing
Yeah - it is using hadoop v 0.15.3 jar file - strange! Thanks Jha On Feb 7, 2008 8:11 AM, Andrzej Bialecki [EMAIL PROTECTED] wrote: DS jha wrote: Hi - Looks like latest trunk version of nutch is failing with the following exception when trying to perform inject operation: java.io.IOException: Target file:/tmp/hadoop-user/mapred/temp/inject-temp-1280136828/_reduce_dv90x0/part-0 already exists Is this really the latest trunk? Can you check the version of lib/hadoop*.jar? It should be 0.15.3. And make sure you have no other older hadoop libraries on the classpath. -- Best regards, Andrzej Bialecki ___. ___ ___ ___ _ _ __ [__ || __|__/|__||\/| Information Retrieval, Semantic Web ___|||__|| \| || | Embedded Unix, System Integration http://www.sigram.com Contact: info at sigram dot com
Maybe doing a 0.9.1 release
What are everyones thoughts about doing an intermediate 0.9.1 release? There have been some patches lately which might be better in a release state for newcomers. Dennis
Re: Maybe doing a 0.9.1 release
Dennis Kubes wrote: What are everyones thoughts about doing an intermediate 0.9.1 release? There have been some patches lately which might be better in a release state for newcomers. Which ones did you have in mind? -- Best regards, Andrzej Bialecki ___. ___ ___ ___ _ _ __ [__ || __|__/|__||\/| Information Retrieval, Semantic Web ___|||__|| \| || | Embedded Unix, System Integration http://www.sigram.com Contact: info at sigram dot com
Re: Maybe doing a 0.9.1 release
Mainly the ones that were causing problems with searching due to null content. This is kinda low priority and I know we wanted to release a 1.0 sometime soon with 0.16. Maybe its too much of a hassle, just thought I would bring it up. Dennis Andrzej Bialecki wrote: Dennis Kubes wrote: What are everyones thoughts about doing an intermediate 0.9.1 release? There have been some patches lately which might be better in a release state for newcomers. Which ones did you have in mind?
[jira] Commented: (NUTCH-602) Allow configurable number of handlers for search servers
[ https://issues.apache.org/jira/browse/NUTCH-602?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12566779#action_12566779 ] Sami Siren commented on NUTCH-602: -- +1 Allow configurable number of handlers for search servers Key: NUTCH-602 URL: https://issues.apache.org/jira/browse/NUTCH-602 Project: Nutch Issue Type: Improvement Components: searcher Environment: All Reporter: Dennis Kubes Assignee: Dennis Kubes Fix For: 1.0.0 Attachments: NUTCH-602-1-20080205.patch This improvement changes the distributed search server to allow a configurable number of RPC handlers. Before the number was hardcoded at 10 handlers. For high volume environments that limit will be quickly reached and the overall search will slowdown. The patch changes nutch-default.xml with the configuration parameter searchers.num.handlers and changes DistributedSearch to pull the number of handlers from the configuration. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.
Re: nutch latest build - inject operation failing
Would need more info on your configuration. Local or DFS, hadoop-site.xml changes. Dennis DS jha wrote: I tried setting it to false but it was still throwing the same error. Looks like when I am using older version of hadoop (0.14.4) it is working fine. Thanks On Feb 7, 2008 10:37 AM, Dennis Kubes [EMAIL PROTECTED] wrote: Do you have speculative execution turned on. If so turn it off. Dennis DS jha wrote: This is running on Windows/Cygwin, with username 'user' - and it is using default hadoop-site.xml Thanks, Jha On Feb 7, 2008 10:03 AM, Andrzej Bialecki [EMAIL PROTECTED] wrote: DS jha wrote: Yeah - it is using hadoop v 0.15.3 jar file - strange! Thanks Jha On Feb 7, 2008 8:11 AM, Andrzej Bialecki [EMAIL PROTECTED] wrote: DS jha wrote: Hi - Looks like latest trunk version of nutch is failing with the following exception when trying to perform inject operation: java.io.IOException: Target file:/tmp/hadoop-user/mapred/temp/inject-temp-1280136828/_reduce_dv90x0/part-0 already exists Hmm, wait - this path is strange in itself, because it starts with /tmp/hadoop-user ... Are you running on *nix or Windows/Cygwin? Did you change hadoop-site.xml to redefine hadoop.tmp.dir ? Or perhaps you are running as a user with username user ? -- Best regards, Andrzej Bialecki ___. ___ ___ ___ _ _ __ [__ || __|__/|__||\/| Information Retrieval, Semantic Web ___|||__|| \| || | Embedded Unix, System Integration http://www.sigram.com Contact: info at sigram dot com
Re: nutch latest build - inject operation failing
I tried setting it to false but it was still throwing the same error. Looks like when I am using older version of hadoop (0.14.4) it is working fine. Thanks On Feb 7, 2008 10:37 AM, Dennis Kubes [EMAIL PROTECTED] wrote: Do you have speculative execution turned on. If so turn it off. Dennis DS jha wrote: This is running on Windows/Cygwin, with username 'user' - and it is using default hadoop-site.xml Thanks, Jha On Feb 7, 2008 10:03 AM, Andrzej Bialecki [EMAIL PROTECTED] wrote: DS jha wrote: Yeah - it is using hadoop v 0.15.3 jar file - strange! Thanks Jha On Feb 7, 2008 8:11 AM, Andrzej Bialecki [EMAIL PROTECTED] wrote: DS jha wrote: Hi - Looks like latest trunk version of nutch is failing with the following exception when trying to perform inject operation: java.io.IOException: Target file:/tmp/hadoop-user/mapred/temp/inject-temp-1280136828/_reduce_dv90x0/part-0 already exists Hmm, wait - this path is strange in itself, because it starts with /tmp/hadoop-user ... Are you running on *nix or Windows/Cygwin? Did you change hadoop-site.xml to redefine hadoop.tmp.dir ? Or perhaps you are running as a user with username user ? -- Best regards, Andrzej Bialecki ___. ___ ___ ___ _ _ __ [__ || __|__/|__||\/| Information Retrieval, Semantic Web ___|||__|| \| || | Embedded Unix, System Integration http://www.sigram.com Contact: info at sigram dot com
Re: nutch latest build - inject operation failing
DS jha wrote: Yeah - it is using hadoop v 0.15.3 jar file - strange! Thanks Jha On Feb 7, 2008 8:11 AM, Andrzej Bialecki [EMAIL PROTECTED] wrote: DS jha wrote: Hi - Looks like latest trunk version of nutch is failing with the following exception when trying to perform inject operation: java.io.IOException: Target file:/tmp/hadoop-user/mapred/temp/inject-temp-1280136828/_reduce_dv90x0/part-0 already exists Hmm, wait - this path is strange in itself, because it starts with /tmp/hadoop-user ... Are you running on *nix or Windows/Cygwin? Did you change hadoop-site.xml to redefine hadoop.tmp.dir ? Or perhaps you are running as a user with username user ? -- Best regards, Andrzej Bialecki ___. ___ ___ ___ _ _ __ [__ || __|__/|__||\/| Information Retrieval, Semantic Web ___|||__|| \| || | Embedded Unix, System Integration http://www.sigram.com Contact: info at sigram dot com
[jira] Commented: (NUTCH-602) Allow configurable number of handlers for search servers
[ https://issues.apache.org/jira/browse/NUTCH-602?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12566772#action_12566772 ] Dennis Kubes commented on NUTCH-602: If nobody has any objections I will commit this soon. Allow configurable number of handlers for search servers Key: NUTCH-602 URL: https://issues.apache.org/jira/browse/NUTCH-602 Project: Nutch Issue Type: Improvement Components: searcher Environment: All Reporter: Dennis Kubes Assignee: Dennis Kubes Fix For: 1.0.0 Attachments: NUTCH-602-1-20080205.patch This improvement changes the distributed search server to allow a configurable number of RPC handlers. Before the number was hardcoded at 10 handlers. For high volume environments that limit will be quickly reached and the overall search will slowdown. The patch changes nutch-default.xml with the configuration parameter searchers.num.handlers and changes DistributedSearch to pull the number of handlers from the configuration. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.
[jira] Resolved: (NUTCH-602) Allow configurable number of handlers for search servers
[ https://issues.apache.org/jira/browse/NUTCH-602?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dennis Kubes resolved NUTCH-602. Resolution: Fixed Committed. Allow configurable number of handlers for search servers Key: NUTCH-602 URL: https://issues.apache.org/jira/browse/NUTCH-602 Project: Nutch Issue Type: Improvement Components: searcher Environment: All Reporter: Dennis Kubes Assignee: Dennis Kubes Fix For: 1.0.0 Attachments: NUTCH-602-1-20080205.patch This improvement changes the distributed search server to allow a configurable number of RPC handlers. Before the number was hardcoded at 10 handlers. For high volume environments that limit will be quickly reached and the overall search will slowdown. The patch changes nutch-default.xml with the configuration parameter searchers.num.handlers and changes DistributedSearch to pull the number of handlers from the configuration. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.
Re: nutch latest build - inject operation failing
This is running on Windows/Cygwin, with username 'user' - and it is using default hadoop-site.xml Thanks, Jha On Feb 7, 2008 10:03 AM, Andrzej Bialecki [EMAIL PROTECTED] wrote: DS jha wrote: Yeah - it is using hadoop v 0.15.3 jar file - strange! Thanks Jha On Feb 7, 2008 8:11 AM, Andrzej Bialecki [EMAIL PROTECTED] wrote: DS jha wrote: Hi - Looks like latest trunk version of nutch is failing with the following exception when trying to perform inject operation: java.io.IOException: Target file:/tmp/hadoop-user/mapred/temp/inject-temp-1280136828/_reduce_dv90x0/part-0 already exists Hmm, wait - this path is strange in itself, because it starts with /tmp/hadoop-user ... Are you running on *nix or Windows/Cygwin? Did you change hadoop-site.xml to redefine hadoop.tmp.dir ? Or perhaps you are running as a user with username user ? -- Best regards, Andrzej Bialecki ___. ___ ___ ___ _ _ __ [__ || __|__/|__||\/| Information Retrieval, Semantic Web ___|||__|| \| || | Embedded Unix, System Integration http://www.sigram.com Contact: info at sigram dot com
Re: nutch latest build - inject operation failing
Do you have speculative execution turned on. If so turn it off. Dennis DS jha wrote: This is running on Windows/Cygwin, with username 'user' - and it is using default hadoop-site.xml Thanks, Jha On Feb 7, 2008 10:03 AM, Andrzej Bialecki [EMAIL PROTECTED] wrote: DS jha wrote: Yeah - it is using hadoop v 0.15.3 jar file - strange! Thanks Jha On Feb 7, 2008 8:11 AM, Andrzej Bialecki [EMAIL PROTECTED] wrote: DS jha wrote: Hi - Looks like latest trunk version of nutch is failing with the following exception when trying to perform inject operation: java.io.IOException: Target file:/tmp/hadoop-user/mapred/temp/inject-temp-1280136828/_reduce_dv90x0/part-0 already exists Hmm, wait - this path is strange in itself, because it starts with /tmp/hadoop-user ... Are you running on *nix or Windows/Cygwin? Did you change hadoop-site.xml to redefine hadoop.tmp.dir ? Or perhaps you are running as a user with username user ? -- Best regards, Andrzej Bialecki ___. ___ ___ ___ _ _ __ [__ || __|__/|__||\/| Information Retrieval, Semantic Web ___|||__|| \| || | Embedded Unix, System Integration http://www.sigram.com Contact: info at sigram dot com
[jira] Created: (NUTCH-605) Change deprecated configuration methods for Hadoop
Change deprecated configuration methods for Hadoop -- Key: NUTCH-605 URL: https://issues.apache.org/jira/browse/NUTCH-605 Project: Nutch Issue Type: Improvement Environment: all Reporter: Dennis Kubes Assignee: Dennis Kubes Fix For: 1.0.0 Changes use of the now deprecated addFinalResource and addDefaultResource methods to just use addResouce -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.
[jira] Updated: (NUTCH-605) Change deprecated configuration methods for Hadoop
[ https://issues.apache.org/jira/browse/NUTCH-605?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dennis Kubes updated NUTCH-605: --- Attachment: NUTCH-605-1-20080207.patch Changes deprecated resource methods for hadoop configuration. Change deprecated configuration methods for Hadoop -- Key: NUTCH-605 URL: https://issues.apache.org/jira/browse/NUTCH-605 Project: Nutch Issue Type: Improvement Environment: all Reporter: Dennis Kubes Assignee: Dennis Kubes Fix For: 1.0.0 Attachments: NUTCH-605-1-20080207.patch Changes use of the now deprecated addFinalResource and addDefaultResource methods to just use addResouce -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.
[jira] Commented: (NUTCH-602) Allow configurable number of handlers for search servers
[ https://issues.apache.org/jira/browse/NUTCH-602?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12566909#action_12566909 ] Hudson commented on NUTCH-602: -- Integrated in Nutch-trunk #355 (See [http://hudson.zones.apache.org/hudson/job/Nutch-trunk/355/]) Allow configurable number of handlers for search servers Key: NUTCH-602 URL: https://issues.apache.org/jira/browse/NUTCH-602 Project: Nutch Issue Type: Improvement Components: searcher Environment: All Reporter: Dennis Kubes Assignee: Dennis Kubes Fix For: 1.0.0 Attachments: NUTCH-602-1-20080205.patch This improvement changes the distributed search server to allow a configurable number of RPC handlers. Before the number was hardcoded at 10 handlers. For high volume environments that limit will be quickly reached and the overall search will slowdown. The patch changes nutch-default.xml with the configuration parameter searchers.num.handlers and changes DistributedSearch to pull the number of handlers from the configuration. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.