[jira] Commented: (NUTCH-567) Proper (?) handling of URIs in TagSoup.

2007-10-18 Thread Dawid Weiss (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-567?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12535853 ] Dawid Weiss commented on NUTCH-567: --- Don't mention it. Happy birthday and I hope it'll work for you. If you take a

[jira] Commented: (NUTCH-488) Avoid parsing uneccessary links and get a more relevant outlink list

2007-10-18 Thread JIRA
[ https://issues.apache.org/jira/browse/NUTCH-488?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12535855 ] Doğacan Güney commented on NUTCH-488: - I haven't tested it but the code looks good and clean. +1 from me.

Re: writing a new parse-exe plugin

2007-10-18 Thread eyal edri
Found how to associate multiple contentTypes to a certain plugin: just add the content type to the conf/parse-plugins.xml file: (the plugin can take more than one type) mimeType name=application/x-dosexec plugin id=parse-exe / /mimeType On 10/18/07, eyal edri [EMAIL

[jira] Commented: (NUTCH-488) Avoid parsing uneccessary links and get a more relevant outlink list

2007-10-18 Thread Andrzej Bialecki (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-488?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12535875 ] Andrzej Bialecki commented on NUTCH-488: - +1. The nutch-default description has a typo, instead of coma it

Re: writing a new parse-exe plugin [NullPointerException]

2007-10-18 Thread eyal edri
hello again, I've added a printStackTrace to where the fetcher throws the exception: java.lang.NullPointerException at org.apache.hadoop.io.Text.encode(Text.java:375) at org.apache.hadoop.io.Text.encode(Text.java:356) at

[jira] Commented: (NUTCH-565) Arc File to Nutch Segments Converter

2007-10-18 Thread Andrzej Bialecki (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-565?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12535943 ] Andrzej Bialecki commented on NUTCH-565: - +1 overall. One question: shouldn't we put this under

[jira] Commented: (NUTCH-565) Arc File to Nutch Segments Converter

2007-10-18 Thread Dennis Kubes (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-565?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12535960 ] Dennis Kubes commented on NUTCH-565: Yeah, I didn't really know where to put this as it is a tool but it is also

Re: Scoring API issues (LONG)

2007-10-18 Thread Sami Siren
Andrzej Bialecki wrote: Hi all, I've been working recently on a custom scoring plugin, and I found out some issues with the scoring API that severely limit the way we can calculate static page scores. I'd like to restart the discussion about this API, and propose some changes. Any comments

Re: Scoring API issues (LONG)

2007-10-18 Thread Andrzej Bialecki
Sami Siren wrote: Andrzej Bialecki wrote: Hi all, I've been working recently on a custom scoring plugin, and I found out some issues with the scoring API that severely limit the way we can calculate static page scores. I'd like to restart the discussion about this API, and propose some

[jira] Resolved: (NUTCH-488) Avoid parsing uneccessary links and get a more relevant outlink list

2007-10-18 Thread Dennis Kubes (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-488?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dennis Kubes resolved NUTCH-488. Resolution: Fixed Fix Version/s: 1.0.0 Assignee: Emmanuel Joke This has been

[jira] Closed: (NUTCH-488) Avoid parsing uneccessary links and get a more relevant outlink list

2007-10-18 Thread Dennis Kubes (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-488?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dennis Kubes closed NUTCH-488. -- Closed Avoid parsing uneccessary links and get a more relevant outlink list

JIRA, Resolving and Closing Issues

2007-10-18 Thread Dennis Kubes
Quick question about Jira. When we commit, are we supposed to first resolve and then close the issue. What is the process on this. Dennis Kubes

Re: JIRA, Resolving and Closing Issues

2007-10-18 Thread Chris Mattmann
Dennis, My practice has been to do the following: 1. Resolve the issue, and describe (at a high level), the changes made to the code, e.g., *Introduced new classes A, B, C *Refactored method Y out of class D and into new class E *made internal method F of class G use member variable as

Re: JIRA, Resolving and Closing Issues

2007-10-18 Thread Sami Siren
I think for example lucene and hadoop follow different process where issues are resolved after successful commits and (bulk)closed on release. Manual addition of revision number is unnecessary when you add issue number into commit message, in that case JIRA will show this information (and some

[jira] Commented: (NUTCH-488) Avoid parsing uneccessary links and get a more relevant outlink list

2007-10-18 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-488?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12536102 ] Hudson commented on NUTCH-488: -- Integrated in Nutch-Nightly #241 (See