Re: duplicate libs

2006-02-14 Thread Dawid Weiss
Yes, there is an easier way. Implement a custom task to which you'll pass a path to plugin.xml and a name for a path. The task (Java code) will create a named (id) object which can be subsequently used in ant with . This requires a custom ant task, but as you mentioned foreach is also a se

[jira] Created: (NUTCH-210) Context.xml file for Nutch web application

2006-02-14 Thread Chris A. Mattmann (JIRA)
Context.xml file for Nutch web application -- Key: NUTCH-210 URL: http://issues.apache.org/jira/browse/NUTCH-210 Project: Nutch Type: Improvement Components: web gui Versions: 0.7.1, 0.7, 0.6, 0.7.2-dev, 0.8-dev En

[jira] Commented: (NUTCH-140) Add alias capability in parse-plugins.xml file that allows mimeType->extensionId mapping

2006-02-14 Thread Chris A. Mattmann (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-140?page=comments#action_12366376 ] Chris A. Mattmann commented on NUTCH-140: - Hi Folks, I've went ahead and created an initial patch for this issue. I'll be attaching it to JIRA within the next day for

[jira] Resolved: (NUTCH-90) reduce logging output of IndexSegment

2006-02-14 Thread Sami Siren (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-90?page=all ] Sami Siren resolved NUTCH-90: - Resolution: Invalid doesn't seem to apply anymore > reduce logging output of IndexSegment > - > > Key: NUTCH-90 >

[jira] Resolved: (NUTCH-64) no results after a restart of a search--server (without tomcat restart)

2006-02-14 Thread Sami Siren (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-64?page=all ] Sami Siren resolved NUTCH-64: - Resolution: Duplicate duplicate with NUTCH-14 > no results after a restart of a search--server (without tomcat restart) >

[jira] Assigned: (NUTCH-48) "Did you mean" query enhancement/refignment feature request

2006-02-14 Thread Sami Siren (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-48?page=all ] Sami Siren reassigned NUTCH-48: --- Assign To: Sami Siren > "Did you mean" query enhancement/refignment feature request > > > Key

[jira] Resolved: (NUTCH-184) Serbian (sr, Cyrilic) and Serbo-Croatian (sh, Latin) translation

2006-02-14 Thread Sami Siren (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-184?page=all ] Sami Siren resolved NUTCH-184: -- Fix Version: 0.8-dev Resolution: Fixed This is now committed, thank you! > Serbian (sr, Cyrilic) and Serbo-Croatian (sh, Latin) translation >

Re: duplicate libs

2006-02-14 Thread Jérôme Charron
Is there any ant guru in the nutch-dev list? Since the number of plugins increase in nutch and that dependencies becomes more and more used, I would like to add in build-plugin.xml the capability to dynamicaly add into the classpath the dependencies defined in a plugin.xml file (this avoid to decla

Which extension point should I extend?

2006-02-14 Thread Elwin
When using nutch to crawl some sites, I want to index fetched contents selectively only when the urls to these contents fit my filter, for other urls I just want nutch to crawl them and parse them without index. How can I achieve this? Which extension point should I extend?

Re: duplicate libs

2006-02-14 Thread Doug Cutting
Jérôme Charron wrote: Isn't it already reported in http://issues.apache.org/jira/browse/NUTCH-196? Yes, you're right. I have still provided a patch for a log4j lib. If there is no objection, I will commit it and go ahead for * lib-commons-httpclient * lib-nekohtml +1 Thanks! Doug

[jira] Closed: (NUTCH-123) Cache.jsp some times generate NullPointerException

2006-02-14 Thread Sami Siren (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-123?page=all ] Sami Siren closed NUTCH-123: Fix Version: 0.8-dev Resolution: Duplicate problem reported to be fixed in NUTCH-135 > Cache.jsp some times generate NullPointerException > -

[jira] Commented: (NUTCH-165) object pooling for nutch bean --- to impriove performance

2006-02-14 Thread Sami Siren (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-165?page=comments#action_12366349 ] Sami Siren commented on NUTCH-165: -- NutchBean is allready cached in application context by servlet container isn't this suffient? > object pooling for nutch bean --- to impri

[jira] Assigned: (NUTCH-184) Serbian (sr, Cyrilic) and Serbo-Croatian (sh, Latin) translation

2006-02-14 Thread Sami Siren (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-184?page=all ] Sami Siren reassigned NUTCH-184: Assign To: Sami Siren > Serbian (sr, Cyrilic) and Serbo-Croatian (sh, Latin) translation > > >

[jira] Resolved: (NUTCH-118) FAQ link points to invalid URL

2006-02-14 Thread Sami Siren (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-118?page=all ] Sami Siren resolved NUTCH-118: -- Fix Version: 0.8-dev Resolution: Fixed fixed as related to NUTCH-81 > FAQ link points to invalid URL > -- > > Key: N

[jira] Resolved: (NUTCH-137) footer is not displayed in search result page

2006-02-14 Thread Sami Siren (JIRA)
[ http://issues.apache.org/jira/browse/NUTCH-137?page=all ] Sami Siren resolved NUTCH-137: -- Fix Version: 0.8-dev Resolution: Fixed fixed as related to NUTCH-81 > footer is not displayed in search result page > ---

Plugin dependencies

2006-02-14 Thread Enrico Triolo
Hi all, one of my plugins needs that the language-identifier be executed before it, but at the moment there's no way of assuring this behaviour (correct me if I'm wrong...). I patched a couple of classes to allow correct ordering of execution for plugins based on dependencies. The patch is very si

Re: duplicate libs

2006-02-14 Thread Michael Nebel
Hi, when you consolidate the libs, perhaps you can add a version of xalan. This seems to be needed by the OpenSearchServlet. But I'm not entirely sure that it's not a broknen tomcat installation of mine. Can someone please verify my observation? Regards Michael -- Michael Nebel htt

Re: duplicate libs

2006-02-14 Thread Dawid Weiss
log4j-1.2.11.jar src/plugin/clustering-carrot2/lib log4j-1.2.6.jar 1 src/plugin/parse-rss/lib log4j-1.2.9.jar src/plugin/parse-pdf/lib nekohtml-0.9.2.jarsrc/plugin/clustering-carrot2/lib nekohtml-0.9.4.jarsrc/plu

A little hack: retrieve only new urls

2006-02-14 Thread Enrico Triolo
Hi, since I had a little problem and noticed that someone else had it too, I tried to solve it in my own way... First let me explain the problem: I'm developing an application in which the user can classify web resources inside a taxonomy. The user chooses a seed url and defines the crawling depth

Re: duplicate libs

2006-02-14 Thread Jérôme Charron
> > There are a number of duplicated libs in the plugins, namely: Isn't it already reported in http://issues.apache.org/jira/browse/NUTCH-196? I have still provided a patch for a log4j lib. If there is no objection, I will commit it and go ahead for * lib-commons-httpclient * lib-nekohtml Jérôme