Any plan to implement this ? I mean move LanguageIdentifier class
intto nutch core.
As I already suggested it on this list, I really would like to move the
LanguageIdentifier class (and profiles) to
an independant Lucene sub-project (and the MimeType repository too).
I don't remember why but
Jérôme Charron wrote:
Any plan to implement this ? I mean move LanguageIdentifier class
intto nutch core.
As I already suggested it on this list, I really would like to move the
LanguageIdentifier class (and profiles) to
an independant Lucene sub-project (and the MimeType repository too).
+1. Other local modifications which I use frequently:
* exporting a list of supported languages,
* exporting an NGramProfile of the analyzed text,
* allow processing of chunks of input (i.e.
LanguageIdentifier.identify(char[] buf, int start, int len) ) - this is
very useful if the text to
Hi,
Protocol-httpclient sets the maximum number of total connections to
fetcher.threads.fetch configuration parameter for underlying
commons-httpclient. However, if -threads argument is used with the fetcher it
doesn't change fetcher.threads.fetch. Giving whatever number of threads to
-threads
[ http://issues.apache.org/jira/browse/NUTCH-127?page=all ]
Stefan Groschupf resolved NUTCH-127:
Resolution: Fixed
I guess it is solved, thanks. If able to reproduce it again I will just reopen
this or a new report.
Thanks!
uncorrect values
Thanks for finding this bug, please open a bug report in jira and if
you like I guess patches are always welcome. :-)
Am 23.01.2006 um 15:00 schrieb [EMAIL PROTECTED]:
Hi,
Protocol-httpclient sets the maximum number of total connections to
fetcher.threads.fetch configuration parameter for
Hi,
I have developed an xml parser plugin. I have test it with nutch 0.7.2.
The parser use namespaces and xpath to do the mapping between XML nodes and
lucene fields.
I'm trying to send the source of the plugin in a zip file but my message is
always rejected (it is considered as a spam).
How can I
Hi:
Due to a bug in the if statement its not possible to use the symlinks
for the shell scripts. Below you will find the patch.
Thanks
Zaheed
---
$ svn diff nutch
Index: nutch
===
--- nutch (revision 371849)