Hi,
Tika is parsing properly now. I think it was some kind of proxy issue,
combined with the http.content.limit setting.
Thanks!
Remi
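For anyone finding this thread later, here is a rough sketch of the kind of nutch-site.xml overrides discussed here. The values are assumptions for illustration, not the exact settings I used: a negative http.content.limit turns off truncation of downloaded content, and the plugin.includes list shown is only an example whose key point is that parse-tika is enabled.

```xml
<!-- nutch-site.xml: overrides for nutch-default.xml (illustrative values only) -->
<configuration>
  <property>
    <name>http.content.limit</name>
    <!-- Maximum bytes downloaded per document; a negative value disables truncation -->
    <value>-1</value>
  </property>
  <property>
    <name>plugin.includes</name>
    <!-- Assumed plugin list; what matters is that parse-tika is part of it -->
    <value>protocol-http|urlfilter-regex|parse-(html|tika)|index-(basic|anchor)|scoring-opic|urlnormalizer-(pass|regex|basic)</value>
  </property>
</configuration>
```

A truncated spreadsheet is usually not parseable, so the content limit is worth checking before suspecting the parser itself.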
On Fri, Feb 10, 2012 at 11:16 PM, Lewis John Mcgibbney
lewis.mcgibb...@gmail.com wrote:
Hi Remi,
Please ensure that your http.content.limit is sufficient, what are you url
OK, I just did (it's great, but I had been reluctant because recompiling
always gives me errors).
However, I'm still getting a similar error:
$ bin/nutch parsechecker http://URL
fetching: http://URL
parsing: http://URL
contentType: application/ms-excel
-
Url
---
With the nutch parsechecker command I get the following error message:
Error: Could not find or load main class parsechecker
This doesn't sound good!
On Tue, Feb 7, 2012 at 9:58 AM, remi tassing tassingr...@gmail.com wrote:
The point that made me start thinking is because I got this error
Hey guys,
I checked the mailing-list archive but couldn't get an answer on this. I
think CSV and TXT don't need any kind of parsing, but how are they handled
by default?
Remi
Upgrade to 1.4.
What made me start thinking about this is that I got this error message:
failed(2,0): Can't retrieve Tika parser for mime-type application/ms-excel
I'm using Nutch-1.2 and my nutch-site.xml has:
<property>
<name>plugin.includes</name>