Its because the size of the maximum content size. Change
the content.limit values in your site configuration file.


On Tue, 31 May 2005 10:11:02 -0500
 "Kyle Gabhart" <[EMAIL PROTECTED]> wrote:
> I have a large number of documents on our intranet (about
> 1000) that are indexed by nutch (version 0.6).  On about
> 1/3 of those documents I get the following error:
> 
> 050529 011245 fetch okay, but can't parse PATH_TO_FILE,
> reason: Content truncated at 65536 bytes. Parser can't
> handle incomplete msword file. 
> The same happens on some PDF files.  Any ideas?
> 
> -KG
> 
> 

_____________________________________________________________________
For super low premiums, click here http://www.dialdirect.co.za/quote

Reply via email to