Changing http.content.limit to -1 fixed the truncated PDF problem for me.

Al

-----Original Message-----
From: "Håvard W. Kongsgård" [mailto:[EMAIL PROTECTED] 
Sent: Wednesday, November 16, 2005 7:50 AM
To: [email protected]
Subject: Re: PDF indexing support?


Have now added conf/nutch-site.xml but still the same problem. | Related 
to the problem? http://sourceforge.net/forum/message.php?msg_id=3391668 
http://sourceforge.net/forum/message.php?msg_id=3398773

>
> <?xml version="1.0"?>
> <?xml-stylesheet type="text/xsl" href="nutch-conf.xsl"?> <nutch-conf>
> <property>
>  <name>http.content.limit</name>
>  <value>45451515565536</value>
>  <description>The length limit for downloaded content, in bytes.
>  If this value is nonnegative (>=0), content longer than it will be 
> truncated;
>  otherwise, no truncation at all.
>  </description>
> </property>
> </nutch-conf>
>
> Håvard W. Kongsgård wrote:
>
>> HTTP
>>
>>
>> Sébastien LE CALLONNEC wrote:
>>
>>> Hej Håvard,
>>>
>>> That's because you have to create one yourself.  The values you will 
>>> set in there will override the default values.
>>>
>>> Here are a few more questions to try to solve your problem: where is 
>>> your PDF located?  What protocol is used to fetch it (HTTP, FTP, 
>>> etc.)?
>>>
>>>
>>> Regards,
>>> /sebastien
>>>
>>> --- "Håvard W. Kongsgård" <[EMAIL PROTECTED]> a écrit :
>>>
>>>  
>>>
>>>> Don't have a conf/nutch-site.xml
>>>>
>>>>
>>>>
>>>> Jérôme Charron wrote:
>>>>
>>>>  
>>>>
>>>>>> conf/nutch-default
>>>>>>  
>>>>>>       
>>>>>
>>>>>
>>>>> Checks that they are not overrided in the conf/nutch-site If no, 
>>>>> sorry, no more idea for now :-(
>>>>>
>>>>> Jérôme
>>>>>
>>>>> --
>>>>> http://motrech.free.fr/
>>>>> http://www.frutch.org/
>>>>>
>>>>>
>>>>>
>>>>>     
>>>>
>>>>
>>>> -------------------------------------------------------------------
>>>> -----
>>>>
>>>>  
>>>>
>>>>> No virus found in this incoming message.
>>>>> Checked by AVG Free Edition.
>>>>> Version: 7.1.362 / Virus Database: 267.13.1/169 - Release Date:
>>>>>     
>>>>
>>>>
>>>> 15.11.2005
>>>>  
>>>>
>>>>>
>>>>>
>>>>>     
>>>>
>>>>
>>>>   
>>>
>>>
>>>
>>>
>>>
>>>    
>>>            
>>> ____________________________________________________________________
>>> _______
>>> Appel audio GRATUIT partout dans le monde avec le nouveau Yahoo! 
>>> Messenger Téléchargez cette version sur http://fr.messenger.yahoo.com
>>>
>>>
>>>  
>>>
>>
>>
>>
>


-------------------------------------------------------
This SF.Net email is sponsored by the JBoss Inc.  Get Certified Today
Register for a JBoss Training Course.  Free Certification Exam
for All Training Attendees Through End of 2005. For more info visit:
http://ads.osdn.com/?ad_idv28&alloc_id845&op=click
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general

Reply via email to