Changing http.content.limit to -1 fixed the truncated PDF problem for me.

Al
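
For reference, the override in conf/nutch-site.xml would look roughly like the sketch below. It reuses the nutch-conf layout from the quoted message and assumes nothing else in the file needs to change. Per the property's own description, a negative value means no truncation at all.

```xml
<?xml version="1.0"?>
<?xml-stylesheet type="text/xsl" href="nutch-conf.xsl"?>
<nutch-conf>
  <property>
    <name>http.content.limit</name>
    <!-- Negative value: downloaded content is not truncated. -->
    <value>-1</value>
  </property>
</nutch-conf>
```

Values set in conf/nutch-site.xml override the defaults, so this single property should be enough; the PDFs most likely need to be fetched again before the change shows up in the index.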
-----Original Message-----
From: "Håvard W. Kongsgård" [mailto:[EMAIL PROTECTED]
Sent: Wednesday, November 16, 2005 7:50 AM
To: [email protected]
Subject: Re: PDF indexing support?

Have now added conf/nutch-site.xml but still the same problem.

Related to the problem?
http://sourceforge.net/forum/message.php?msg_id=3391668
http://sourceforge.net/forum/message.php?msg_id=3398773

> <?xml version="1.0"?>
> <?xml-stylesheet type="text/xsl" href="nutch-conf.xsl"?>
> <nutch-conf>
> <property>
> <name>http.content.limit</name>
> <value>45451515565536</value>
> <description>The length limit for downloaded content, in bytes.
> If this value is nonnegative (>=0), content longer than it will be
> truncated;
> otherwise, no truncation at all.
> </description>
> </property>
> </nutch-conf>
>
> Håvard W. Kongsgård wrote:
>
>> HTTP
>>
>> Sébastien LE CALLONNEC wrote:
>>
>>> Hej Håvard,
>>>
>>> That's because you have to create one yourself. The values you will
>>> set in there will override the default values.
>>>
>>> Here are a few more questions to try to solve your problem: where is
>>> your PDF located? What protocol is used to fetch it (HTTP, FTP,
>>> etc.)?
>>>
>>> Regards,
>>> /sebastien
>>>
>>> --- "Håvard W. Kongsgård" <[EMAIL PROTECTED]> wrote:
>>>
>>>> Don't have a conf/nutch-site.xml
>>>>
>>>> Jérôme Charron wrote:
>>>>
>>>>>> conf/nutch-default
>>>>>
>>>>> Checks that they are not overrided in the conf/nutch-site If no,
>>>>> sorry, no more idea for now :-(
>>>>>
>>>>> Jérôme
>>>>>
>>>>> --
>>>>> http://motrech.free.fr/
>>>>> http://www.frutch.org/

_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general
