Re: PDFBox (Re: Nutch Lockup/Freeze (Fetcher) - HELP!!)

2005-06-29 Thread Juho Mäkinen
On 6/29/05, Andrzej Bialecki <[EMAIL PROTECTED]> wrote: > Juho Mäkinen wrote: > > I did some research and I traced the problem to be somewhere inside > > HttpRequest of protocol-httpclient. > > If you enabled the PDF parser, the version of PDFBox that is currently > in SVN is known to be broken -

Re: PDFBox (Re: Nutch Lockup/Freeze (Fetcher) - HELP!!)

2005-06-28 Thread Andrzej Bialecki
Naveen K Kohli wrote: Could you please provide some insight into the changes that you made to fix PDFBox code? I would like to update the source code for PDFBox on my end too. I didn't fix it - the PDFBox author did. Please refer to the forums at http://www.pdfbox.org , around middle of April

Re: PDFBox (Re: Nutch Lockup/Freeze (Fetcher) - HELP!!)

2005-06-28 Thread Naveen K Kohli
Could you please provide some insight into the changes that you made to fix PDFBox code? I would like to update the source code for PDFBox on my end too.   Thanks   --Naveen K Kohli http://www.netomatix.com -- Original message from Andrzej Bialecki <[EMAIL PROTECTED]>: ---

PDFBox (Re: Nutch Lockup/Freeze (Fetcher) - HELP!!)

2005-06-28 Thread Andrzej Bialecki
Juho Mäkinen wrote: I did some research and I traced the problem to be somewhere inside HttpRequest of protocol-httpclient. If you enabled the PDF parser, the version of PDFBox that is currently in SVN is known to be broken - for some PDFs a bug in CMap handling can cause an endless loop. Ple