> Did you state previously that you were using a newish version of Red  
> Hat? Red Hat has moved to defaulting things to a UTF-8 
> environment. I  

Yup, Red Hat 8.0


> believe one consequence is that Perl treats text mode filehandles as  
> UTF-8; this has the potential to introduce problems if the input  
> includes anything other than 7-bit ASCII (i.e. characters coded with  
> values greater than 127).

That's good to know.

> 
> You might try putting the filehandle in binary mode and see if that  
> clears up the problem. In order to do this, simply add the line
> 
>    binmode CAT;
> 
> immediately preceding the first use of the handle. That should be  
> somewhere in the neighborhood of line 110 I think (the code 
> should be  
> added before 'while (<CAT>)').

Worked perfectly!!!
Thanks a million Jim!!!

Hava good one!

Dan

> 
> Jim
> 
> On Monday, June 30, 2003, at 12:43 PM, Dan Muey wrote:
> 
> > When indexing pdf's, I accasionally get:
> >
> > Malformed UTF-8 character (unexpected continuation byte 
> 0xad, with no
> > preceding start byte) in substitution ...
> >
> > Always blamed on line 113 and 117 in pdf2html.pl
> >
> > Which is in pdf_body()


-------------------------------------------------------
This SF.Net email sponsored by: Free pre-built ASP.NET sites including
Data Reports, E-commerce, Portals, and Forums are available now.
Download today and enter to win an XBOX or Visual Studio .NET.
http://aspnet.click-url.com/go/psa00100006ave/direct;at.asp_061203_01/01
_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html

Reply via email to