On Fri, Jun 16, 2006 at 10:24:31PM -0700, Gregg Eshelman wrote:

> Is there an easy way to replace Unicode characters
> in HTML with their ASCII/ANSI equivalents?
> A Unicode character in an HTML file is coded like
> #&0028;parenthetical statement#&0029;
> In ASCII/ANSI it's
> (parenthetical statement)
> Does the Plucker converter grok Unicode?
I feel I don't fully understand your problem since Plucker DB format
handles this case by attaching an ASCII character to the Unicode
character inserted into text. This allow readers to choose between them
(mostly to try Unicode first and then fallback to the ASCII one).

If this doesn't work for you, you should check your setup and at least
provide us with the exact versions of your reader and distiller.

It's also possible that you have somewhat broken HTML files to build
your documents.

Unfortunately, I cannot be confident about the current Plucker software
in this respect since I don't use Palm Plucker reader anymore, but the
Python distiller 1.8 handles such Unicode characters as expected.

> What'd be nice is a small program that would
> automatically find and replace the codes with the
> characters. That'd lose things like left and right
> quotes, but on a small PDA screen it makes little
> difference. For characters with no direct equivalent,
> like the lozenge, the program could allow the user
> to select an ASCII/ANSI character to replace it.
Perl is your friend. Read the documentation about the Encode core
module. It supports HTML and XML entities.

_______________________________________________
plucker-dev mailing list
plucker-dev@rubberchicken.org
http://lists.rubberchicken.org/mailman/listinfo/plucker-dev

Reply via email to