Hello Dave,

When you refer to output, are you writing the document to System.out or
System.err? Those streams (and every other PrintStream) will write strings
in your platform's native encoding, and if a particular character cannot
be represented, it's written as a '?'. If you want to write Java strings
to a file or elsewhere in a particular encoding, you should have a look at
java.io.OutputStreamWriter.

Hope that helps.

On Wed, 23 Jul 2003, Dave Carlson wrote:

> Hi, I am new to this list. We are having a problem. We have documents with
> UTF characters in them (ie   == space), and when we parse them on
> Windows, we get the correct output. However, when we parse them on Unix,
> we get either a "?" or "\040". We have been working with the Unix LANG
> environment variable, as well as "file.encoding"> We can't get this to
> output the correct value. Any assistance would be greatly appreciated.
>
> Thanks

--------------------
Michael Glavassevich
[EMAIL PROTECTED]

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to