What I am trying to do is read text from a page encoded in UTF-8.

I can see the Japanese text in the browser and can copy it into Excel or WordPad.

So I figure I have the fonts I need. But when I use Ruby WIN32OLE to get the text from Excel or IE, I only get "????".

If it were in an odd encoding, I could try to use iconv or pack/unpack or something to convert it. But I can't even get the text into Ruby intact. I'm unsure whether WIN32OLE is limited to ANSI.
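One way to check whether the characters are already gone by the time the string reaches Ruby is to look at its raw bytes with unpack. This is only a sketch: `received` is a hypothetical stand-in for whatever string WIN32OLE hands back, not real output from Excel or IE. If every byte is 0x3F (a literal question mark), no later conversion can recover the text.

```ruby
# encoding: utf-8
# Hypothetical stand-in for the string returned through WIN32OLE.
received = "????"

# "C*" unpacks the string into unsigned byte values.
bytes = received.unpack("C*")

# If every byte is 0x3F, the text is literal question marks and the
# original characters were lost before Ruby ever saw them.
lost = bytes.all? { |b| b == 0x3F }

# For comparison: the bytes and code points of real UTF-8 Japanese text.
sample = "日本"
utf8_bytes = sample.unpack("C*")   # raw UTF-8 bytes
codepoints = sample.unpack("U*")   # Unicode code points

puts "bytes received: #{bytes.inspect}, lost: #{lost}"
puts "UTF-8 bytes:    #{utf8_bytes.inspect}"
puts "code points:    #{codepoints.inspect}"
```

If the bytes turn out to be something other than 0x3F, the text may still be recoverable with a conversion step.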

If I understand correctly (probably not), Shift JIS is just a different code page rather than a different encoding, which would explain why it works in your case.
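To illustrate the guess above, here is a minimal sketch of how a lossy conversion to a Western ANSI code page could produce question marks while a Shift_JIS conversion keeps the text. It assumes a Ruby new enough to have String#encode (1.9 or later), and it only simulates the conversion in plain Ruby. Whether WIN32OLE actually does this internally is an assumption, not something confirmed here.

```ruby
# encoding: utf-8
# Sample Japanese text held as UTF-8.
text = "日本語"

# Convert to a Western ANSI code page. These characters have no mapping
# there, so each one is replaced with "?".
ansi = text.encode("Windows-1252", invalid: :replace, undef: :replace, replace: "?")

# Convert to Shift_JIS instead. The characters are representable,
# so nothing is lost and the round trip back to UTF-8 is exact.
sjis = text.encode("Shift_JIS")
round_trip = sjis.encode("UTF-8")

puts ansi
puts sjis.bytesize
puts round_trip == text
```

The point of the comparison is that with a Japanese code page active the text survives, while with a Western one it cannot.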

I don't much care about regexp right now.

Bret

At 05:27 PM 8/31/2005, Peter Chau wrote:
I've had scripts running on a Shift-JIS website to read/write Japanese.
I had more problems configuring Windows, DOS, and the text editor than I
did with Ruby/Watir.

From what I remember, I had to install an East Asian language package in
Control Panel, Regional and Language Options, and then select Japanese
as the language for non-Unicode programs on the Advanced tab. To set up my
text editor, I had to use a Japanese font (Arial Unicode MS, Japanese
script). I haven't tried Regexp with Japanese characters.

Peter


_______________________________________________
Wtr-general mailing list
[email protected]
http://rubyforge.org/mailman/listinfo/wtr-general

_____________________
 Bret Pettichord
 www.pettichord.com
