What i am trying to do is be able to read text from a page encoded in UTF-8.
I can see the Japanese text in the browser and can copy it into Excel or WordPad.
So i figure i have the fonts i need. But when i use Ruby WIN32OLE to get the text from Excel or IE, i only get "????".
If it were in an odd encoding, i could try to use iconv or pack/unpack or something to convert it. But i can't get the text into Ruby. I'm unsure whether WIN32OLE is limited to ANSI.
If i understand correctly (probably not) Shift JIS is just a different code page, rather than a different encoding. So that would explain why it could work.
I don't much care about regexp right now. Bret At 05:27 PM 8/31/2005, Peter Chau wrote:
I've had scripts running on a Shift-JIS website to read/write Japanese. I had more problems configuring Windows, DOS, and the text editor than I did with Ruby/Watir. >From what I remember, I had to install an East Asian Language Package in Control Panel, Regional and Language Options and then activate Japanese as a non-unicode program in the advance tab. To set up my text editor. I had to use a Japanese Font (Arial Unicode MS Japanese script). I haven't tried Regexp with Japanese characters. Peter _______________________________________________ Wtr-general mailing list [email protected] http://rubyforge.org/mailman/listinfo/wtr-general
_____________________ Bret Pettichord www.pettichord.com _______________________________________________ Wtr-general mailing list [email protected] http://rubyforge.org/mailman/listinfo/wtr-general
