On Sat, 7 Sep 2002 21:05:13 -0400 Rick Dillon <[EMAIL PROTECTED]> wrote:
> Hello. > > I am currently populating html pages with content from MS Excel. I am > using a Java program that literally places the Excel content directly > into the output code (which is saved as html). It appears that Excel > is using Unicode characters, which is causing strange glyphs when the > html is viewed in a browser. Is there a Perl Way to parse the output > and replace the Unicode characters with asciii, or UTF-8 equivalents? > And do I need to upgrade to perl 5.6 to do this? > > Thanks, > > Rick Dillon I think Spreadsheet/ParseExcel.pm and Spreadsheet/WriteExcel.pm should provide a good solution. http://search.cpan.org/author/KWITKNR/Spreadsheet-ParseExcel-0.2602/ http://search.cpan.org/author/JMCNAMARA/Spreadsheet-WriteExcel-0.38/ To transcode Unicode, Unicode/String.pm is good. http://search.cpan.org/author/GAAS/Unicode-String-2.06/ If you need not to use regular expression operators, like m// and s///, for Unicode, Perl 5.005 or 5.6.1 may be still used with the modules mentioned above. Otherwise Perl 5.8.0 should be strongly recommended. SADAHIRO Tomoyuki