Re: [NTG-context] Arabic-utf-8 (plus a sample)

2004-06-06 Thread George N. White III
On Sat, 5 Jun 2004, Idris Samawi Hamid wrote: Hi gang, For Arabic we use a Latin transcription in Aleph/(e-)Omega (or even ArabTeX) unless one of the encoding filters like utf-8 is used. Even for utf-8 files, however, it would be very useful to be able to convert a utf-8 file to Latin transcript

Re: Re[4]: [NTG-context] Arabic-utf-8 (plus a sample)

2004-06-05 Thread Idris Samawi Hamid
On Sat, 05 Jun 2004 18:19:22 -0600, Idris Samawi Hamid <[EMAIL PROTECTED]> wrote: One question: The hex for e.g. alif is 0627; how did you get D8A7 from that for purposes of the script (so I can follow along for the rest)? Ok, I found it: If you want to add more conversions, open your unicode fi

Re: Re[4]: [NTG-context] Arabic-utf-8 (plus a sample)

2004-06-05 Thread Idris Samawi Hamid
On Sun, 6 Jun 2004 01:58:44 +0200, Giuseppe Bilotta <[EMAIL PROTECTED]> wrote: == #!/usr/bin/perl use strict; use warnings; open(NEW,">new.tex"); #opens file to print out the result while (<>) { #this opens the file for reading $_ =~ s/\xD8\xA7/A/g; #this is the actual conversion $_ =~ s/\xD8\xA8

Re:[NTG-context] Arabic-utf-8 (plus a sample)

2004-06-05 Thread Idris Samawi Hamid
On Sun, 6 Jun 2004 11:08:46 +1200, Richard MAHONEY <[EMAIL PROTECTED]> wrote: You might like to look at some of the encoding conversion scripts at: http://homepages.comnet.co.nz/~r-mahoney/scripts/scripts.html N.B. For sorting utf-8 Arabic you might find the perl module `Sort::ArbBiLex' useful T

Re[4]: [NTG-context] Arabic-utf-8 (plus a sample)

2004-06-05 Thread Giuseppe Bilotta
Sunday, June 6, 2004 Idris Samawi Hamid wrote: > Hi Giuseppe (Is it not way past your bedtime;->), Yes it is, and it shows. But since I'm up and not having any particular urge to go to bed in this very moment, here's a tested alternative that works here: == #!/usr/bin/perl use strict; use warni

Re: Re[2]: [NTG-context] Arabic-utf-8 (plus a sample)

2004-06-05 Thread Idris Samawi Hamid
On Sun, 6 Jun 2004 01:15:56 +0200, Giuseppe Bilotta <[EMAIL PROTECTED]> wrote: My take: try the following (should work even with ActiveState 5.6) === #!/usr/bin/perl use strict; #D comment the following, I think we can do without # use open ':utf8'; open(NEW,">new.tex"); #opens file to print out

Re[2]: [NTG-context] Arabic-utf-8 (plus a sample)

2004-06-05 Thread Giuseppe Bilotta
Sunday, June 6, 2004 Idris Samawi Hamid wrote: > Here is the whole file once again: > == > #!/usr/bin/perl -w > use strict; > use open ':utf8'; > open(NEW,">new.tex"); #opens file to print out the result > while (<>); { #this opens the file for reading > $_ =~

Re: [SPAM: 3.411] [NTG-context] Arabic-utf-8 (plus a sample)

2004-06-05 Thread Richard MAHONEY
On Sat, Jun 05, 2004 at 01:32:35PM -0600, Idris Samawi Hamid wrote: > Hi gang, > > For Arabic we use a Latin transcription in Aleph/(e-)Omega (or even > ArabTeX) unless one of the encoding filters like utf-8 is used. Even for > utf-8 files, however, it would be very useful to be able to convert

Re: [NTG-context] Arabic-utf-8 (plus a sample)

2004-06-05 Thread Idris Samawi Hamid
On Sat, 05 Jun 2004 23:48:18 +0200, Thomas A. Schmitz <[EMAIL PROTECTED]> wrote: Just a quick reply (it's bedtime over here): there may be 2 problems. Ok, get some sleep;-) Anyhow, I fixed the line break (is the space between tilda and `s' correct?) == $_ =~ s/\x{062

Re: [NTG-context] Arabic-utf-8 (plus a sample)

2004-06-05 Thread Thomas A. Schmitz
Just a quick reply (it's bedtime over here): there may be 2 problems. 1 is that the mail program put in an unwanted linebreak after the =~ part, just remove it; it should all be one line. And then: you'll need a fairly recent version of perl for it to work, what do you get when you do perl --versi

Re: [NTG-context] Arabic-utf-8 (plus a sample)

2004-06-05 Thread Idris Samawi Hamid
On Sat, 05 Jun 2004 22:41:39 +0200, Thomas A. Schmitz <[EMAIL PROTECTED]> wrote: Idris, I know a bit of perl and would love to help. However, I fear that sending us your stuff via mail will be a bit difficult because the utf-8 chracters get transformed into gibberish. Thnx 4 such a speedy reply!

Re: [NTG-context] Arabic-utf-8 (plus a sample)

2004-06-05 Thread Thomas A. Schmitz
Idris, I know a bit of perl and would love to help. However, I fear that sending us your stuff via mail will be a bit difficult because the utf-8 chracters get transformed into gibberish. Could you send the hexadecimal code of the characters you want to convert? Or I could simply give you the synt

[NTG-context] Arabic-utf-8 (plus a sample)

2004-06-05 Thread Idris Samawi Hamid
Hi gang, For Arabic we use a Latin transcription in Aleph/(e-)Omega (or even ArabTeX) unless one of the encoding filters like utf-8 is used. Even for utf-8 files, however, it would be very useful to be able to convert a utf-8 file to Latin transcription for further processing by Aleph/(e-)Omega