On Sat, 5 Jun 2004, Idris Samawi Hamid wrote:
Hi gang,
For Arabic we use a Latin transcription in Aleph/(e-)Omega (or even ArabTeX)
unless one of the encoding filters like utf-8 is used. Even for utf-8 files,
however, it would be very useful to be able to convert a utf-8 file to Latin
transcript
On Sat, 05 Jun 2004 18:19:22 -0600, Idris Samawi Hamid
<[EMAIL PROTECTED]> wrote:
One question: The hex for e.g. alif is 0627; how did you get D8A7 from
that for purposes of the script (so I can follow along for the rest)?
Ok, I found it:
If you want to add more conversions, open your unicode fi
On Sun, 6 Jun 2004 01:58:44 +0200, Giuseppe Bilotta <[EMAIL PROTECTED]>
wrote:
==
#!/usr/bin/perl
use strict;
use warnings;
open(NEW,">new.tex"); #opens file to print out the result
while (<>) { #this opens the file for reading
$_ =~ s/\xD8\xA7/A/g; #this is the actual conversion
$_ =~ s/\xD8\xA8
On Sun, 6 Jun 2004 11:08:46 +1200, Richard MAHONEY
<[EMAIL PROTECTED]> wrote:
You might like to look at some of the encoding conversion scripts at:
http://homepages.comnet.co.nz/~r-mahoney/scripts/scripts.html
N.B. For sorting utf-8 Arabic you might find the perl module
`Sort::ArbBiLex' useful
T
Sunday, June 6, 2004 Idris Samawi Hamid wrote:
> Hi Giuseppe (Is it not way past your bedtime;->),
Yes it is, and it shows. But since I'm up and not having any
particular urge to go to bed in this very moment, here's a
tested alternative that works here:
==
#!/usr/bin/perl
use strict;
use warni
On Sun, 6 Jun 2004 01:15:56 +0200, Giuseppe Bilotta <[EMAIL PROTECTED]>
wrote:
My take: try the following (should work even with ActiveState
5.6)
===
#!/usr/bin/perl
use strict;
#D comment the following, I think we can do without
# use open ':utf8';
open(NEW,">new.tex"); #opens file to print out
Sunday, June 6, 2004 Idris Samawi Hamid wrote:
> Here is the whole file once again:
> ==
> #!/usr/bin/perl -w
> use strict;
> use open ':utf8';
> open(NEW,">new.tex"); #opens file to print out the result
> while (<>); { #this opens the file for reading
> $_ =~
On Sat, Jun 05, 2004 at 01:32:35PM -0600, Idris Samawi Hamid wrote:
> Hi gang,
>
> For Arabic we use a Latin transcription in Aleph/(e-)Omega (or even
> ArabTeX) unless one of the encoding filters like utf-8 is used. Even for
> utf-8 files, however, it would be very useful to be able to convert
On Sat, 05 Jun 2004 23:48:18 +0200, Thomas A. Schmitz
<[EMAIL PROTECTED]> wrote:
Just a quick reply (it's bedtime over here): there may be 2 problems.
Ok, get some sleep;-) Anyhow, I fixed the line break (is the space between
tilda and `s' correct?)
==
$_ =~ s/\x{062
Just a quick reply (it's bedtime over here): there may be 2 problems. 1
is that the mail program put in an unwanted linebreak after the =~
part, just remove it; it should all be one line. And then: you'll need a
fairly recent version of perl for it to work, what do you get when you
do
perl --versi
On Sat, 05 Jun 2004 22:41:39 +0200, Thomas A. Schmitz
<[EMAIL PROTECTED]> wrote:
Idris,
I know a bit of perl and would love to help. However, I fear that
sending us your stuff via mail will be a bit difficult because the utf-8
chracters get transformed into gibberish.
Thnx 4 such a speedy reply!
Idris,
I know a bit of perl and would love to help. However, I fear that
sending us your stuff via mail will be a bit difficult because the utf-8
chracters get transformed into gibberish. Could you send the hexadecimal
code of the characters you want to convert? Or I could simply give you
the synt
Hi gang,
For Arabic we use a Latin transcription in Aleph/(e-)Omega (or even
ArabTeX) unless one of the encoding filters like utf-8 is used. Even for
utf-8 files, however, it would be very useful to be able to convert a
utf-8 file to Latin transcription for further processing by
Aleph/(e-)Omega
13 matches
Mail list logo