Re: Bidirectional (bidi) Support?

2003-10-25 Thread Chris Whiting
> "Bob Hallissy" <[EMAIL PROTECTED]> wrote in message news:OFA3E32D4C.D67B7CEA->[EMAIL PROTECTED] > > On 21/10/2003 01:09:32 "Chris Whiting" wrote: > >> I have implemented ... an Arabic shaping algorithm in Perl and was wondering if it would be useful to upload it to CPAN. > > I presume your algorit

Malformed UTF-8 character

2003-10-25 Thread Marco Baroni
Dear all, I am new to (explicit) Unicode handling, and right now I am facing this problem. I have some data (lots of data) that in theory should be in ASCII (with entity references in place of non-ASCII characters). I have no easy way to find out exactly how these data were generated. W

PS (Malformed UTF-8 character)

2003-10-25 Thread Marco Baroni
Although it took a while to start happening, I see that occasionally other regexps (such as the one on line 23) complain: Malformed UTF-8 character (unexpected end of string) at look_for_probable_nprs.pl line 121, <> line 1021152. (Line 121 of script: $w =~ s/^\x{e0}/\x{c0}/; ) On Sunday

Re: PS (Malformed UTF-8 character)

2003-10-25 Thread David Graff
On Sunday, Oct 26, 2003, at 01:12 Europe/Rome, Marco Baroni wrote: > I have some data (lots of data) that in theory should be in ascii (with entity references in place of non-ascii characters). I have no easy way to get to know exactly how these data were generated. > [snip] > I looked at

Re: Bidirectional (bidi) Support?

2003-10-25 Thread Jungshik Shin
On Fri, 24 Oct 2003, Chris Whiting wrote: > > "Bob Hallissy" <[EMAIL PROTECTED]> wrote in message >> I presume your algorithm depends on the Arabic presentation forms available as separately encoded characters in Unicode. If this is the case, > The algorithm, and all that I have seen, conver