>"Bob Hallissy" <[EMAIL PROTECTED]> wrote in message
news:OFA3E32D4C.D67B7CEA->[EMAIL PROTECTED]
>
>On 21/10/2003 01:09:32 "Chris Whiting" wrote:
>
>>I have implemented ... an Arabic shaping algorithm in
>>Perl and was wondering if it would be useful to upload it to cpan.
>
>I presume your algorit
Dear all,
I am new to (explicit) unicode handling, and right now I am facing this
problem.
I have some data (lots of data) that in theory should be in ascii (with
entity references in place of non-ascii characters). I have no easy way
to get to know exactly how these data were generated.
W
While it took a while to start happening, I see that occasionally other
reg exps (such as the one on line 23) complain:
Malformed UTF-8 character (unexpected end of string) at
look_for_probable_nprs.pl line 121, <> line 1021152.
(Line 121 of script:
$w =~ s/^\x{e0}/\x{c0}/;
)
On Sunday
On Sunday, Oct 26, 2003, at 01:12 Europe/Rome, Marco Baroni wrote:
> I have some data (lots of data) that in theory should be in ascii
> (with entity references in place of non-ascii characters). I have no
> easy way to get to know exactly how these data were generated.
>
[snip]
> I looked at
On Fri, 24 Oct 2003, Chris Whiting wrote:
> >"Bob Hallissy" <[EMAIL PROTECTED]> wrote in message
>> I presume your algorithm depends on the Arabic presentation forms available
>> as separately encoded >characters in Unicode. If this is the case,
> The algorithm, and all that I have seen, conver