Re: [pcre-dev] Using PCRE upon Asian and other two-byte national codings

Ze'ev Atlas Sun, 24 Nov 2013 06:40:22 -0800

Whatever you do, please take care not to break the EBCDIC support (which is now 
implemented in Exclusive OR mode to ASCII)
 
Ze'ev Atlas




________________________________
 From: ND <[email protected]>
To: Pcre-dev <[email protected]> 
Sent: Sunday, November 24, 2013 6:01 AM
Subject: Re: [pcre-dev] Using PCRE upon Asian and other two-byte national 
codings
 

On 2013-11-23 16:07, ph10 wrote:
> On Sat, 23 Nov 2013, Zoltán Herczeg wrote:
>
> currently PCRE character tables can only hold lowercase / flipped case  
> and various type bits for the >first 256 characters. Supporting the  
> whole 64K character set in 16 bit mode would take 409600 bytes >of  
> memory, which is less than half megabyte. Today, even smartphones can  
> afford that cost. The trade->of would be that the same tables could not  
> be used in 8/16/32 bit modes anymore, since the >lowercase / flipped  
> case tables would depend on the natural character length. Hence a table  
> with only >256 characters would be bigger in 16/32 bit mode than now.  
> (Note: the table size would always be >divisible by 256. This would  
> allow not to change anything in 8 bit mode, but we could also support  
> >character sets which does not have 64K characters in 16 bit and  
> especially in 32 bit mode, where we >have 4096M characters).
> I am sure we cannot do this for 8.34 (this is not an easy task), but if  
> this is important for many >people, we might think about this later.

I think it will be useful.

Thanks.

-- 
## List details at https://lists.exim.org/mailman/listinfo/pcre-dev 
-- 
## List details at https://lists.exim.org/mailman/listinfo/pcre-dev

Re: [pcre-dev] Using PCRE upon Asian and other two-byte national codings

Reply via email to