[PATCHES] EUC_JP and SJIS conversion improvement

Atsushi Ogawa Mon, 13 Jun 2005 06:16:11 -0700

The character-code conversion from EUC_JP to SJIS is executed by
converting two stages. The first stage is conversion from EUC_JP to MIC.
The next stage is conversion from MIC to SJIS. (Conversion from SJIS to
EUC_JP is also similar.)

It is not so efficient, because it is necessary to allocate the
buffer for MIC, and to execute the calculation for conversion twice.

In the attached patch, it enables the direct conversion of EUC_JP and
SJIS. Additionally, there is an improvement that reduce the call of
pg_mic_mblen. 

The effect of the patch that I measured is as follows:

o The Data for test was created by 'pgbench -i'.

o Test SQL:
set client_encoding to 'SJIS';
select * from accounts;

o Test results: Linux(CPU: Pentium III, Compiler option: -O2)
 - original: 2.920s
 - patched : 2.278s

regards,

---
Atsushi Ogawa

euc_jp_and_sjis.patch
Description: Binary data

---------------------------(end of broadcast)---------------------------
TIP 6: Have you searched our list archives?


               http://archives.postgresql.org

[PATCHES] EUC_JP and SJIS conversion improvement

Reply via email to