One more
https://stackoverflow.com/questions/1454952/dummys-guide-to-unicode
The information contained in this electronic mail transmission is intended only
for the use of the individual or entity named in this transmission. If you are
not the intended recipient of this transmission, you
Here are a few articles if you really want to dig in
https://stackoverflow.com/questions/5290182/how-many-bytes-does-one-unicode-character-take
https://docs.oracle.com/database/121/NLSPG/ch2charset.htm#NLSPG1037
https://metacpan.org/pod/Text::Iconv
I’m beginning to think that just truncating the string to something well short
of the column limit is going to be the simplest way to go. This has been a
persistent issue in the past with earlier versions of this app that were
handled by the previous programmer/dba by…manually editing the file
UTF-8 has a variable number of bytes per character. Some encodings have 1 byte
per character always. Some encodings have two bytes per character always.
Assuming that the number of bytes is the same as the number of characters
sometimes leads to wrong answers.
When you are using perl as a
On Thursday 27 May 2021 23:35:32 Bruce Johnson wrote:
> use bytes; $string=substr($orig,0,4000);
Hello! This is really suspicious. See **BOLD** description of 'bytes' module:
**Use of this module for anything other than debugging purposes is strongly
discouraged.**
**If you feel that the
I'm working on an app that processes (among other thing) long sections of text,
and I’m running into odd multi-byte characters in some of the entries.
The column in question is a varchar2(4000) so I am truncating the input to 4000
bytes.
(via use “bytes; $string=substr($orig,0,4000); ” in the