I am not entirely sure that it is a bug in ICU. I think it is a "feature".

ICU does not recognize any valid characters in the reserved ranges of an encoding. (Not sure I am using proper terminology here.) For example, ISO-8859-1 (aka Latin-1) assigns no printable characters to the range 128 through 159. However, cp1252 (and the other cp125x code pages), which are Microsoft's variants on ISO 8859, put printable characters in this range. Many people mistakenly refer to cp1252 as Latin-1. It is not.
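
To make the difference concrete, here is a minimal sketch using Python's built-in codecs rather than ICU. The sample bytes are made up for illustration. Note that Python's latin-1 codec does not reject bytes in this range; it quietly maps them to control characters, so it only illustrates the two byte-to-character mappings, not ICU's error reporting.

```python
# Bytes 0x80-0x9F hold printable characters in cp1252 but not in ISO-8859-1.
# The sample text is invented for this illustration.
raw = b"\x93quoted\x94 \x96 it\x92s"

# Python's latin-1 codec maps every byte to the code point of the same value,
# so 0x93 becomes the C1 control character U+0093.
as_latin1 = raw.decode("latin-1")

# cp1252 maps the same bytes to curly quotes, an en dash and an apostrophe.
as_cp1252 = raw.decode("cp1252")

print(repr(as_latin1))
print(repr(as_cp1252))
```

If the second line printed looks like the intended text, the data is almost certainly cp1252 and not Latin-1.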

Many of the non-UTF-8 modules contain characters that are not in Latin-1. When such a character is converted to UTF-8, the conversion fails. And when the text is converted back to Latin-1, the character is missing.

If we were to tell the conversion routine which encoding was used, then it might work. I say might, because I ran across a few OSes that did not have the MS encodings on them (e.g. IBM mainframes, Sun Solaris at least through 7, and early versions of Linux, though I have not looked lately).
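
As a rough sketch of what "identifying the encoding" could look like, the following uses Python's standard codecs, not ICU and not the Sword tools. The function name to_utf8 and the default of cp1252 are assumptions made for this example only.

```python
import codecs

def to_utf8(raw, source_encoding="cp1252"):
    """Decode raw bytes with the declared encoding and re-encode as UTF-8.

    Hypothetical helper for illustration; not part of Sword or ICU.
    """
    try:
        # Fails on platforms or builds that do not ship the requested codec.
        codecs.lookup(source_encoding)
    except LookupError:
        raise RuntimeError(
            "encoding %r is not available on this platform" % source_encoding
        )
    # Raises UnicodeDecodeError if a byte has no mapping in source_encoding
    # (cp1252 itself leaves a few bytes, such as 0x81, unassigned).
    return raw.decode(source_encoding).encode("utf-8")
```

The explicit lookup mirrors the caveat above: the conversion only works where the declared encoding is actually installed.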

Chris Little wrote:

This is an error coming from ICU (not Sword) and I don't believe it is any problem. If you have a question about whether your data is being preserved correctly, you can do imp2mod on your input file followed by mod2imp on the resulting module. Then diff the two imp files and see if there's any difference.
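
For the final comparison step, any diff tool will do. A minimal sketch in Python, assuming both imp files have already been read into strings (the labels original.imp and roundtrip.imp are placeholders, and the imp2mod and mod2imp runs themselves are not shown):

```python
import difflib

def imp_diff(original_text, roundtrip_text):
    """Return unified-diff lines between two imp file contents.

    An empty list means the round trip preserved the text exactly.
    """
    return list(
        difflib.unified_diff(
            original_text.splitlines(),
            roundtrip_text.splitlines(),
            fromfile="original.imp",   # placeholder label
            tofile="roundtrip.imp",    # placeholder label
            lineterm="",
        )
    )
```

Any line starting with "-" or "+" in the result marks an entry that changed during the round trip.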

--Chris

Don A. Elbourne Jr. wrote:

I am creating a new Daily Devo module out of Joseph Philpot's "Daily Portions." I have created an imp file and I'm using the latest imp2ld tool on WinXP.

I get output like this:

12.30
from: U_INVALID_CHAR_FOUND
from: U_INVALID_CHAR_FOUND
from: U_INVALID_CHAR_FOUND
from: U_INVALID_CHAR_FOUND
from: U_INVALID_CHAR_FOUND
12.31
from: U_INVALID_CHAR_FOUND
from: U_INVALID_CHAR_FOUND
from: U_INVALID_CHAR_FOUND
from: U_INVALID_CHAR_FOUND
from: U_INVALID_CHAR_FOUND

You can see my imp file here:
http://elbourne.org/temp/philpot.imp (640 KB)

What character could be flagging the errors and what should I do to correct it?
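
One way to hunt for the offending characters is to scan the file for bytes in the 128 to 159 range. The sketch below assumes the imp file is in a single-byte encoding such as Latin-1 or cp1252; on a UTF-8 file it would report false positives, because UTF-8 continuation bytes also fall in that range. The function name is made up for this example.

```python
def find_suspect_bytes(data):
    """Yield (line_number, column, byte_value) for each byte in 0x80-0x9F.

    Assumes a single-byte encoding; line numbers and columns are 1-based.
    """
    for line_no, line in enumerate(data.split(b"\n"), start=1):
        for col, byte in enumerate(line, start=1):
            if 0x80 <= byte <= 0x9F:
                yield line_no, col, byte

# Example use (the file name is whatever your imp file is called):
#   with open("philpot.imp", "rb") as f:
#       for line_no, col, byte in find_suspect_bytes(f.read()):
#           print(line_no, col, hex(byte))
```

Typical culprits in text typed on Windows are curly quotes (0x91 to 0x94) and dashes (0x96, 0x97).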


_______________________________________________ sword-devel mailing list: [email protected] http://www.crosswire.org/mailman/listinfo/sword-devel Instructions to unsubscribe/change your settings at above page

