Hi David, On Sat, 2008-09-20 at 14:16 -0400, David Reiser wrote: > On Sep 17, 2008, at 9:00 AM, Derek Atkins wrote: > > > David Reiser <[EMAIL PROTECTED]> writes: > > > >>> Looks like it is coming from Apple's libSystem.dylib. I'll see if I > >>> can find someone who knows about any Apple regex oddities. > >> > >> Well, here's some Apple regex oddity: The regex parser apparently > >> doesn't like higher-than-ascii utf-8 unless the file it's working on > >> starts with a utf-8 BOM (0xefbbbf). I can't even grep qif-parse.scm > >> for the GBP symbol -- there is no output from the grep command. If I > >> use bbedit to prepend the three hex bytes to the file, then grep > >> successfully finds the symbols. But if the .scm file starts with the > >> BOM, gnucash launch now fails with: > > > > Interesting! Do you have the Gnu Regex library available? What if > > you explicitly link against that instead of using the regex in Apple's > > libc? > > > > -derek > > my googling suggests that the Gnu Regex library is part of gnu libc. > If my understanding is correct, then pogma's answer is: > "porting glibc and forcing your app to use it is nigh on impossible". > > How 'bout just converting the .scm files from Latin-1 to UTF-8?
I thought either Charles or Derek would answer that, maybe I misunderstood something. The newest version of the file qif-parse.scm in trunk is ASCII-only, so it is the encoded version of the same sequence of characters in both latin-1 and utf-8. It contains a special quoted version of a unicode character point, but that is logically different from a latin-1 encoding, even if some unicode character points equal the bytes from their latin-1 encoded form. Please excuse me if I chose incorrect terms, but I hope my message is decipherable :-) Ciao, -- andi5
signature.asc
Description: This is a digitally signed message part
_______________________________________________ gnucash-devel mailing list [email protected] https://lists.gnucash.org/mailman/listinfo/gnucash-devel
