Re: How to remove emoji's from unicode string

2019-01-13 Thread Stephen MacLean via use-livecode
Hi Kee, codepointOffset() doesn’t seem to work as expected, at least for me on my data, although I’m not sure why. The results are way different. I had thought it would be as simple as using that, but as per usual, nothing is! Best, Steve MacLean > On Jan 13, 2019, at 6:51 PM, Kee Nethery

Re: How to remove emoji's from unicode string

2019-01-13 Thread Stephen MacLean via use-livecode
Hi Richmond, Thanks for posting this! There are multiple ranges for emojis, as more were added in each version on Unicode. Here is the list for the latest version of unicode 12.0 https://www.unicode.org/Public/emoji/12 ... i-data.txt

Re: How to remove emoji's from unicode string

2019-01-13 Thread scott--- via use-livecode
Hello Richmond.I have found that emojis also cause the command to fail silently. Being able to strip emojis would be helpful for that as well. I've been fooling about with your emoji stripping stack. Using codePointToNum(tEmojiChar) > 128511 doesn't seem to catch all the emoji

Re: How to remove emoji's from unicode string

2019-01-13 Thread Kee Nethery via use-livecode
On my phone so verify this. Shouldn’t you be using codepointoffset(),not offset()? Kee Nethery > On Jan 13, 2019, at 1:34 PM, Richmond via use-livecode > wrote: > > Cop a look at this: > > *http://forums.livecode.com/viewtopic.php?f=7=32030* > >> On 13.01.19 22:17, Stephen MacLean via

Re: How to remove emoji's from unicode string

2019-01-13 Thread Richmond via use-livecode
Cop a look at this: *http://forums.livecode.com/viewtopic.php?f=7=32030* On 13.01.19 22:17, Stephen MacLean via use-livecode wrote: Hi All, The recent conversations on using offset() with Unicode strings was very enlightening, thanks to all that took part!. I have data stored in UTF8mb4. I

Re: How to remove emoji's from unicode string

2019-01-13 Thread Richmond via use-livecode
Well . . . as the emojis are stored in a Unicode range (Hex 1F600 - 1F64F) https://www.unicode.org/charts/ I'd "just" strip out any characters inwith that range. On 13.01.19 22:17, Stephen MacLean via use-livecode wrote: Hi All, The recent conversations on using offset() with Unicode

How to remove emoji's from unicode string

2019-01-13 Thread Stephen MacLean via use-livecode
Hi All, The recent conversations on using offset() with Unicode strings was very enlightening, thanks to all that took part!. I have data stored in UTF8mb4. I use textDecode after loading it from the DB to put it into a format that LC understands. I then use offset() to find certain tags,