Re: [GENERAL] endash not a graphic character?

2016-08-21 Thread Bruno Wolff III
On Sun, Aug 21, 2016 at 14:24:16 -0400, Tom Lane wrote: Unfortunately, these particular characters are U+2013 and U+2014 so you lose. Thanks for saving me some time, as it would have taken me quite a while to figure that out. I'll adjust the constraint so that good

Re: [GENERAL] endash not a graphic character?

2016-08-21 Thread Tom Lane
Bruno Wolff III writes: > However I am wondering about my use of [[:graph:]] to match characters > that have glyphs. I was not expecting there to be characters that have > glyphs to not be in the graph class. In the short term I might want to > change the way I am testing that.

Re: [GENERAL] endash not a graphic character?

2016-08-21 Thread Bruno Wolff III
On Sun, Aug 21, 2016 at 12:30:21 -0500, Bruno Wolff III wrote: I should also try the equivalent test in perl to see if it is more likely tied to the unicode implementation on my system or if it appears to be Postgres specific. It looks like my locale may not be being set

Re: [GENERAL] endash not a graphic character?

2016-08-21 Thread Bruno Wolff III
On Sun, Aug 21, 2016 at 08:12:23 +1000, rob stone wrote: You can't use (emdash) or (endash)? Or their hex equivalents. See the Unicode chart. By the way, those aren't the correct codes. That only works if your code treats iso-5589-1 code points as windows 1252 code

Re: [GENERAL] endash not a graphic character?

2016-08-21 Thread Bruno Wolff III
On Sun, Aug 21, 2016 at 08:12:23 +1000, rob stone wrote: You can't use (emdash) or (endash)? Or their hex equivalents. See the Unicode chart. I am not the source of the data, but I can special case them one way or the other. However I am wondering about my use of

Re: [GENERAL] endash not a graphic character?

2016-08-20 Thread rob stone
Hello Bruno, On Sat, 2016-08-20 at 14:04 -0500, Bruno Wolff III wrote: > I was surprised to find endash and emdash were not graphic characters > in  > en_US. I'm not sure if this is correct behavior, a bug in postgres or > a  > bug in my OS' collation definitions? > > For example: > > Dash: >

[GENERAL] endash not a graphic character?

2016-08-20 Thread Bruno Wolff III
I was surprised to find endash and emdash were not graphic characters in en_US. I'm not sure if this is correct behavior, a bug in postgres or a bug in my OS' collation definitions? For example: Dash: area=> select '-' ~ '[[:graph:]]' collate "en_US"; ?column? -- t (1 row) Endash: