Hello Tobias,
Note that the Java/Oxygen regex engine supports only 4-digit hex codes
with the \u Unicode code point escape.
If you use 5 digits, the 5th digit is interpreted separately as a
literal character, which causes unwanted side effects.
e.g.
[\u0100-\u1F9FF] is interpreted as
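
For reference, here is a minimal Java sketch of the behaviour described above. It assumes that the expression reaches java.util.regex unchanged; the class and method names are made up for illustration. The \x{...} form is an alternative that this message does not mention: java.util.regex accepts it from Java 7 on, but whether the Oxygen Find box behaves the same way is an assumption.

```java
import java.util.regex.Pattern;

class FiveDigitEscapeDemo {
    // The backslash-u escape reads exactly four hex digits, so this class is
    // the range U+0100..U+1F9F plus the literal letter F.
    static final Pattern FIVE_DIGIT = Pattern.compile("[\\u0100-\\u1F9FF]");

    // The braces form takes a whole code point (Java 7 and later), so this
    // class is the intended range U+0100..U+1F9FF.
    static final Pattern CODE_POINT = Pattern.compile("[\\x{100}-\\x{1F9FF}]");

    // Hypothetical helper: true if the whole string is matched by the pattern.
    static boolean matches(Pattern p, String s) {
        return p.matcher(s).matches();
    }

    public static void main(String[] args) {
        // U+1F600, a character above the four-digit range.
        String emoji = new String(Character.toChars(0x1F600));

        System.out.println(matches(FIVE_DIGIT, "F"));   // true: the stray literal F
        System.out.println(matches(FIVE_DIGIT, emoji)); // false: above U+1F9F
        System.out.println(matches(CODE_POINT, "F"));   // false
        System.out.println(matches(CODE_POINT, emoji)); // true
    }
}
```

The first line of output shows the side effect: a plain "F", which lies below \u0100, is matched by the 5-digit version.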
Hi Andreas,
sure, this can be done with a basic regex query: [\u00D8-\u00F6]

And for your example: [\u0100-\u1F9FF]. Unfortunately, oXygen 18 seems to
have a bug with this query (precisely: with 5-digit hex codes), as it
also matches characters below \u0100 (which is the following of
Duh. I bet you must have heard my head banging on the desk even in
Leipzig.
Thanks a lot, I don't know whether I would have thought of this
otherwise.
Cheers,
Andreas
* Imsieke, Gerrit, le-tex dixit [2016-06-24 10:24]:
Either positively [\u0100-\u] (it doesn’t seem to stretch above 4
hex digits yet) or [^\u-\u00ff]
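
Outside the Find box, the negated-class idea can be sketched in Java roughly as follows. This is only an illustration: the lower bound \u0000 is an assumption (the escape is incomplete in the message as quoted), and the class name AboveLatin1Finder and its find method are hypothetical.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

class AboveLatin1Finder {
    // Negated class: any code point outside U+0000..U+00FF.
    // The lower bound U+0000 is an assumption, see the note above.
    static final Pattern ABOVE_LATIN1 = Pattern.compile("[^\\u0000-\\u00FF]");

    // Returns every character above U+00FF in the text, formatted as U+XXXX.
    static List<String> find(String text) {
        List<String> hits = new ArrayList<>();
        Matcher m = ABOVE_LATIN1.matcher(text);
        while (m.find()) {
            // codePointAt reads a surrogate pair as one code point.
            int cp = text.codePointAt(m.start());
            hits.add(String.format("U+%04X", cp));
        }
        return hits;
    }

    public static void main(String[] args) {
        // Sample text: i with diaeresis (U+00EF), long s (U+017F), emoji (U+1F600).
        String sample = "na\u00EFve \u017Fatz " + new String(Character.toChars(0x1F600));
        System.out.println(find(sample)); // [U+017F, U+1F600]
    }
}
```

Because the class is negated, it needs no upper bound, so it sidesteps the 4-digit limit and also reports characters beyond the four-digit range.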
On 24.06.2016 09:50, Andreas Wagner wrote:
Dear all,
In order to make sure that we have caught all special characters in an
externally transcribed TEI/XML file, I would like to search for all
characters above Unicode code point 0x00ff. Can this be done in the
Regular Expression Find box? (I found the search for single unicode