Re: [oXygen-user] Search for Characters in Unicode range

Tobias Fischer | pagina GmbH Fri, 24 Jun 2016 01:39:10 -0700

Hi Andreas,

sure, this can be done with a basic regex query:

[\u00D8-\u00F6]

And for your example:

[\u0100-\u1F9FF]

Unfortunately, oXygen 18 seems to have a bug with this query (precisely: with 5-digit hex codes), as it also matches characters below \u0100 (which is the code point right after \u00FF). However, you can also work with negation:

[^\u0000-\u00FF]

And this seems to work fine :)

Regards,
Tobias
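For anyone who wants to try the two patterns outside oXygen, here is a minimal sketch using Python's re module. It is not oXygen's own regex engine, so it only illustrates the idea; the sample string and the helper name find_above_latin1 are made up for this example. In Python, \u consumes exactly four hex digits, so the five-digit form is read as a shorter range followed by a literal "F". Whether the same parsing rule explains the oXygen 18 behaviour described above is an assumption, not something confirmed in this thread.

```python
import re

# Negated class from the reply: any character outside U+0000..U+00FF.
ABOVE_LATIN1 = re.compile(r"[^\u0000-\u00FF]")

def find_above_latin1(text):
    """Return (offset, character, code point label) for each hit."""
    return [(m.start(), m.group(), f"U+{ord(m.group()):04X}")
            for m in ABOVE_LATIN1.finditer(text)]

# Hypothetical sample; only U+0153 and U+017F lie above U+00FF.
sample = "Stra\u00dfe na\u00efve \u0153uvre \u017f"
hits = find_above_latin1(sample)

# In Python, \u takes exactly four hex digits, so this class is read as
# the range U+0100..U+1F9F plus a literal "F".
four_digit = re.compile(r"[\u0100-\u1F9FF]")

# Python's eight-digit \U escape expresses the intended bound U+1F9FF.
eight_digit = re.compile(r"[\u0100-\U0001F9FF]")

for offset, char, label in hits:
    print(offset, char, label)
```

The negated class avoids the question of how long a \u escape is, since both of its bounds fit in four hex digits.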


Tobias Fischer
XML and e-book development

On 24.06.2016 at 09:50, Andreas Wagner wrote:

Dear all,
In order to make sure that we have caught all special characters in an externally transcribed TEI/XML file, I would like to search for all characters above Unicode code point 0x00ff. Can this be done in the Regular Expression Find box? (I found the search for single Unicode code points with \u, \x etc., but can't figure out if this can be used to search for characters (not) in code point ranges.)
Thanks for any suggestion,

Andreas
