On 10/15/2010 3:19 PM, Tim Greenwood wrote:
Is there any regular expression, in Perl or elsewhere, that enables searching on the derived age? I want to find all characters in a file that were added since Unicode 4.1. I could write it all myself by processing against the derived age file, but it would be nice if something were ready to go.
XQuery on the XML representation of the UCD is your friend. E.g.:

---
declare namespace u = "http://www.unicode.org/ns/2003/ucd/1.0";
for $c in doc('ucd.all.flat.xml')//u:ucd/u:repertoire/u:ch...@age >= 4.1]
return concat ($c/@cp, " ", $c/@age, " ", $c/@na, "
" )
---

Eric.
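
For the other route mentioned in the question, processing against the derived age file directly, here is a minimal Python sketch. It assumes the usual DerivedAge.txt line layout (`START..END ; VERSION # comment`, or a single code point instead of a range). The `SAMPLE` string is a small illustrative stand-in rather than the real file, and the function names are made up for this example. Versions are compared as integer tuples so that "10.0" sorts after "4.1".

```python
import re
from bisect import bisect_right

# Illustrative stand-in for DerivedAge.txt (assumed layout, not the full file).
SAMPLE = """\
# DerivedAge-style sample
0000..01F5    ; 1.1 # [502] <control>..LATIN SMALL LETTER G WITH ACUTE
20AC          ; 2.1 #       EURO SIGN
1F600         ; 6.1 #       GRINNING FACE
"""

# One code point or a START..END range, then "; major.minor".
LINE_RE = re.compile(
    r"^([0-9A-Fa-f]+)(?:\.\.([0-9A-Fa-f]+))?\s*;\s*(\d+)\.(\d+)"
)


def parse_derived_age(text):
    """Return a sorted list of (start, end, (major, minor)) tuples."""
    ranges = []
    for line in text.splitlines():
        m = LINE_RE.match(line)
        if not m:
            continue  # comment or blank line
        start = int(m.group(1), 16)
        end = int(m.group(2), 16) if m.group(2) else start
        ranges.append((start, end, (int(m.group(3)), int(m.group(4)))))
    ranges.sort()
    return ranges


def chars_newer_than(text, ranges, version):
    """List (char, age) for characters whose age is strictly after version."""
    starts = [r[0] for r in ranges]
    found = []
    for ch in text:
        cp = ord(ch)
        i = bisect_right(starts, cp) - 1  # last range starting at or before cp
        if i >= 0 and ranges[i][0] <= cp <= ranges[i][1]:
            age = ranges[i][2]
            if age > version:
                found.append((ch, age))
    return found


if __name__ == "__main__":
    table = parse_derived_age(SAMPLE)
    for ch, age in chars_newer_than("a\u20ac\U0001F600", table, (4, 1)):
        print("U+%04X %d.%d" % (ord(ch), age[0], age[1]))
```

To run this against real data, read the actual DerivedAge.txt into a string and pass it to `parse_derived_age` in place of `SAMPLE`. The comparison here is strict ("after 4.1"); use `>=` if characters introduced in 4.1 itself should be included, as in the query above.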

