in perldoc perlunicde : Unicode Character Properties : Scripts I see a Han, which can be use as $string =~/\p{Han}/;
my question is how can I find out what exactly "Han" is ? I know \p{Han} can match a Chinese word, also tested it to match each word in perl-src/ext/Encode/t/gb2312.utf, but I do not know the exact range of this \p{Han}. thanks!