Hello, could you please include this tiny patch which allow matcher to catch URL containing diacritics? E.g.: http://www.svetgralu.cz/svet-gralu/Hrdý_národ_se_stává_velmocí_(2). Otherwise it would match just http://www.svetgralu.cz/svet-gralu/Hrd. I tested it with different PERL_UNICODE settings ("AS", "SDa", reset on "0"), works with all.
Thanks. Regards, Petr
--- src/urxvt/perl/matcher.orig 2012-04-20 01:15:00.000000000 +0200 +++ src/urxvt/perl/matcher 2012-05-29 00:21:45.000000000 +0200 # match URL containing diacritics e.g.: http://www.svetgralu.cz/svet-gralu/Hrdý_národ_se_stává_velmocí_(2) # otherwise it would get just http://www.svetgralu.cz/svet-gralu/Hrd @@ -6,10 +6,10 @@ my $url = qr{ (?:https?://|ftp://|news://|mailto:|file://|\bwww\.) - [a-zA-Z0-9\-\@;\/?:&=%\$_.+!*\x27,~#]* + [\p{XPosixAlnum}\-\@;\/?:&=%\$_.+!*\x27,~#]* ( - \([a-zA-Z0-9\-\@;\/?:&=%\$_.+!*\x27,~#]*\)| # Allow a pair of matched parentheses - [a-zA-Z0-9\-\@;\/?:&=%\$_+*~] # exclude some trailing characters (heuristic) + \([\p{XPosixAlnum}\-\@;\/?:&=%\$_.+!*\x27,~#]*\)| # Allow a pair of matched parentheses + [\p{XPosixAlnum}\-\@;\/?:&=%\$_+*~] # exclude some trailing characters (heuristic) )+ }x;
_______________________________________________ rxvt-unicode mailing list [email protected] http://lists.schmorp.de/cgi-bin/mailman/listinfo/rxvt-unicode
