Hello,

could you please include this tiny patch which allow matcher to catch URL 
containing diacritics?
E.g.: http://www.svetgralu.cz/svet-gralu/Hrdý_národ_se_stává_velmocí_(2). 
Otherwise it would match
just http://www.svetgralu.cz/svet-gralu/Hrd. I tested it with different 
PERL_UNICODE settings
("AS", "SDa", reset on "0"), works with all.

Thanks.

Regards,
Petr
--- src/urxvt/perl/matcher.orig	2012-04-20 01:15:00.000000000 +0200
+++ src/urxvt/perl/matcher	2012-05-29 00:21:45.000000000 +0200
# match URL containing diacritics e.g.: http://www.svetgralu.cz/svet-gralu/Hrdý_národ_se_stává_velmocí_(2)
# otherwise it would get just http://www.svetgralu.cz/svet-gralu/Hrd
@@ -6,10 +6,10 @@
 my $url =
    qr{
       (?:https?://|ftp://|news://|mailto:|file://|\bwww\.)
-      [a-zA-Z0-9\-\@;\/?:&=%\$_.+!*\x27,~#]*
+      [\p{XPosixAlnum}\-\@;\/?:&=%\$_.+!*\x27,~#]*
       (
-         \([a-zA-Z0-9\-\@;\/?:&=%\$_.+!*\x27,~#]*\)| # Allow a pair of matched parentheses
-         [a-zA-Z0-9\-\@;\/?:&=%\$_+*~]  # exclude some trailing characters (heuristic)
+         \([\p{XPosixAlnum}\-\@;\/?:&=%\$_.+!*\x27,~#]*\)| # Allow a pair of matched parentheses
+         [\p{XPosixAlnum}\-\@;\/?:&=%\$_+*~]  # exclude some trailing characters (heuristic)
       )+
    }x;
 
_______________________________________________
rxvt-unicode mailing list
[email protected]
http://lists.schmorp.de/cgi-bin/mailman/listinfo/rxvt-unicode

Reply via email to