> There are a couple of bugs in trac related to UTF-8.  It looks like
> all file names and URLs are run through the pretty restrictive
> quoteFilename in wikiutil.py.  This recodes all characters that aren't
> in (A-Z,a-z,1-9).   In a UTF-8 environment, it doesn't work on UTF-8
> URLs.

It looks like[1] only these ascii characters are allowed in a URI:

Unreserved Characters (no encoding needed)
A-Z (uppercase letters)
a-z (lowercase letters)
0-9 (numbers)
- (dash)
_ (underscore)
. (period)
~ (tilde)

Reserved Characters (allowed only if encoded)
! = %21
* = %2A
' = %27
( = %28
) = %29
; = %3B
: = %3A
@ = %40
& = %26
= = %3D
+ = %2B
$ = %24
, = %2C
/ = %2F
? = %3F
% = %25
# = %23
[ = %5B
] = %5D

If the filename is meant to be displayed in the browser it make sense to
encode it using percent encoding. But of course the title of the page
should still be UTF8.

Scott
------
[1] http://en.wikipedia.org/wiki/Percent-encoding


_______________________________________________
Sycamore-Dev mailing list
[EMAIL PROTECTED]
http://www.projectsycamore.org/
https://tools.cernio.com/mailman/listinfo/sycamore-dev

Reply via email to