Hi, 

When I edit the tags on:

http://del.icio.us/url/4720413157054e2059cf355a64300a3c 

(my username is aidan; right now I'm the only person to have posted that
URI), any non-ASCII Latin-1 character in a tag means that tag is truncated
from the first such character onwards. This happens both with an Ajax
edit and with a full-screen edit.

In detail: currently my tags are:

japanese language [Φ] [h] [ç] from-peter-t-daniels pasta.cantbedone.org 

(you can't see them on the URL page right now, as it appears to be cached;
try http://del.icio.us/tag/%5B%CE%A6%D5 if the cache hasn't expired by the
time you read this). Φ is a Greek character, U+03A6; ç is a Latin-1
character, U+00E7. The rest of the characters are US-ASCII. The browser is
Firefox 1.5.0.11, on Windows XP.
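
The code points and their UTF-8 percent-encodings can be double-checked with a few lines of Python. This is only a minimal sketch: the helper name describe is made up for illustration and has nothing to do with how del.icio.us handles tags.

```python
from urllib.parse import quote


def describe(ch):
    """Return (code point, percent-encoded UTF-8 bytes) for one character."""
    # quote() encodes as UTF-8 by default; safe="" escapes everything it can.
    return "U+%04X" % ord(ch), quote(ch, safe="")


def is_non_ascii_latin1(ch):
    """True for characters in the Latin-1 range that are not ASCII."""
    return 0x80 <= ord(ch) <= 0xFF


for ch in ("Φ", "ç"):
    print(ch, describe(ch), is_non_ascii_latin1(ch))
```

Only ç falls in the non-ASCII Latin-1 range; Φ lies outside Latin-1 altogether, which matches the tags that do and do not get truncated.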

I edit those tags: I click on edit, then full-screen edit, and add hi-there
as a tag, so that the displayed text is now:

japanese language [Φ] [h] [ç] from-peter-t-daniels pasta.cantbedone.org hi-there

I then click save, and it redirects away from that tag. But when I examine
the URL details again, the entry no longer has a [ç] tag; instead it has a
new [ tag. It still has the [Φ] tag. The same happens when I edit other tags
containing Latin-1 characters, or when I create new entries with tags
containing Latin-1 characters.

Using the Live HTTP Headers extension, I see that the POST request
submitted was:

POST /aidan/%5B%C3%A7%5D?779822
url=http%3A%2F%2Fpasta.cantbedone.org%2Fpages%2FPXXU7p.htm&oldurl=http%3A%2F%2Fpasta.cantbedone.org%2Fpages%2FPXXU7p.htm&description=%5B%CE%A6%5D%2C+%5Bh%5D+in+Japanese&notes=Cute%3B+Japanese+went+through+a+historical+development+similar+to+the+%2Ff%2F&tags=japanese+language+%5B%CE%A6%5D+%5Bh%5D+%5B%C3%A7%5D+from-peter-t-daniels+pasta.cantbedone.org+&jump=no&date=2007-03-10T14%3A31%3A10Z&key=1540afd502e4ce4af3cb2bac8df225d1

which, when URL-decoded and interpreted as UTF-8, gives this:

POST /aidan?312757
url=http://pasta.cantbedone.org/pages/PXXU7p.htm&oldurl=http://pasta.cantbedone.org/pages/PXXU7p.htm&description=[Φ],+[h]+in+Japanese&notes=Cute;+Japanese+went+through+a+historical+development+similar+to+the+/f/&tags=japanese+language+[Φ]+[h]+[ç]+from-peter-t-daniels+pasta.cantbedone.org+&jump=no&date=2007-03-10T14:31:10Z&key=1540afd502e4ce4af3cb2bac8df225d
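
The decoding of the tags field can be reproduced with Python's standard library. This is a sketch under the assumption that the form body is plain application/x-www-form-urlencoded data in UTF-8; the body string below is just the tags and jump fields copied from the captured request, not the whole POST.

```python
from urllib.parse import parse_qs

# The tags and jump fields copied from the captured request body.
body = (
    "tags=japanese+language+%5B%CE%A6%5D+%5Bh%5D+%5B%C3%A7%5D"
    "+from-peter-t-daniels+pasta.cantbedone.org+&jump=no"
)

# parse_qs turns "+" into spaces and decodes the %XX escapes as UTF-8.
fields = parse_qs(body, encoding="utf-8")

# Tags are space-separated; split() also drops the trailing space.
tags = fields["tags"][0].split()
print(tags)
```

The [ç] tag comes out whole here, so the bytes leaving the browser are a valid UTF-8 encoding of the tag list.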

Now, the tags CGI variable is correct there, so this seems to be a
server-side problem. I can work around it by renaming the tag [ to [ç], or
espa to español. 

I've made a UTF-8 encoded version of this email at
http://www.parhasard.net/del.icio.us-latin-1-problem.txt , since Yahoo
Groups appears to prefer to treat it as Latin-1.

Best regards, and please tell me if I should report this somewhere else. 

        Aidan

