Are you submitting non-utf8 stuff to begin with? Joshua
> -----Original Message----- > From: [email protected] > [mailto:[EMAIL PROTECTED] On Behalf Of aeohek > Sent: Wednesday, May 02, 2007 4:24 AM > To: [email protected] > Subject: [ydn-delicious] Latin 1, and only Latin 1, lost on tag edit > > Hi, > > When I edit the tags on: > > http://del.icio.us/url/4720413157054e2059cf355a64300a3c > > (my username is aidan , right now I'm the only person to have > posted that > URI), any Latin-1 (excluding ASCII) in a tag means that tag > is truncated > from the last non-Latin-1 character onwards. This happens both with an > Ajax > edit and with a full-screen edit. > > In detail; currently my tags are: > > japanese language [Φ] [h] [ç] from-peter-t-daniels > pasta.cantbedone.org > > (you can't see them on the URL page right now, it appears to be > cached; try > http://del.icio.us/tag/%5B%CE%A6%D5 if the cache hasn't expired by the > time > you read this). Φ is a Greek character, U+30A6, ç is a > Latin-1 character, > U+00E7. The rest of the characters are US-ASCII. Browser is Firefox > 1.5.0.11, platform Windows XP. > > I edit those tags--I click on edit, then full-screen edit, and add > hi-there > as a tag, such that the displayed text is now: > > japanese language [Φ] [h] [ç] from-peter-t-daniels > pasta.cantbedone.org hi-there > > I then click save, and it redirects away from that tag. But when I > examine > the URL details again, it no longer has a [ç] tag, but it has a new [ > tag. It still has the [Φ] tag. The same happens when I > edit other tags > containing Latin 1, or when I create new entries with tags containing > Latin > 1. > > Using the Live HTTP Headers extension, I see that the POST request > submitted > was: > > POST /aidan/%5B%C3%A7%5D?779822 > url=http%3A%2F%2Fpasta.cantbedone.org%2Fpages%2FPXXU7p.htm&old > url=http%3A%2F%2Fpasta.cantbedone.org%2Fpages%2FPXXU7p.htm&des > cription=%5B%CE%A6%5D%2C+%5Bh%5D+in+Japanese¬es=Cute%3B+Jap anese+went+through+a+historical+development+similar+to+the+%2Ff%2F&tags=japanese+language+%5B%> CE%A6%5D+%5Bh%5D+%5B%C3%A7%5D+from-peter-t-daniels+pasta.cantb > edone.org+&jump=no&date=2007-03-10T14%3A31%3A10Z&key=1540afd50 > 2e4ce4af3cb2bac8df225d1 > > which, when URL-decoded and converted to UTF-8, gives this: > > POST /aidan?312757 > url=http://pasta.cantbedone.org/pages/PXXU7p.htm&oldurl=http:/ > /pasta.cantbedone.org/pages/PXXU7p.htm&description=[Φ],+[ > h]+in+Japanese¬es=Cute;+Japanese+went+through+a+historical+ > development+similar+to+the+/f/&tags=japanese+language+[Φ] > +[h]+[ç]+from-peter-t-daniels+pasta.cantbedone.org+&jump=no&da > te=2007-03-10T14:31:10Z&key=1540afd502e4ce4af3cb2bac8df225d > > Now, the tags CGI variable is correct there, so this seems to be a > server-side problem. I can work around it by renaming the tag > [ to [ç], or > espa to español. > > I've made a UTF-8 encoded version of this email at > http://www.parhasard.net/del.icio.us-latin-1-problem.txt , since Yahoo > Groups appears to prefer to treat it as Latin 1. > > Best regards, and please tell me if I should report this > somewhere else. > > Aidan > > > > > > Yahoo! Groups Links > > > >

