2006/7/17, gabor <[EMAIL PROTECTED]>: > > Jeroen Ruigrok van der Werven wrote: > > On 7/12/06, Julian 'Julik' Tarkhanov <[EMAIL PROTECTED]> wrote: > >> This is handled by Unicode standard and is called transliteration. > > > > > > Also, for Japanese, are you going to follow kunrei-shiki or rather the > > more widely used hepburn transliteration? Or perhaps even nippon-shiki > > if you feel like sticking to strictness. > > i think we do not need to discuss japanese at all. after all, there's no > transliteration for kanji. so it's imho pointless to argue about > kana-transliteration, when you cannot transliterate kanji.
We Japanese know that we can't transarate Japanese to ASCII. So I want to do it as follows at least. A letter does not disappear and is restored. #FileField and ImageField have same letters disappear problem. def slug_ja(word) : try : unicode(word, 'ASCII') import re slug = re.sub('[^\w\s-]', '', word).strip().lower() slug = re.sub('[-\s]+', '-', slug) return slug except UnicodeDecodeError : from encodings import idna painful_slug = word.strip().lower().decode('utf-8').encode('IDNA') return painful_slug --~--~---------~--~----~------------~-------~--~----~ You received this message because you are subscribed to the Google Groups "Django developers" group. To post to this group, send email to django-developers@googlegroups.com To unsubscribe from this group, send email to [EMAIL PROTECTED] For more options, visit this group at http://groups.google.com/group/django-developers -~----------~----~----~----~------~----~------~--~---