https://bugzilla.wikimedia.org/show_bug.cgi?id=32717

--- Comment #4 from Lupo <lupo.bugzi...@gmail.com> 2011-12-01 08:12:26 UTC ---
(In reply to comment #2)
> Zero-width space is required for some scripts, to insert a break between
> letters that would otherwise form ligatures.

Maybe I misunderstand the purpose of these Unicode characters; I'm not a
Unicode specialist. I thought that was the purpose of the zero-width non-joiner
(U+200C)? Granted, I think the zero-width space (U+200B) also would need to
have the same effect as the ZWNJ as it indicates an (invisible) word boundary,
but I'd say that's just a side effect. Also, this normally invisible word
boundary may be expanded into visible whitespace by text justification
according to [[en:zero-width space]]. So right, stripping it would not be
right, but maybe it should be treated as an underscore.

Anyway, thanks for the answer, I see the rationale now. Whether it's 100%
correct is less important to me. And perhaps people are using U+200B where they
should actually use U+200C, and it's thus more user-friendly to treat it that
way. I was just trying to understand what the thoughts behind this were.

> 
> Tab ... SHOULD be stripped, lemme check. :)

"Outright forbidden": do I see this right that this is rejected at line 2834
http://svn.wikimedia.org/viewvc/mediawiki/trunk/phase3/includes/Title.php?revision=104051&view=markup#l2834
and depends on the configuration of $wgLegalTitleChars?

So, is an installation allowed to define that \t was a legal title character,
and if so, what happens then? (Or what would make most sense then?) Replace by
underscore?

-- 
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.
You are on the CC list for the bug.

_______________________________________________
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l

Reply via email to