> No, the safe thing to do (and the thing that is done) is to treat the space
> as a space ignoring the fact that the NMTOKEN contains a combining
> character, this is even safer than your suggestion since it can't
> mis-identify the combining properties of a character.

OK, it's safe, but it is a misuse of Unicode. As a space plus a combining character is a unit in Unicode, it should be treated as a unit by higher-level protocols. If higher-level protocols are allowed to do arbitrary things within Unicode units, there is no end to the possible confusion. See, for example, Unicode 4.0, chapter 3:

C7 A process shall interpret a coded character representation according to the character
semantics established by this standard, if that process does interpret that coded character
representation.
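
To make the difference concrete, here is a rough Python sketch of the two behaviours being argued about. It is only an illustration: the function names are made up for this message, the test for "combining character" is simply a General Category beginning with M, and it does not claim to describe what any actual XML parser does.

```python
import unicodedata


def naive_split(text):
    """Split on every U+0020, ignoring whatever character follows it."""
    return [tok for tok in text.split(" ") if tok]


def unit_aware_split(text):
    """Split on U+0020 only when it is NOT the base of a combining sequence.

    Assumption: a character counts as combining when its General Category
    starts with "M" (Mn, Mc, Me).
    """
    tokens, current = [], []
    for i, ch in enumerate(text):
        nxt = text[i + 1] if i + 1 < len(text) else ""
        followed_by_mark = bool(nxt) and unicodedata.category(nxt).startswith("M")
        if ch == " " and not followed_by_mark:
            # A bare space: close the current token, if any.
            if current:
                tokens.append("".join(current))
                current = []
        else:
            # Ordinary character, or a space that carries a combining mark.
            current.append(ch)
    if current:
        tokens.append("".join(current))
    return tokens


# "abc", SPACE + COMBINING ACUTE ACCENT, "def", SPACE, "ghi"
sample = "abc \u0301def ghi"
naive = naive_split(sample)
aware = unit_aware_split(sample)
```

With the sample above, the naive split yields three tokens, the second of which begins with an orphaned combining acute accent. The unit-aware split yields two, keeping the space and its accent together inside the first. Whether that first token is then a valid NMTOKEN is a separate question.
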
-- Peter Kirk [EMAIL PROTECTED] (personal) [EMAIL PROTECTED] (work) http://www.qaya.org/

