We have a user-facing function that creates login names based on their real names, using initial characters from their surname and last name to create a MarkLogic user name. For the first time we recorded a server error when someone registered with a name beginning with a non-ASCII character (Norwegian Ø), because currently MarkLogic username cannot have non-ASCII characters.

So I thought the easy solution would be to use xdmp:diacritic-less(). But no, that only changes characters like ñ and é that are accented variants of a single letter. It does not touch combined charaters like Ø or Æ.

Of course I could use fn:translate to catch all of the likely cases, but is there a more general-purpose standard or extension function to perform normalization to ASCII for accented/combined Latin characters in a MarkLogic environment?

David S.

--
David Sewell, Editorial and Technical Manager
ROTUNDA, The University of Virginia Press
PO Box 400314, Charlottesville, VA 22904-4314 USA
Email: [email protected]   Tel: +1 434 924 9973
Web: http://rotunda.upress.virginia.edu/
_______________________________________________
General mailing list
[email protected]
http://developer.marklogic.com/mailman/listinfo/general

Reply via email to