GitHub user HuJiayin commented on a diff in the pull request:
https://github.com/apache/spark/pull/7208#discussion_r34012100
--- Diff: unsafe/src/main/java/org/apache/spark/unsafe/types/UTF8String.java ---
@@ -165,6 +165,38 @@ public UTF8String toLowerCase() {
     return UTF8String.fromString(toString().toLowerCase());
   }
+  public UTF8String initCap(final byte[] byteArr) {
+    String isoString = null;
+    try {
+      isoString = new String(byteArr, "US-ASCII");
--- End diff --
Other alphabets can be added by following the same approach as this
implementation; the characters it currently handles are the ASCII ones
listed at http://www.asciitable.com/
Users can also build their own enhancements on top of this solution.
Supporting these other alphabets is not an issue limited to this function.
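
To make the ASCII-only behaviour concrete, here is a minimal sketch of the
idea. It is not the code from this pull request: the class name
InitCapSketch and both method signatures are hypothetical, and it assumes
that words are separated by a single space character and that letters
after the first one in a word are lower-cased. Bytes outside the ASCII
range are copied through unchanged, which is where handling for another
alphabet would have to be added.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical helper for illustration only; not part of UTF8String.
final class InitCapSketch {

    // Upper-cases the first ASCII letter of each space-separated word and
    // lower-cases the other ASCII letters. Bytes >= 0x80 (parts of
    // multi-byte UTF-8 sequences) are negative in Java and match neither
    // range check, so they are copied unchanged.
    static byte[] initCap(byte[] input) {
        byte[] out = new byte[input.length];
        boolean startOfWord = true;
        for (int i = 0; i < input.length; i++) {
            byte b = input[i];
            if (startOfWord && b >= 'a' && b <= 'z') {
                b = (byte) (b - 32);   // 'a'..'z' -> 'A'..'Z'
            } else if (!startOfWord && b >= 'A' && b <= 'Z') {
                b = (byte) (b + 32);   // 'A'..'Z' -> 'a'..'z'
            }
            out[i] = b;
            // Assumption: only the space character starts a new word.
            startOfWord = (b == ' ');
        }
        return out;
    }

    // Convenience wrapper that round-trips through UTF-8 bytes.
    static String initCap(String s) {
        byte[] bytes = s.getBytes(StandardCharsets.UTF_8);
        return new String(initCap(bytes), StandardCharsets.UTF_8);
    }
}
```

With this sketch, InitCapSketch.initCap("hello wORLD") gives "Hello World",
while a word that starts with a non-ASCII letter keeps that letter as it is.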