Github user javadba commented on the pull request:

    https://github.com/apache/spark/pull/1586#issuecomment-50427481
  
    I am returning to the issue of the two failed tests for strlen:
    
    Actually my local behavior IS correct - I  made mistake not to include the 
encoding when I did the scala REPL.  Here is the corrected invocation in scala 
REPL: 
    
        scala> "\uF93D\uF936\uF949\uF942".getBytes("UTF-8").length
        res1: Int = 12
    
    Notice that the above returns 12 !
    
    For reference, the answer that happens on Jenkins is correct for the 
default encoding as well as ISO-8859-1:
    
        scala> "\uF93D\uF936\uF949\uF942".getBytes("ISO-8859-1").length
        res2: Int = 4
    
    



---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---

Reply via email to