[ https://issues.apache.org/jira/browse/ASTERIXDB-1575?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15437011#comment-15437011 ]
Xikui Wang commented on ASTERIXDB-1575: --------------------------------------- [~wyk] Hi, this one is fixed in the latest master. Please try it out. :) > TwitterParser doesn't support non-ascii strings. > ------------------------------------------------ > > Key: ASTERIXDB-1575 > URL: https://issues.apache.org/jira/browse/ASTERIXDB-1575 > Project: Apache AsterixDB > Issue Type: Bug > Reporter: Wail Alkowaileet > Assignee: Xikui Wang > > Hi, > When I tried to run the TwitterFeed to collect Arabic tweets, I got malformed > strings. > it seems that JObjectUtil.getNormalizedString() discard all Arabic letters in > UTF-8. -- This message was sent by Atlassian JIRA (v6.3.4#6332)