[
https://issues.apache.org/jira/browse/CAMEL-7584?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Aki Yoshida resolved CAMEL-7584.
--------------------------------
Resolution: Fixed
> XML-Aware Tokenizer failing with utf-8 multibyte characters
> -----------------------------------------------------------
>
> Key: CAMEL-7584
> URL: https://issues.apache.org/jira/browse/CAMEL-7584
> Project: Camel
> Issue Type: Bug
> Components: camel-core
> Reporter: Aki Yoshida
> Assignee: Aki Yoshida
> Fix For: 2.14.0
>
>
> There is some issue in the underlining Stax reader's
> getLocation().getCharOffset() when the input data is an InputStream to the
> stax reader.
> This issue was brought up in the woodstox community. But I believe fixing it
> seems to be non trivial as woodstox internally uses char/Reader and keeps the
> offset value to the character sequence and not to the original input stream.
> We change the tokenzer to pass java.io.Reader to the woodstox parser instead
> of passing java.io.InputStream directly.
--
This message was sent by Atlassian JIRA
(v6.2#6252)