[ 
https://issues.apache.org/jira/browse/FLINK-5771?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15861022#comment-15861022
 ] 

Colin Breame commented on FLINK-5771:
-------------------------------------

The problem is in the loop found at line 570 of DelimitedInputFormat.java:

{code}
while (this.readPos < this.limit && i < this.delimiter.length) {
    if ((this.readBuffer[this.readPos++]) == this.delimiter[i]) {
        i++;
    } else {
        i = 0;
    }
}
{code}
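
To illustrate why this loop can miss a delimiter: on a mismatch it resets i to 0, but the byte that caused the mismatch has already been consumed by readPos++ and is never compared against the first delimiter byte. The following is a minimal, self-contained sketch and not the actual Flink code. The class and method names are hypothetical, and it assumes the whole input sits in a single array, so it ignores buffer refills. scanWithRewind shows one possible correction, which rewinds to the byte after the start of the failed partial match.

```java
import java.nio.charset.StandardCharsets;

public class DelimiterScanSketch {

    // Mirrors the loop quoted above. Returns the position just past the
    // delimiter, or -1 if the end of the buffer is reached first.
    static int scanAsQuoted(byte[] buf, byte[] delimiter) {
        int readPos = 0;
        int i = 0;
        while (readPos < buf.length && i < delimiter.length) {
            if (buf[readPos++] == delimiter[i]) {
                i++;
            } else {
                i = 0; // the mismatching byte is consumed and never retried
            }
        }
        return i == delimiter.length ? readPos : -1;
    }

    // One possible correction (an assumption, not necessarily the project's
    // fix): on a mismatch, rewind to the byte after the start of the failed
    // partial match and search for the whole delimiter again.
    static int scanWithRewind(byte[] buf, byte[] delimiter) {
        int readPos = 0;
        int i = 0;
        while (readPos < buf.length && i < delimiter.length) {
            if (buf[readPos] == delimiter[i]) {
                i++;
            } else {
                readPos -= i; // step back over the partial match
                i = 0;
            }
            readPos++;
        }
        return i == delimiter.length ? readPos : -1;
    }

    public static void main(String[] args) {
        byte[] delimiter = "ab".getBytes(StandardCharsets.US_ASCII);
        byte[] data = "xaab".getBytes(StandardCharsets.US_ASCII);
        System.out.println(scanAsQuoted(data, delimiter));   // delimiter missed
        System.out.println(scanWithRewind(data, delimiter)); // delimiter found
    }
}
```

With the delimiter "ab" and the input "xaab", the first method runs off the end of the data without finding the delimiter, while the second stops just past it. In the real reader a partial match can also span a buffer refill, which this sketch does not attempt to handle.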


> DelimitedInputFormat does not correctly handle multi-byte delimiters
> --------------------------------------------------------------------
>
>                 Key: FLINK-5771
>                 URL: https://issues.apache.org/jira/browse/FLINK-5771
>             Project: Flink
>          Issue Type: Bug
>          Components: filesystem-connector
>    Affects Versions: 1.2.0
>            Reporter: Colin Breame
>         Attachments: Test.java, test.txt
>
>
> The DelimitedInputFormat does not correctly handle multi-byte delimiters.
> The reader sometimes misses a delimiter if it is preceded by the first byte
> of the delimiter. This results in two (or more) records being returned
> from a single call to nextRecord.
> See the attached test case.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
