[ 
https://issues.apache.org/jira/browse/NIFI-994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14999408#comment-14999408
 ] 

Mark Payne commented on NIFI-994:
---------------------------------

[~bbende] - great catch! I was able to create a unit test that replicated the 
issue. There were a couple of places where the checksum could have been messed 
up:

- Needed to ensure that if we changed position of RandomAccessFile that we did 
not count the bytes that we "unread" toward the checksum.
- Was a bug where we did not keep the correct checksum after a processor was 
stopped and restarted.

The unit test 'testMultipleRolloversAfterHavingReadAllData' was added and 
failed because it pulled in duplicate data just like you were seeing. With the 
new patch,
this has been resolved.

I attached a new patch separately that should be applied on top of the others 
in order to make it easier for you to understand what changed, vs. squashing 
all the commits.

Please review whenever you get a chance and ensure that all looks good.

Thanks!
-Mark

> Processor to tail files
> -----------------------
>
>                 Key: NIFI-994
>                 URL: https://issues.apache.org/jira/browse/NIFI-994
>             Project: Apache NiFi
>          Issue Type: New Feature
>    Affects Versions: 0.4.0
>            Reporter: Joseph Percivall
>            Assignee: Mark Payne
>             Fix For: 0.4.0
>
>         Attachments: 0001-NIFI-994-Initial-import-of-TailFile.patch, 
> 0002-NIFI-994-Ensure-that-processor-is-not-valid-due-to-t.patch, 
> 0003-NIFI-994-Fixed-issue-that-could-result-in-data-dupli.patch
>
>
> It's a very common data ingest situation to want to input text into the 
> system by "tailing" a file, most commonly log files. Currently we don't have 
> an easy way to do this. 
> A simple processor to tail a file would benefit many users. There would need 
> to be an option to not just tail a file but pick up where the processor left 
> off if it is interrupted.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to