[ https://issues.apache.org/jira/browse/RANGER-1501?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15983378#comment-15983378 ]

Ramesh Mani commented on RANGER-1501:
-------------------------------------

[~coheig] What you are seeing is the behaviour even without this patch. The 
file handle on HDFS is closed either by a rollover of logging to a new file 
(by default it is set to 12:00 am every day) or when the NameNode is restarted 
for some reason. What we found was that during an abrupt restart of the 
NameNode some audit logs were lost, and 
https://issues.apache.org/jira/browse/RANGER-1310 took care of it by 
introducing an AuditFileSpool which logs into the local FS before pushing to 
the destination. We still need to test this patch thoroughly to verify that 
audits are not lost even without the RANGER-1310 fix.
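
For context, below is a rough sketch of the spool-to-local-disk idea behind 
RANGER-1310. The class and method names here are hypothetical and are not the 
actual AuditFileSpool API; the point is only that each audit event is appended 
(and synced) to a local file first, and a separate step later pushes the 
spooled records to the destination, so events still queued locally survive an 
abrupt NameNode restart.

    // Hypothetical sketch of "spool to local FS, then push to destination";
    // this is NOT the actual Ranger AuditFileSpool implementation.
    import java.io.IOException;
    import java.nio.charset.StandardCharsets;
    import java.nio.file.Files;
    import java.nio.file.Path;
    import java.nio.file.StandardOpenOption;
    import java.util.List;

    public class LocalAuditSpool {
        private final Path spoolFile;

        public LocalAuditSpool(Path spoolFile) {
            this.spoolFile = spoolFile;
        }

        // Append the event to the local spool file and sync it, so the record
        // survives even if the remote destination (e.g. HDFS) is unavailable.
        public synchronized void log(String auditEvent) throws IOException {
            Files.write(spoolFile,
                    (auditEvent + System.lineSeparator()).getBytes(StandardCharsets.UTF_8),
                    StandardOpenOption.CREATE, StandardOpenOption.APPEND,
                    StandardOpenOption.SYNC);
        }

        // A background task drains the spool and pushes the records to the real
        // destination; only after a successful push is the spool truncated.
        public synchronized void drainTo(AuditDestination destination) throws IOException {
            if (Files.notExists(spoolFile)) {
                return;
            }
            List<String> pending = Files.readAllLines(spoolFile, StandardCharsets.UTF_8);
            if (pending.isEmpty()) {
                return;
            }
            destination.push(pending);           // hypothetical destination interface
            Files.write(spoolFile, new byte[0]); // truncate after a successful push
        }

        public interface AuditDestination {
            void push(List<String> events) throws IOException;
        }
    }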

> Audit Flush to HDFS does not actually cause the audit logs to be flushed to 
> HDFS 
> ---------------------------------------------------------------------------------
>
>                 Key: RANGER-1501
>                 URL: https://issues.apache.org/jira/browse/RANGER-1501
>             Project: Ranger
>          Issue Type: Bug
>          Components: audit
>    Affects Versions: 0.7.0
>            Reporter: Yan
>            Assignee: Yan
>             Fix For: 1.0.0
>
>         Attachments: 
> 0001-RANGER-1501-Audit-Flush-to-HDFS-does-not-actually-ca.patch
>
>
> The reason is that HDFS file stream's flush() call does not really flush the 
> data all the way to disk, nor does it even make the data visible to HDFS 
> users. See the HDFS semantics of flush/sync at 
> https://issues.apache.org/jira/browse/HADOOP-6313.
> Consequently, the audit logs on HDFS won't be visible or durable to HDFS 
> clients until the log file is closed. Among other issues, this increases the 
> chances of losing audit logs in case of a system failure.
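
To illustrate the flush semantics the description above refers to 
(HADOOP-6313), here is a minimal standalone sketch, not the Ranger patch 
itself, contrasting flush() with hflush()/hsync() on an HDFS output stream; 
the file path and configuration are purely illustrative.

    // Minimal sketch of HDFS flush semantics; path and config are illustrative only.
    import org.apache.hadoop.conf.Configuration;
    import org.apache.hadoop.fs.FSDataOutputStream;
    import org.apache.hadoop.fs.FileSystem;
    import org.apache.hadoop.fs.Path;

    public class HdfsFlushDemo {
        public static void main(String[] args) throws Exception {
            Configuration conf = new Configuration();
            FileSystem fs = FileSystem.get(conf);
            Path auditFile = new Path("/tmp/ranger-audit-demo.log"); // illustrative path

            try (FSDataOutputStream out = fs.create(auditFile, true)) {
                out.writeBytes("sample audit record\n");

                // flush() only pushes bytes out of the client-side buffer; other
                // HDFS clients may still not see the data, and it is not durable.
                out.flush();

                // hflush() forces the data to the datanodes so that new readers
                // can see it, even though the file is still open.
                out.hflush();

                // hsync() additionally asks the datanodes to persist the data to
                // disk, giving durability across datanode restarts.
                out.hsync();
            }
        }
    }

In other words, hflush() makes the data visible to new readers while the file 
is still open, and hsync() additionally persists it, which is what a durable 
audit flush needs rather than a plain flush().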



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
