[ 
https://issues.apache.org/jira/browse/HDFS-6634?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14080031#comment-14080031
 ] 

Allen Wittenauer commented on HDFS-6634:
----------------------------------------

bq. What if the format of the audit log changes? 

Then you'd see a very large contingent of angry admins killing hadoop 
committers in their sleep.  ;) 

In reality, all of the core folks recognize that the current audit format is 
pretty much locked in stone forever and ever.  Any changes would need to be a 
new audit format that goes to a different log and/or file.  I think it was 
[~sureshms] who mentioned that they might make a new one for keep track of a 
context, but the old one would have to remain.

bq. What about the large volume of events in the audit log (e.g. reads)? How 
are those easily filtered out?

The same way you'd expect: 

# read event
# do i care about this event?
# no? toss it.
# yes? act.
# repeat until done

You might get clever with the stream and write a custom handler that partitions 
the data if this is a concern.  I'd be more interested in it serializing the 
data so that one isn't pushing ascii over the wire.  But again, that's all 
pretty easy to do with something like this theoretical setup.  The one *big* 
gotcha is security.  Reading a raw stream of file access like this is clearly a 
security hole and Kafka has zero security today as well.

> inotify in HDFS
> ---------------
>
>                 Key: HDFS-6634
>                 URL: https://issues.apache.org/jira/browse/HDFS-6634
>             Project: Hadoop HDFS
>          Issue Type: New Feature
>          Components: hdfs-client, namenode, qjm
>            Reporter: James Thomas
>            Assignee: James Thomas
>         Attachments: inotify-intro.2.pdf, inotify-intro.pdf
>
>
> Design a mechanism for applications like search engines to access the HDFS 
> edit stream.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to