[ https://issues.apache.org/jira/browse/HADOOP-18257?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17945053#comment-17945053 ]
ASF GitHub Bot commented on HADOOP-18257: ----------------------------------------- steveloughran commented on PR #6901: URL: https://github.com/apache/hadoop/pull/6901#issuecomment-2809423199 hasn't adaped to the 11.4 upgrade as it was using avro ser/deser as part of the workflow, and that can't be done with the class restrictions. > Analyzing S3A Audit Logs > ------------------------- > > Key: HADOOP-18257 > URL: https://issues.apache.org/jira/browse/HADOOP-18257 > Project: Hadoop Common > Issue Type: Task > Components: fs/s3 > Reporter: Sravani Gadey > Assignee: Mehakmeet Singh > Priority: Major > Labels: pull-request-available > > The main aim is to analyze S3A Audit logs to give better insights in Hive and > Spark jobs. > Steps involved are: > * Merging audit log files containing huge number of audit logs collected > from a job containing various S3 requests. > * Parsing audit logs using regular expressions i.e., dividing them into key > value pairs. > * Converting the key value pairs into CSV file and AVRO file formats. > * Querying on data which would give better insights for different jobs. > * Visualizing the audit logs on Zeppelin or Jupyter notebook with graphs. -- This message was sent by Atlassian Jira (v8.20.10#820010) --------------------------------------------------------------------- To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: common-issues-h...@hadoop.apache.org