[ 
https://issues.apache.org/jira/browse/HIVE-4531?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13768648#comment-13768648
 ] 

Eugene Koifman commented on HIVE-4531:
--------------------------------------

Review Comments:
Should this include e2e tests in addition (or instead of unit tests).  If (when 
:) Hadoop changes the log file format this will break, but Unit tests won't 
catch this since the data that the tests parse is "static".

Here is a bunch of little things/nits:

o Server.java has “    if (enablelog == true && 
!TempletonUtils.isset(statusdir))      throw new BadParam("enablelog is only 
applicable when statusdir is set");”  in 4 different places.  Can this be a 
method?
o What is the purpose of Server#misc()?
o TempletonControllerJob: import org.apache.hive.hcatalog.templeton.Main; - 
unused import
oo Line 173 - indentation is off
oo Line 295 - writer.close() - This writer is connected to System.err.  What 
are the implications of closing this?  What if something tries to write to it 
later?
o TempletonUtils has unused imports - checkstyle needs to be run on the whole 
patch.
o TestJobIDParser mixes JUnit3 and JUnit4.  It should either not extend 
TestCase (I vote for this) or not use @Test annotations
o Can JobIDParser (and all subclasses) be made package scoped since they are 
not used outside templeton pacakge?  Similarly, can methods be made as private 
as possible?
o JobIDParser#parseJobID() has “fname” param which is not used.  What is the 
intent?  Should it be used in openStatusFile() call?  If not, better to remove 
it.
o JobIDParser#openStatusFile() creas a Reader.  Where/when is it being closed?
o Could the 2 member variables in JobIDParser be made private (even final)?
o Why is TestJobIDParser using findJobID() directly?  Could it not use 
parseJobID()?
o Can JobIDParser have 1 line of class level javadoc about the purpose of this 
class?

                
> [WebHCat] Collecting task logs to hdfs
> --------------------------------------
>
>                 Key: HIVE-4531
>                 URL: https://issues.apache.org/jira/browse/HIVE-4531
>             Project: Hive
>          Issue Type: New Feature
>          Components: HCatalog
>            Reporter: Daniel Dai
>            Assignee: Daniel Dai
>             Fix For: 0.12.0
>
>         Attachments: HIVE-4531-1.patch, HIVE-4531-2.patch, HIVE-4531-3.patch, 
> HIVE-4531-4.patch, HIVE-4531-5.patch, HIVE-4531-6.patch, HIVE-4531-7.patch, 
> HIVE-4531-8.patch, samplestatusdirwithlist.tar.gz
>
>
> It would be nice we collect task logs after job finish. This is similar to 
> what Amazon EMR does.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to