[ 
https://issues.apache.org/jira/browse/HCATALOG-308?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13229682#comment-13229682
 ] 

Sushanth Sowmyan commented on HCATALOG-308:
-------------------------------------------

As an update, this jira was first of all intended to move the 
configureInputJobProperties call from the Partition level to the table level, 
since we were passing a TableDesc to it, and the idea was to eventually 
introduce a new configureInputPartitionJobProperties at a partition level.

Much of this is due to StorageHandler being a Table-level concept in hive, and 
us attempting to use that to replace StorageDrivers, which were a partition 
level concept in HCat. Hive does not use a storage handler for simpler IF/OF 
based partitions, and thus allows partitions to use different file formats. 
With our change, however, we tried to unify code under FosterStorageHandler, 
and have it do any and all initialization. This works, as long as the table's 
IF is the same through it.

So, now, given that we have to step back on this, this is missing the time 
window to release with 0.4, which means we should attempt to solve this in a 
more complete manner, starting with changing hive's StorageHandlers to 
understand partitions and then migrating HCat calls to use appropriately.

Other fixes that this mammoth patch also tried to address were:

a) Consistency between input and output - will be addressed with the SH 
partition revamp.
b) Change of method signatures to (TableDesc) rather than 
(TableDesc,Map<String,String>) to increase readability and simplicity - will 
likely be abandoned because we will have backward compatibility headaches 
later, we'll stick to the older signature
c) Replacement of Properties with Map<String,String> in InputJobInfo and 
OutputJobInfo - will have to be abandoned for backward compatibility reasons. 
At some point, we might decide to introduce something easier to use than 
InputJobInfo/OutputJobInfo (maybe even with a builder interface!) to make it 
easier/simpler to interface with the user, and deprecate the current one, but 
that's at least a few revisions away.
d) Removal of mapred IF/OF from HCat - can easily be done, and should. - 
Created https://issues.apache.org/jira/browse/HCATALOG-319 for it.
                
> TableDesc and jobProperties related changes to configureInputJobProperties 
> and configureOutputJobProperties
> -----------------------------------------------------------------------------------------------------------
>
>                 Key: HCATALOG-308
>                 URL: https://issues.apache.org/jira/browse/HCATALOG-308
>             Project: HCatalog
>          Issue Type: Sub-task
>            Reporter: Sushanth Sowmyan
>            Assignee: Sushanth Sowmyan
>             Fix For: 0.4
>
>         Attachments: HCATALOG-308.2.patch, HCATALOG-308.4.patch, 
> HCATALOG-308.patch
>
>


--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to