Switch from using StorageDrivers to SerDes to do data (de)serialization
-----------------------------------------------------------------------
Key: HCATALOG-237
URL: https://issues.apache.org/jira/browse/HCATALOG-237
Project: HCatalog
Issue Type: Improvement
Components: storage handlers
Affects Versions: 0.4
Reporter: Alan Gates
Fix For: 0.4
HCatalog started by creating its own classes, InputStorageDriver and
OutputStorageDriver, to do data conversion between the storage layer
Input/OutputFormats and the HCatInput/OutputFormats. These provide very
similar functionality to Hive's SerDe class, though with a much simpler
interface.
This usage of separate classes has led to a number of issues for HCatalog.
One, it cannot make use of existing Hive SerDes. Two, it has led to a need to
make HCat specific extensions of Hive interfaces (such as the StorageHandler)
to provide the StorageDescriptors. Three, it means that users who already have
Hive installed cannot use HCatalog without first updating every partition in
their metastore with storage driver information.
I propose we switch to using SerDes for this. To address the issue of the more
complicated SerDe interface we can provide adaptor classes that make writing
new SerDes easy in simple cases.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira