Refactor InputFormat and OutputFormat for Hive
----------------------------------------------

                 Key: HIVE-1133
                 URL: https://issues.apache.org/jira/browse/HIVE-1133
             Project: Hadoop Hive
          Issue Type: Improvement
    Affects Versions: 0.6.0
            Reporter: Zheng Shao


Currently we ran into several problems of the FileInputFormat/OutputFormat in 
Hive.

The requirements are:
R1. We want to support HBase: HIVE-806
R2. We want to selectively include files based on file names: HIVE-951
R3. We want to optionally choose to recurse on the directory structure: HIVE-108
R4. We want to pass the filter condition into the storage (very useful for 
HBase, and indexed data format)
R5. We want to pass the column selection information into the storage (already 
done as part of the RCFile, but we can do it better)

We need to structure these requirements and the code structure in a good way to 
make it extensible.


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to