[jira] [Commented] (HIVE-7333) Create RDD translator, translating Hive Tables into Spark RDDs

Rui Li (JIRA) Tue, 12 Aug 2014 18:46:34 -0700

    [ https://issues.apache.org/jira/browse/HIVE-7333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14095022#comment-14095022 ]


Rui Li commented on HIVE-7333:
------------------------------

Sorry, I forgot to post this here:

I tested the following kinds of tables on Spark:
- TextFile
- SequenceFile
- ORC
- RCFile
- Parquet
- Avro
- Partitioned table
- Bucketed table
- HBase backed table

For each table, I loaded some data into it and ran a count(*) query. For the 
partitioned table, the query was run against only one partition. I verified 
that all the results are correct. There is an issue with the HBase table, 
though; I created HIVE-7467 for it.

The data size is small and the test query is simple, so please let me know if 
there are more tables to be tested or if we need more complicated test cases 
for specific tables.
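
For anyone who wants to repeat a similar smoke test, the queries described above could be generated roughly as follows. This is only a sketch: the table names, the partition column `ds`, and its value are hypothetical placeholders, not the ones actually used in the test.

```python
# Hypothetical table names, one per kind of table listed above.
UNPARTITIONED_TABLES = [
    "t_textfile", "t_sequencefile", "t_orc", "t_rcfile",
    "t_parquet", "t_avro", "t_bucketed", "t_hbase",
]


def count_query(table, partition=None):
    """Build a count(*) query, optionally restricted to one partition."""
    query = f"SELECT count(*) FROM {table}"
    if partition:
        # partition maps column name -> value, e.g. {"ds": "2014-08-12"}
        predicates = " AND ".join(
            f"{col} = '{val}'" for col, val in partition.items()
        )
        query += f" WHERE {predicates}"
    return query


def smoke_test_queries():
    """Return one count(*) query per table kind."""
    queries = [count_query(name) for name in UNPARTITIONED_TABLES]
    # The partitioned table is queried for a single (hypothetical) partition.
    queries.append(count_query("t_partitioned", {"ds": "2014-08-12"}))
    return queries


if __name__ == "__main__":
    for q in smoke_test_queries():
        print(q + ";")
```

The printed statements can then be run through the Hive CLI and the counts compared against the known number of rows loaded into each table.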

> Create RDD translator, translating Hive Tables into Spark RDDs
> --------------------------------------------------------------
>
>                 Key: HIVE-7333
>                 URL: https://issues.apache.org/jira/browse/HIVE-7333
>             Project: Hive
>          Issue Type: Sub-task
>            Reporter: Xuefu Zhang
>            Assignee: Rui Li
>
> Please refer to the design specification.



--
This message was sent by Atlassian JIRA
(v6.2#6252)
