[
https://issues.apache.org/jira/browse/HIVE-7333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14095022#comment-14095022
]
Rui Li commented on HIVE-7333:
------------------------------
Sorry I forgot to put these here:
I tested the following kinds of tables for spark:
- TextFile
- SequenceFile
- Orc
- RCFile
- Parquet
- Avro
- Partitioned table
- Bucketed table
- HBase backed table
For each table, I load some data into it and run a count (*) query. For
partitioned table the query is run for only one partition. I verified all the
results are correct. There's some issue with HBase table though, I created
HIVE-7467 for it.
The data size is small and the test query is simple. So please let me know if
there's more tables to be tested or we need more complicated test cases for
specific tables.
> Create RDD translator, translating Hive Tables into Spark RDDs
> --------------------------------------------------------------
>
> Key: HIVE-7333
> URL: https://issues.apache.org/jira/browse/HIVE-7333
> Project: Hive
> Issue Type: Sub-task
> Reporter: Xuefu Zhang
> Assignee: Rui Li
>
> Please refer to the design specification.
--
This message was sent by Atlassian JIRA
(v6.2#6252)