Re: Question about create hive tables.

2011-05-31 Thread 김영우
Hi dujinhang, See, http://wiki.apache.org/hadoop/Hive/UserGuide add jar ../build/contrib/hive_contrib.jar; CREATE TABLE apachelog ( host STRING, identity STRING, user STRING, time STRING, request STRING, status STRING, size STRING, referer STRING, agent STRING) ROW FORMAT

question about number of map tasks for small file

2011-05-31 Thread Junxian Yan
Hi Guys I use flume to store log file , and use hive to query. Flume always store the small file with suffix .seq Now I have over 35 thousand seq files. Every time when I launch query script, 35 thousand map tasks will be created and it's so long time to wait for completing. I also try to set

RDBS and Hive

2011-05-31 Thread Guy Doulberg
Hey friends, I have been using Hive to explore my HDFS for a while now. In my organization we have some applicative information in SqlServer that occasionally I need to join with data I have in the HDFS. When I need to do such a join, I import the data from the SqlServer using sqoop. I wanted

Re: RDBS and Hive

2011-05-31 Thread Andrew Wilson
Yes, here is the ticket: https://issues.apache.org/jira/browse/HIVE-1555?page=com.atlassian.streams.streams-jira-plugin%3Aactivity-stream-issue-tab#issue-tabshttps://issues.apache.org/jira/browse/HIVE-1555?page=com.atlassian.streams.streams-jira-plugin:activity-stream-issue-tab#issue-tabs We use

Hive basic questions

2011-05-31 Thread abh not
Hi All, I am new to Hive and have been reading http://wiki.apache.org/hadoop/Hive/Tutorial to get better understanding of Hive I am sorry for really basic questions, but I have some confusion, here are couple of questions: 1. what is difference between internal and external table in Hive?