Question about Hadoop task side-effect files//

2010-06-09 Thread wuxy
I found following section at the end of chapter 6 of the book Hadoop, the definitive guide, 'Task side-effect files'; Care needs to be taken to ensure that multiple instances of the same task don't try to write to the same file. There are two problems to avoid: if a task

[jira] Commented: (HIVE-1139) GroupByOperator sometimes throws OutOfMemory error when there are too many distinct keys

2010-06-09 Thread Arvind Prabhakar (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-1139?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12876979#action_12876979 ] Arvind Prabhakar commented on HIVE-1139: I did some preliminary analysis for this

Re: Question about Hadoop task side-effect files//

2010-06-09 Thread Gerrit van Vuuren
Hi, Using tools frameworks pig and hive already avoids this (unless you write your own stores/writers). What these do is each mapper or reducer (depending from where you write your final data to) will write to its own unique file on hdfs. Have a look at the contents of a table in hive which

[jira] Commented: (HIVE-417) Implement Indexing in Hive

2010-06-09 Thread Prafulla Tekawade (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-417?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12877049#action_12877049 ] Prafulla Tekawade commented on HIVE-417: I was thinking of adding something called

[jira] Commented: (HIVE-417) Implement Indexing in Hive

2010-06-09 Thread He Yongqiang (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-417?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12877144#action_12877144 ] He Yongqiang commented on HIVE-417: --- Plan sounds perfectly good to me! Implement Indexing

Cannot access more than one hive prompt

2010-06-09 Thread jaydeep vishwakarma
Hi, I am trying to access two hive prompt from same machine. Only first one is working. But other one hive prompt showing following error when doing simple select query. FAILED: Error in semantic analysis: Unable to fetch table employee How to access more than one hive prompt in same system.

[jira] Commented: (HIVE-1386) HiveQL SQL Compliance (Umbrella)

2010-06-09 Thread Jeff Hammerbacher (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-1386?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12877183#action_12877183 ] Jeff Hammerbacher commented on HIVE-1386: - The discussion in HIVE-61 seems to

Re: Cannot access more than one hive prompt

2010-06-09 Thread Edward Capriolo
On Wed, Jun 9, 2010 at 3:42 PM, jaydeep vishwakarma jaydeep.vishwaka...@mkhoj.com wrote: Hi, I am trying to access two hive prompt from same machine. Only first one is working. But other one hive prompt showing following error when doing simple select query. FAILED: Error in semantic

[jira] Commented: (HIVE-1397) histogram() UDAF for a numerical column

2010-06-09 Thread Edward Capriolo (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-1397?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12877220#action_12877220 ] Edward Capriolo commented on HIVE-1397: --- Looks great. Can not wait. histogram() UDAF

[jira] Commented: (HIVE-1139) GroupByOperator sometimes throws OutOfMemory error when there are too many distinct keys

2010-06-09 Thread Arvind Prabhakar (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-1139?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12877222#action_12877222 ] Arvind Prabhakar commented on HIVE-1139: If there is interest, I can file a separate

[jira] Updated: (HIVE-1373) Missing connection pool plugin in Eclipse classpath

2010-06-09 Thread Ashish Thusoo (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-1373?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ashish Thusoo updated HIVE-1373: Status: Resolved (was: Patch Available) Hadoop Flags: [Reviewed] Fix Version/s:

[jira] Commented: (HIVE-1397) histogram() UDAF for a numerical column

2010-06-09 Thread Ashish Thusoo (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-1397?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12877233#action_12877233 ] Ashish Thusoo commented on HIVE-1397: - +1. This would be a cool contribution.

[jira] Commented: (HIVE-1139) GroupByOperator sometimes throws OutOfMemory error when there are too many distinct keys

2010-06-09 Thread Ashish Thusoo (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-1139?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12877232#action_12877232 ] Ashish Thusoo commented on HIVE-1139: - Arvind, I thought the whole point of this JIRA

[jira] Created: (HIVE-1398) Support union all without an outer select *

2010-06-09 Thread Ashish Thusoo (JIRA)
Support union all without an outer select * --- Key: HIVE-1398 URL: https://issues.apache.org/jira/browse/HIVE-1398 Project: Hadoop Hive Issue Type: Improvement Components: Query Processor

[jira] Commented: (HIVE-417) Implement Indexing in Hive

2010-06-09 Thread Ashish Thusoo (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-417?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12877236#action_12877236 ] Ashish Thusoo commented on HIVE-417: A couple of comments on this: A complication that

[jira] Commented: (HIVE-1139) GroupByOperator sometimes throws OutOfMemory error when there are too many distinct keys

2010-06-09 Thread Arvind Prabhakar (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-1139?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12877239#action_12877239 ] Arvind Prabhakar commented on HIVE-1139: Ashish - no problem - let me explain: The

[jira] Commented: (HIVE-417) Implement Indexing in Hive

2010-06-09 Thread Prafulla Tekawade (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-417?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12877295#action_12877295 ] Prafulla Tekawade commented on HIVE-417: Yes Ashish, Thats what I had in mind.

[jira] Commented: (HIVE-1139) GroupByOperator sometimes throws OutOfMemory error when there are too many distinct keys

2010-06-09 Thread Ning Zhang (JIRA)
[ https://issues.apache.org/jira/browse/HIVE-1139?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12877303#action_12877303 ] Ning Zhang commented on HIVE-1139: -- Arvind, I remember I got this problem