date:20130506

[jira] [Commented] (HIVE-4502) subquery smb joins fails

2013-05-06 Thread Vikram Dixit K (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-4502?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13649536#comment-13649536
 ] 

Vikram Dixit K commented on HIVE-4502:
--

java.lang.NullPointerException
[junit] at 
org.apache.hadoop.hive.ql.exec.ExprNodeColumnEvaluator.initialize(ExprNodeColumnEvaluator.java:82)
[junit] at 
org.apache.hadoop.hive.ql.exec.JoinUtil.getObjectInspectorsFromEvaluators(JoinUtil.java:68)
[junit] at 
org.apache.hadoop.hive.ql.exec.CommonJoinOperator.initializeOp(CommonJoinOperator.java:224)
[junit] at 
org.apache.hadoop.hive.ql.exec.JoinOperator.initializeOp(JoinOperator.java:61)
[junit] at 
org.apache.hadoop.hive.ql.exec.Operator.initialize(Operator.java:375)
[junit] at 
org.apache.hadoop.hive.ql.exec.ExecReducer.configure(ExecReducer.java:154)
[junit] at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
[junit] at 
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
[junit] at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
[junit] at java.lang.reflect.Method.invoke(Method.java:597)
[junit] at 
org.apache.hadoop.util.ReflectionUtils.setJobConf(ReflectionUtils.java:88)
[junit] at 
org.apache.hadoop.util.ReflectionUtils.setConf(ReflectionUtils.java:64)
[junit] at 
org.apache.hadoop.util.ReflectionUtils.newInstance(ReflectionUtils.java:117)
[junit] at 
org.apache.hadoop.mapred.ReduceTask.runOldReducer(ReduceTask.java:486)
[junit] at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:421)
[junit] at 
org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:262)


 subquery smb joins fails
 

 Key: HIVE-4502
 URL: https://issues.apache.org/jira/browse/HIVE-4502
 Project: Hive
  Issue Type: Bug
  Components: Query Processor
Affects Versions: 0.11.0
Reporter: Vikram Dixit K
Assignee: Vikram Dixit K
 Attachments: smb_mapjoin_25.q


 Found this issue while running some SMB joins. Attaching test case that 
 causes this error.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Commented] (HIVE-4501) HS2 memory leak - FileSystem objects in FileSystem.CACHE

2013-05-06 Thread Thejas M Nair (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-4501?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13649542#comment-13649542
 ] 

Thejas M Nair commented on HIVE-4501:
-

[~clarkyzl] Yes, HIVE-3098 and HIVE-3155 are related, as it is same kind of 
leak is seen, but with metastore and hive server1 instead.


 HS2 memory leak - FileSystem objects in FileSystem.CACHE
 

 Key: HIVE-4501
 URL: https://issues.apache.org/jira/browse/HIVE-4501
 Project: Hive
  Issue Type: Bug
  Components: HiveServer2
Affects Versions: 0.11.0
Reporter: Thejas M Nair

 org.apache.hadoop.fs.FileSystem objects are getting accumulated in 
 FileSystem.CACHE, with HS2 in unsecure mode.
 As a workaround, it is possible to set fs.hdfs.impl.disable.cache and 
 fs.file.impl.disable.cache to false.
 Users should not have to bother with this extra configuration. 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Commented] (HIVE-4501) HS2 memory leak - FileSystem objects in FileSystem.CACHE

2013-05-06 Thread Thejas M Nair (JIRA)


[ 
https://issues.apache.org/jira/browse/HIVE-4501?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13649543#comment-13649543
 ] 

Thejas M Nair commented on HIVE-4501:
-

I have updated hiveserver2 setup instructions to disable the fs caches - 
https://cwiki.apache.org/confluence/display/Hive/Setting+up+HiveServer2

 HS2 memory leak - FileSystem objects in FileSystem.CACHE
 

 Key: HIVE-4501
 URL: https://issues.apache.org/jira/browse/HIVE-4501
 Project: Hive
  Issue Type: Bug
  Components: HiveServer2
Affects Versions: 0.11.0
Reporter: Thejas M Nair

 org.apache.hadoop.fs.FileSystem objects are getting accumulated in 
 FileSystem.CACHE, with HS2 in unsecure mode.
 As a workaround, it is possible to set fs.hdfs.impl.disable.cache and 
 fs.file.impl.disable.cache to false.
 Users should not have to bother with this extra configuration. 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (HIVE-4503) HiveServer have too many opened fd。

2013-05-06 Thread sutao bian (JIRA)

sutao bian created HIVE-4503:


 Summary: HiveServer have  too  many opened  fd。 
 Key: HIVE-4503
 URL: https://issues.apache.org/jira/browse/HIVE-4503
 Project: Hive
  Issue Type: Bug
Affects Versions: 0.8.1
 Environment: Hive: hive-0.8.1 
OS: Red Hat Enterprise Linux Server release 5.7 (Tikanga)
Hadoop: 0.20.205

Reporter: sutao bian


When i run hiveserver a while time it will occur error   Caused by: 
java.io.FileNotFoundException: 
/opt/tmp/mapred/local/jobTracker/job_201301251143_76286.xml (Too many open 
files)

more errors info : 


013-05-06 02:54:47,426 WARN  parse.SemanticAnalyzer 
(SemanticAnalyzer.java:genBodyPlan(5821)) - Common Gby keys:null
2013-05-06 02:54:50,386 WARN  mapred.JobClient 
(JobClient.java:copyAndConfigureFiles(659)) - Use GenericOptionsParser for 
parsing the arguments. Applications should implement Tool for the same.
2013-05-06 02:54:52,565 ERROR exec.Task (SessionState.java:printError(380)) - 
Job Submission failed with exception 
'org.apache.hadoop.ipc.RemoteException(java.io.IOException: 
java.io.FileNotFoundException: 
/opt/tmp/mapred/local/jobTracker/job_201301251143_76286.xml (Too many open 
files)
at org.apache.hadoop.mapred.JobTracker.submitJob(JobTracker.java:3943)
at sun.reflect.GeneratedMethodAccessor1278.invoke(Unknown Source)
at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:601)
at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:563)
at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:1388)
at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:1384)
at java.security.AccessController.doPrivileged(Native Method)
at javax.security.auth.Subject.doAs(Subject.java:415)
at 
org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1059)
at org.apache.hadoop.ipc.Server$Handler.run(Server.java:1382)
Caused by: java.io.FileNotFoundException: 
/opt/tmp/mapred/local/jobTracker/job_201301251143_76286.xml (Too many open 
files)
at java.io.FileOutputStream.open(Native Method)
at java.io.FileOutputStream.init(FileOutputStream.java:212)
at 
org.apache.hadoop.fs.RawLocalFileSystem$LocalFSFileOutputStream.init(RawLocalFileSystem.java:188)
at 
org.apache.hadoop.fs.RawLocalFileSystem$LocalFSFileOutputStream.init(RawLocalFileSystem.java:184)
at 
org.apache.hadoop.fs.RawLocalFileSystem.create(RawLocalFileSystem.java:242)
at 
org.apache.hadoop.fs.ChecksumFileSystem$ChecksumFSOutputSummer.init(ChecksumFileSystem.java:335)
at 
org.apache.hadoop.fs.ChecksumFileSystem.create(ChecksumFileSystem.java:368)
at org.apache.hadoop.fs.FileSystem.create(FileSystem.java:546)
at org.apache.hadoop.fs.FileSystem.create(FileSystem.java:527)
at org.apache.hadoop.fs.FileSystem.create(FileSystem.java:434)
at org.apache.hadoop.fs.FileUtil.copy(FileUtil.java:229)
at org.apache.hadoop.fs.FileUtil.copy(FileUtil.java:163)
at org.apache.hadoop.fs.FileSystem.copyToLocalFile(FileSystem.java:1164)
at org.apache.hadoop.fs.FileSystem.copyToLocalFile(FileSystem.java:1145)
at org.apache.hadoop.mapred.JobInProgress.init(JobInProgress.java:415)
at org.apache.hadoop.mapred.JobTracker.submitJob(JobTracker.java:3941)
... 10 more
)'

when i restart the hiveserver it will be ok .

Thanks 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (HIVE-4504) HiveServer have too many opened fd。

2013-05-06 Thread sutao bian (JIRA)

sutao bian created HIVE-4504:


 Summary: HiveServer have  too  many opened  fd。 
 Key: HIVE-4504
 URL: https://issues.apache.org/jira/browse/HIVE-4504
 Project: Hive
  Issue Type: Bug
Affects Versions: 0.8.1
 Environment: Hive: hive-0.8.1 
OS: Red Hat Enterprise Linux Server release 5.7 (Tikanga)
Hadoop: 0.20.205

Reporter: sutao bian


When i run hiveserver a while time it will occur error   Caused by: 
java.io.FileNotFoundException: 
/opt/tmp/mapred/local/jobTracker/job_201301251143_76286.xml (Too many open 
files)

more errors info : 


013-05-06 02:54:47,426 WARN  parse.SemanticAnalyzer 
(SemanticAnalyzer.java:genBodyPlan(5821)) - Common Gby keys:null
2013-05-06 02:54:50,386 WARN  mapred.JobClient 
(JobClient.java:copyAndConfigureFiles(659)) - Use GenericOptionsParser for 
parsing the arguments. Applications should implement Tool for the same.
2013-05-06 02:54:52,565 ERROR exec.Task (SessionState.java:printError(380)) - 
Job Submission failed with exception 
'org.apache.hadoop.ipc.RemoteException(java.io.IOException: 
java.io.FileNotFoundException: 
/opt/tmp/mapred/local/jobTracker/job_201301251143_76286.xml (Too many open 
files)
at org.apache.hadoop.mapred.JobTracker.submitJob(JobTracker.java:3943)
at sun.reflect.GeneratedMethodAccessor1278.invoke(Unknown Source)
at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:601)
at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:563)
at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:1388)
at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:1384)
at java.security.AccessController.doPrivileged(Native Method)
at javax.security.auth.Subject.doAs(Subject.java:415)
at 
org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1059)
at org.apache.hadoop.ipc.Server$Handler.run(Server.java:1382)
Caused by: java.io.FileNotFoundException: 
/opt/tmp/mapred/local/jobTracker/job_201301251143_76286.xml (Too many open 
files)
at java.io.FileOutputStream.open(Native Method)
at java.io.FileOutputStream.init(FileOutputStream.java:212)
at 
org.apache.hadoop.fs.RawLocalFileSystem$LocalFSFileOutputStream.init(RawLocalFileSystem.java:188)
at 
org.apache.hadoop.fs.RawLocalFileSystem$LocalFSFileOutputStream.init(RawLocalFileSystem.java:184)
at 
org.apache.hadoop.fs.RawLocalFileSystem.create(RawLocalFileSystem.java:242)
at 
org.apache.hadoop.fs.ChecksumFileSystem$ChecksumFSOutputSummer.init(ChecksumFileSystem.java:335)
at 
org.apache.hadoop.fs.ChecksumFileSystem.create(ChecksumFileSystem.java:368)
at org.apache.hadoop.fs.FileSystem.create(FileSystem.java:546)
at org.apache.hadoop.fs.FileSystem.create(FileSystem.java:527)
at org.apache.hadoop.fs.FileSystem.create(FileSystem.java:434)
at org.apache.hadoop.fs.FileUtil.copy(FileUtil.java:229)
at org.apache.hadoop.fs.FileUtil.copy(FileUtil.java:163)
at org.apache.hadoop.fs.FileSystem.copyToLocalFile(FileSystem.java:1164)
at org.apache.hadoop.fs.FileSystem.copyToLocalFile(FileSystem.java:1145)
at org.apache.hadoop.mapred.JobInProgress.init(JobInProgress.java:415)
at org.apache.hadoop.mapred.JobTracker.submitJob(JobTracker.java:3941)
... 10 more
)'

when i restart the hiveserver it will be ok .

Thanks 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (HIVE-4505) Hive can't load transforms added using 'ADD FILE'

2013-05-06 Thread Prasad Mujumdar (JIRA)

Prasad Mujumdar created HIVE-4505:
-

 Summary: Hive can't load transforms added using 'ADD FILE'
 Key: HIVE-4505
 URL: https://issues.apache.org/jira/browse/HIVE-4505
 Project: Hive
  Issue Type: Bug
  Components: Query Processor
Affects Versions: 0.11.0
Reporter: Prasad Mujumdar
Assignee: Prasad Mujumdar
Priority: Blocker
 Fix For: 0.11.0, 0.12.0


ADD FILE mangles name of the resource when copying to resource download 
directory. As a results following doesn't work:

{code:sql}
ADD FILE test.py;
SELECT TRANSFORM (id) USING 'python test.py' AS b FROM tab1;
{code}

The resource gets added with a different name every time which makes it 
impossible to use transform in non-interactive mode.

This seems to be due to HIVE-3431

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Review Request: HIVE-4505: Hive can't load transforms added using 'ADD FILE'

2013-05-06 Thread Prasad Mujumdar


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/10945/
---

Review request for hive, Ashutosh Chauhan and Navis Ryu.


Description
---

This patch provides addional options for users to configure the resource dir in 
order to avoid conflicts, instead of renaming the files.
- revert HIVE-3431
- Store the hiveserver2 session handle in the config so that it can be used for 
the resource directory setting. eg
property
  namehive.downloaded.resources.dir/name
  value/tmp/resource_dir/${hive.server2.session}/value
/property
- support removing the resource directory at the end of the session.

One can configure the resource dir based on the session id or hiveserver2 
session handle to avoid multiple users trying to use common resource directory.


This addresses bug HIVE-4505.
https://issues.apache.org/jira/browse/HIVE-4505


Diffs
-

  common/src/java/org/apache/hadoop/hive/conf/HiveConf.java 5c1b283 
  ql/src/java/org/apache/hadoop/hive/ql/session/SessionState.java d8c91bd 
  service/src/java/org/apache/hive/service/cli/session/HiveSessionImpl.java 
8f0adb5 

Diff: https://reviews.apache.org/r/10945/diff/


Testing
---


Thanks,

Prasad Mujumdar

[jira] [Updated] (HIVE-4505) Hive can't load transforms added using 'ADD FILE'

2013-05-06 Thread Prasad Mujumdar (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-4505?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Prasad Mujumdar updated HIVE-4505:
--

Status: Patch Available  (was: Open)

Review request on https://reviews.apache.org/r/10945/

 Hive can't load transforms added using 'ADD FILE'
 -

 Key: HIVE-4505
 URL: https://issues.apache.org/jira/browse/HIVE-4505
 Project: Hive
  Issue Type: Bug
  Components: Query Processor
Affects Versions: 0.11.0
Reporter: Prasad Mujumdar
Assignee: Prasad Mujumdar
Priority: Blocker
 Fix For: 0.11.0, 0.12.0

 Attachments: HIVE-4505-1.patch


 ADD FILE mangles name of the resource when copying to resource download 
 directory. As a results following doesn't work:
 {code:sql}
 ADD FILE test.py;
 SELECT TRANSFORM (id) USING 'python test.py' AS b FROM tab1;
 {code}
 The resource gets added with a different name every time which makes it 
 impossible to use transform in non-interactive mode.
 This seems to be due to HIVE-3431

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HIVE-4505) Hive can't load transforms added using 'ADD FILE'

2013-05-06 Thread Prasad Mujumdar (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-4505?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Prasad Mujumdar updated HIVE-4505:
--

Attachment: HIVE-4505-1.patch

Patch for 0.11 branch

 Hive can't load transforms added using 'ADD FILE'
 -

 Key: HIVE-4505
 URL: https://issues.apache.org/jira/browse/HIVE-4505
 Project: Hive
  Issue Type: Bug
  Components: Query Processor
Affects Versions: 0.11.0
Reporter: Prasad Mujumdar
Assignee: Prasad Mujumdar
Priority: Blocker
 Fix For: 0.11.0, 0.12.0

 Attachments: HIVE-4505-1.patch


 ADD FILE mangles name of the resource when copying to resource download 
 directory. As a results following doesn't work:
 {code:sql}
 ADD FILE test.py;
 SELECT TRANSFORM (id) USING 'python test.py' AS b FROM tab1;
 {code}
 The resource gets added with a different name every time which makes it 
 impossible to use transform in non-interactive mode.
 This seems to be due to HIVE-3431

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Re: [VOTE] Apache Hive 0.11.0 Release Candidate 1

2013-05-06 Thread Prasad Mujumdar

  -1 (non-binding)
My apologies, but HIVE-4505 is a regression that IMHO should be addressed.

thanks
Prasad


On Tue, Apr 30, 2013 at 5:18 PM, Ashutosh Chauhan hashut...@apache.orgwrote:

 Hey all,

 Based on feedback from folks, I have respun release candidate, RC1.
 Please take a look. It basically fixes the size bloat of tarball.

 Source tag for RC1 is at:

 https://svn.apache.org/repos/asf/hive/tags/release-0.11.0-rc1


 Source tar ball and convenience binary artifacts can be found
 at:http://people.apache.org/~hashutosh/hive-0.11.0-rc1/

 Maven artifacts for hive are available
 at:https://repository.apache.org/content/repositories/orgapachehive-158/

 Maven artifacts for hcatalog are available at:

 https://repository.apache.org/content/repositories/orgapachehcatalog-159/


 This release has many goodies including HiveServer2, integrated
 hcatalog, windowing and analytical functions, decimal data type,
 better query planning, performance enhancements and various bug fixes.
 In total, we resolved more than 350 issues. Full list of fixed issues
 can be found at:  http://s.apache.org/8Fr


 Voting will conclude in 72 hours.

 Hive PMC Members: Please test and vote.

 Thanks,

 Ashutosh (On behalf of Hive contributors who made 0.11 a possibility)

[jira] [Updated] (HIVE-2616) Passing user identity from metastore client to server in non-secure mode

2013-05-06 Thread Zhuoluo (Clark) Yang (JIRA)


 [ 
https://issues.apache.org/jira/browse/HIVE-2616?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Zhuoluo (Clark) Yang updated HIVE-2616:
---

Issue Type: New Feature  (was: Bug)

 Passing user identity from metastore client to server in non-secure mode
 

 Key: HIVE-2616
 URL: https://issues.apache.org/jira/browse/HIVE-2616
 Project: Hive
  Issue Type: New Feature
  Components: Metastore
Reporter: Ashutosh Chauhan
Assignee: Ashutosh Chauhan
 Fix For: 0.8.1, 0.9.0

 Attachments: hive-2616_1.patch, hive-2616_3.patch, hive-2616_4.patch, 
 hive-2616_5.patch, hive-2616.patch


 Currently in unsecure mode client don't pass on user identity. As a result 
 hdfs and other operations done by server gets executed by user running 
 metastore process instead of being done in context of client. This results in 
 problem as reported here: 
 http://mail-archives.apache.org/mod_mbox/hive-user/20.mbox/%3CCAK0mCrRC3aPqtRHDe2J25Rm0JX6TS1KXxd7KPjqJjoqBjg=a...@mail.gmail.com%3E

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (HIVE-4506) join multi small tables

2013-05-06 Thread Fern (JIRA)

Fern created HIVE-4506:
--

 Summary: join multi small tables 
 Key: HIVE-4506
 URL: https://issues.apache.org/jira/browse/HIVE-4506
 Project: Hive
  Issue Type: Wish
Reporter: Fern




--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HIVE-4506) use one map reduce to join multiple small tables

2013-05-06 Thread Fern (JIRA)

[
https://issues.apache.org/jira/browse/HIVE-4506?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Fern updated HIVE-4506:
---

Priority: Minor (was: Major)
Description:
I know we can use map side join for small table.
by my test, if I use HQL like this
--
select /*+mapjoin(b,c)*/...
from a
left join b
on ...
left join c
on ...
---
b and c are both small tables, I expect do the join in one map reduce using map
side join. Actually, it would generate two map-reduce by sequence.

Sorry, currently I am just a user of hive and not dig into the code, so I this
is what I expect and have no idea about how to improve.
Affects Version/s: 0.10.0
Summary: use one map reduce to join multiple small tables (was:
join multi small tables )

use one map reduce to join multiple small tables
-

Key: HIVE-4506
URL: https://issues.apache.org/jira/browse/HIVE-4506
Project: Hive
Issue Type: Wish
Affects Versions: 0.10.0
Reporter: Fern
Priority: Minor

I know we can use map side join for small table.
by my test, if I use HQL like this
--
select /*+mapjoin(b,c)*/...
from a
left join b
on ...
left join c
on ...
---
b and c are both small tables, I expect do the join in one map reduce using
map side join. Actually, it would generate two map-reduce by sequence.
Sorry, currently I am just a user of hive and not dig into the code, so I
this is what I expect and have no idea about how to improve.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HIVE-4506) use one map reduce to join multiple small tables

2013-05-06 Thread Fern (JIRA)

[
https://issues.apache.org/jira/browse/HIVE-4506?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Fern updated HIVE-4506:
---

Description:
I know we can use map side join for small table.
by my test, if I use HQL like this
--
select /*+mapjoin(b,c)*/...
from a
left join b
on ...
left join c
on ...
---
b and c are both small tables, I expect do the join in one map reduce using map
side join. Actually, it would generate two map-reduce jobs by sequence.

Sorry, currently I am just a user of hive and not dig into the code, so I this
is what I expect and have no idea about how to improve.

was:
I know we can use map side join for small table.
by my test, if I use HQL like this
--
select /*+mapjoin(b,c)*/...
from a
left join b
on ...
left join c
on ...
---
b and c are both small tables, I expect do the join in one map reduce using map
side join. Actually, it would generate two map-reduce by sequence.

Sorry, currently I am just a user of hive and not dig into the code, so I this
is what I expect and have no idea about how to improve.

use one map reduce to join multiple small tables
-

Key: HIVE-4506
URL: https://issues.apache.org/jira/browse/HIVE-4506
Project: Hive
Issue Type: Wish
Affects Versions: 0.10.0
Reporter: Fern
Priority: Minor

I know we can use map side join for small table.
by my test, if I use HQL like this
--
select /*+mapjoin(b,c)*/...
from a
left join b
on ...
left join c
on ...
---
b and c are both small tables, I expect do the join in one map reduce using
map side join. Actually, it would generate two map-reduce jobs by sequence.
Sorry, currently I am just a user of hive and not dig into the code, so I
this is what I expect and have no idea about how to improve.

[jira] [Updated] (HIVE-4506) use one map reduce to join multiple small tables

2013-05-06 Thread Fern (JIRA)

[
https://issues.apache.org/jira/browse/HIVE-4506?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Fern updated HIVE-4506:
---

Sorry, currently I am just a user of hive and not dig into the code, so this is
what I expect but I have no idea about how to improve now.

Sorry, currently I am just a user of hive and not dig into the code, so I this
is what I expect and have no idea about how to improve.

use one map reduce to join multiple small tables
-

Key: HIVE-4506
URL: https://issues.apache.org/jira/browse/HIVE-4506
Project: Hive
Issue Type: Wish
Affects Versions: 0.10.0
Reporter: Fern
Priority: Minor

I know we can use map side join for small table.
by my test, if I use HQL like this
--
select /*+mapjoin(b,c)*/...
from a
left join b
on ...
left join c
on ...
---
b and c are both small tables, I expect do the join in one map reduce using
map side join. Actually, it would generate two map-reduce jobs by sequence.
Sorry, currently I am just a user of hive and not dig into the code, so this
is what I expect but I have no idea about how to improve now.

1 2 >

1 - 100 of 125 matches

Mail list logo