[jira] Commented: (HIVE-1095) Hive in Maven

2011-03-10 Thread Amareshwari Sriramadasu (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-1095?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13005024#comment-13005024
 ] 

Amareshwari Sriramadasu commented on HIVE-1095:
---

Gerrit, are you planning to regenerate the patch incorporating Carl's comments?

 Hive in Maven
 -

 Key: HIVE-1095
 URL: https://issues.apache.org/jira/browse/HIVE-1095
 Project: Hive
  Issue Type: Task
  Components: Build Infrastructure
Affects Versions: 0.6.0
Reporter: Gerrit Jansen van Vuuren
Priority: Minor
 Attachments: HIVE-1095-trunk.patch, hiveReleasedToMaven.tar.gz


 Getting hive into maven main repositories
 Documentation on how to do this is on:
 http://maven.apache.org/guides/mini/guide-central-repository-upload.html

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] Commented: (HIVE-1877) Add java_method() as a synonym for the reflect() UDF

2011-03-10 Thread Amareshwari Sriramadasu (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-1877?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13005034#comment-13005034
 ] 

Amareshwari Sriramadasu commented on HIVE-1877:
---

+1 Patch looks good

 Add java_method() as a synonym for the reflect() UDF
 

 Key: HIVE-1877
 URL: https://issues.apache.org/jira/browse/HIVE-1877
 Project: Hive
  Issue Type: Improvement
  Components: UDF
Affects Versions: 0.7.0
Reporter: Carl Steinbach
Assignee: Carl Steinbach
 Attachments: HIVE-1877.1.patch.txt, HIVE-1877.2.patch.txt


 HIVE-471 added the reflect() UDF which allows people to invoke static Java 
 methods from within HQL
 queries. In my opinion the name is confusing since it describes how the UDF 
 works instead of what
 it does. I propose changing the name of (or providing a synonym for) the UDF 
 to something like
  'invoke_method' or 'java_method', or something similar. I'm open to 
 suggestions.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] Updated: (HIVE-1694) Accelerate GROUP BY execution using indexes

2011-03-10 Thread Prajakta Kalmegh (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-1694?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Prajakta Kalmegh updated HIVE-1694:
---

Attachment: HIVE-1694.3.patch.txt

Patch with new index type support and optimizer code that uses the new index 
type.

 Accelerate GROUP BY execution using indexes
 ---

 Key: HIVE-1694
 URL: https://issues.apache.org/jira/browse/HIVE-1694
 Project: Hive
  Issue Type: New Feature
  Components: Indexing, Query Processor
Affects Versions: 0.7.0
Reporter: Nikhil Deshpande
Assignee: Prajakta Kalmegh
 Attachments: HIVE-1694.1.patch.txt, HIVE-1694.2.patch.txt, 
 HIVE-1694.3.patch.txt, HIVE-1694_2010-10-28.diff, demo_q1.hql, demo_q2.hql


 The index building patch (Hive-417) is checked into trunk, this JIRA issue 
 tracks supporting indexes in Hive compiler  execution engine for SELECT 
 queries.
 This is in ref. to John's comment at
 https://issues.apache.org/jira/browse/HIVE-417?focusedCommentId=12884869page=com.atlassian.jira.plugin.system.issuetabpanels%3Acomment-tabpanel#action_12884869
 on creating separate JIRA issue for tracking index usage in optimizer  query 
 execution.
 The aim of this effort is to use indexes to accelerate query execution (for 
 certain class of queries). E.g.
 - Filters and range scans (already being worked on by He Yongqiang as part of 
 HIVE-417?)
 - Joins (index based joins)
 - Group By, Order By and other misc cases
 The proposal is multi-step:
 1. Building index based operators, compiler and execution engine changes
 2. Optimizer enhancements (e.g. cost-based optimizer to compare and choose 
 between index scans, full table scans etc.)
 This JIRA initially focuses on the first step. This JIRA is expected to hold 
 the information about index based plans  operator implementations for above 
 mentioned cases. 

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] Commented: (HIVE-1694) Accelerate GROUP BY execution using indexes

2011-03-10 Thread Prajakta Kalmegh (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-1694?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13005045#comment-13005045
 ] 

Prajakta Kalmegh commented on HIVE-1694:


Hi John

Please find attached the patch with new index type support. We have also made 
changes to the our optimizer code to use count of indexed columns from this new 
index type (instead of computing the size(_offsets)). Can you please upload it 
for review on ReviewBoard?

Thanks.

 Accelerate GROUP BY execution using indexes
 ---

 Key: HIVE-1694
 URL: https://issues.apache.org/jira/browse/HIVE-1694
 Project: Hive
  Issue Type: New Feature
  Components: Indexing, Query Processor
Affects Versions: 0.7.0
Reporter: Nikhil Deshpande
Assignee: Prajakta Kalmegh
 Attachments: HIVE-1694.1.patch.txt, HIVE-1694.2.patch.txt, 
 HIVE-1694.3.patch.txt, HIVE-1694_2010-10-28.diff, demo_q1.hql, demo_q2.hql


 The index building patch (Hive-417) is checked into trunk, this JIRA issue 
 tracks supporting indexes in Hive compiler  execution engine for SELECT 
 queries.
 This is in ref. to John's comment at
 https://issues.apache.org/jira/browse/HIVE-417?focusedCommentId=12884869page=com.atlassian.jira.plugin.system.issuetabpanels%3Acomment-tabpanel#action_12884869
 on creating separate JIRA issue for tracking index usage in optimizer  query 
 execution.
 The aim of this effort is to use indexes to accelerate query execution (for 
 certain class of queries). E.g.
 - Filters and range scans (already being worked on by He Yongqiang as part of 
 HIVE-417?)
 - Joins (index based joins)
 - Group By, Order By and other misc cases
 The proposal is multi-step:
 1. Building index based operators, compiler and execution engine changes
 2. Optimizer enhancements (e.g. cost-based optimizer to compare and choose 
 between index scans, full table scans etc.)
 This JIRA initially focuses on the first step. This JIRA is expected to hold 
 the information about index based plans  operator implementations for above 
 mentioned cases. 

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


Build failed in Jenkins: Hive-trunk-h0.20 #603

2011-03-10 Thread Apache Hudson Server
See https://hudson.apache.org/hudson/job/Hive-trunk-h0.20/603/

--
Started by timer
Building remotely on ubuntu1
FATAL: Unable to call fetch2. Invalid object ID 6395
java.lang.IllegalStateException: Unable to call fetch2. Invalid object ID 6395
at 
hudson.remoting.RemoteInvocationHandler$RPCRequest.perform(RemoteInvocationHandler.java:268)
at hudson.remoting.Request$2.run(Request.java:270)
at 
java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:441)
at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303)
at java.util.concurrent.FutureTask.run(FutureTask.java:138)
at 
java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
at java.lang.Thread.run(Thread.java:662)



[jira] Updated: (HIVE-2030) isEmptyPath() to use ContentSummary cache

2011-03-10 Thread Siying Dong (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-2030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Siying Dong updated HIVE-2030:
--

Attachment: HIVE-2030.3.patch

 isEmptyPath() to use ContentSummary cache
 -

 Key: HIVE-2030
 URL: https://issues.apache.org/jira/browse/HIVE-2030
 Project: Hive
  Issue Type: Improvement
Reporter: Siying Dong
Assignee: Siying Dong
Priority: Minor
 Attachments: HIVE-2030.1.patch, HIVE-2030.2.patch, HIVE-2030.3.patch


 addInputPaths() calls isEmptyPath() for every input path. Now every call is a 
 DFS namenode call. Making isEmptyPath() to use cached ContentSummary, we 
 should be able to avoid some namenode calls and reduce latency in the case of 
 multiple partitions.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] Commented: (HIVE-2030) isEmptyPath() to use ContentSummary cache

2011-03-10 Thread He Yongqiang (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13005358#comment-13005358
 ] 

He Yongqiang commented on HIVE-2030:


running tests with the new patch

 isEmptyPath() to use ContentSummary cache
 -

 Key: HIVE-2030
 URL: https://issues.apache.org/jira/browse/HIVE-2030
 Project: Hive
  Issue Type: Improvement
Reporter: Siying Dong
Assignee: Siying Dong
Priority: Minor
 Attachments: HIVE-2030.1.patch, HIVE-2030.2.patch, HIVE-2030.3.patch


 addInputPaths() calls isEmptyPath() for every input path. Now every call is a 
 DFS namenode call. Making isEmptyPath() to use cached ContentSummary, we 
 should be able to avoid some namenode calls and reduce latency in the case of 
 multiple partitions.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] Updated: (HIVE-2040) the retry logic in Hive's concurrency is not working correctly.

2011-03-10 Thread He Yongqiang (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-2040?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

He Yongqiang updated HIVE-2040:
---

Summary: the retry logic in Hive's concurrency  is not working correctly.  
(was: the retry logic in Hive is not working.)

 the retry logic in Hive's concurrency  is not working correctly.
 

 Key: HIVE-2040
 URL: https://issues.apache.org/jira/browse/HIVE-2040
 Project: Hive
  Issue Type: Bug
Reporter: He Yongqiang
Assignee: He Yongqiang



--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] Updated: (HIVE-2040) the retry logic in Hive's concurrency is not working correctly.

2011-03-10 Thread He Yongqiang (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-2040?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

He Yongqiang updated HIVE-2040:
---

Attachment: HIVE-2040.1.patch

 the retry logic in Hive's concurrency  is not working correctly.
 

 Key: HIVE-2040
 URL: https://issues.apache.org/jira/browse/HIVE-2040
 Project: Hive
  Issue Type: Bug
Reporter: He Yongqiang
Assignee: He Yongqiang
 Attachments: HIVE-2040.1.patch




--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] Updated: (HIVE-2040) the retry logic in Hive's concurrency is not working correctly.

2011-03-10 Thread He Yongqiang (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-2040?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

He Yongqiang updated HIVE-2040:
---

Attachment: HIVE-2040.2.patch

change to use a do{}while(); just in case the user set an incorrect retry 
number.

 the retry logic in Hive's concurrency  is not working correctly.
 

 Key: HIVE-2040
 URL: https://issues.apache.org/jira/browse/HIVE-2040
 Project: Hive
  Issue Type: Bug
Reporter: He Yongqiang
Assignee: He Yongqiang
 Attachments: HIVE-2040.1.patch, HIVE-2040.2.patch




--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] Updated: (HIVE-2040) the retry logic in Hive's concurrency is not working correctly.

2011-03-10 Thread He Yongqiang (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-2040?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

He Yongqiang updated HIVE-2040:
---

Status: Patch Available  (was: Open)

 the retry logic in Hive's concurrency  is not working correctly.
 

 Key: HIVE-2040
 URL: https://issues.apache.org/jira/browse/HIVE-2040
 Project: Hive
  Issue Type: Bug
Reporter: He Yongqiang
Assignee: He Yongqiang
 Attachments: HIVE-2040.1.patch, HIVE-2040.2.patch




--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] Commented: (HIVE-2040) the retry logic in Hive's concurrency is not working correctly.

2011-03-10 Thread Namit Jain (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2040?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13005458#comment-13005458
 ] 

Namit Jain commented on HIVE-2040:
--

+1

 the retry logic in Hive's concurrency  is not working correctly.
 

 Key: HIVE-2040
 URL: https://issues.apache.org/jira/browse/HIVE-2040
 Project: Hive
  Issue Type: Bug
Reporter: He Yongqiang
Assignee: He Yongqiang
 Attachments: HIVE-2040.1.patch, HIVE-2040.2.patch




--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] Commented: (HIVE-1694) Accelerate GROUP BY execution using indexes

2011-03-10 Thread John Sichi (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-1694?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13005461#comment-13005461
 ] 

John Sichi commented on HIVE-1694:
--

Hi Prajakta,

Review Board is self-service...you can create yourself an account and then 
follow the steps here:

http://wiki.apache.org/hadoop/Hive/HowToContribute#Review_Process


 Accelerate GROUP BY execution using indexes
 ---

 Key: HIVE-1694
 URL: https://issues.apache.org/jira/browse/HIVE-1694
 Project: Hive
  Issue Type: New Feature
  Components: Indexing, Query Processor
Affects Versions: 0.7.0
Reporter: Nikhil Deshpande
Assignee: Prajakta Kalmegh
 Attachments: HIVE-1694.1.patch.txt, HIVE-1694.2.patch.txt, 
 HIVE-1694.3.patch.txt, HIVE-1694_2010-10-28.diff, demo_q1.hql, demo_q2.hql


 The index building patch (Hive-417) is checked into trunk, this JIRA issue 
 tracks supporting indexes in Hive compiler  execution engine for SELECT 
 queries.
 This is in ref. to John's comment at
 https://issues.apache.org/jira/browse/HIVE-417?focusedCommentId=12884869page=com.atlassian.jira.plugin.system.issuetabpanels%3Acomment-tabpanel#action_12884869
 on creating separate JIRA issue for tracking index usage in optimizer  query 
 execution.
 The aim of this effort is to use indexes to accelerate query execution (for 
 certain class of queries). E.g.
 - Filters and range scans (already being worked on by He Yongqiang as part of 
 HIVE-417?)
 - Joins (index based joins)
 - Group By, Order By and other misc cases
 The proposal is multi-step:
 1. Building index based operators, compiler and execution engine changes
 2. Optimizer enhancements (e.g. cost-based optimizer to compare and choose 
 between index scans, full table scans etc.)
 This JIRA initially focuses on the first step. This JIRA is expected to hold 
 the information about index based plans  operator implementations for above 
 mentioned cases. 

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] Updated: (HIVE-2040) the retry logic in Hive's concurrency is not working correctly.

2011-03-10 Thread Namit Jain (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-2040?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Namit Jain updated HIVE-2040:
-

  Resolution: Fixed
Hadoop Flags: [Reviewed]
  Status: Resolved  (was: Patch Available)

Committed. Thanks Yongqiang

 the retry logic in Hive's concurrency  is not working correctly.
 

 Key: HIVE-2040
 URL: https://issues.apache.org/jira/browse/HIVE-2040
 Project: Hive
  Issue Type: Bug
Reporter: He Yongqiang
Assignee: He Yongqiang
 Attachments: HIVE-2040.1.patch, HIVE-2040.2.patch




--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


Jenkins build is back to normal : Hive-trunk-h0.20 #605

2011-03-10 Thread Apache Hudson Server
See https://hudson.apache.org/hudson/job/Hive-trunk-h0.20/605/changes