[jira] [Commented] (HIVE-8544) Beeline SeparatedValuesOutputFormat (csv/tsv/dsv) wraps the results with extra double quotes

2014-10-21 Thread Vikram Dixit K (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-8544?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14179241#comment-14179241
 ] 

Vikram Dixit K commented on HIVE-8544:
--

+1 for 0.14


 Beeline SeparatedValuesOutputFormat (csv/tsv/dsv) wraps the results with 
 extra double quotes  
 --

 Key: HIVE-8544
 URL: https://issues.apache.org/jira/browse/HIVE-8544
 Project: Hive
  Issue Type: Bug
  Components: CLI, JDBC
Affects Versions: 0.14.0
Reporter: Vaibhav Gumashta
Assignee: Vaibhav Gumashta

 Noticed additional double quotes in select * complex datatypes output  
 while running it through HiveServer2/Beeline
 Actual
 {noformat}
 [18,19]   {col1:alice brown,col2:18}{alice 
 brown:18}
 [18,19]   {col1:alice garcia,col2:18}   {alice 
 garcia:18}
 [18,19]   {col1:alice garcia,col2:18}   {alice 
 garcia:18}
 [18,19]   {col1:alice thompson,col2:18} {alice 
 thompson:18}
 [18,19]   {col1:alice xylophone,col2:18}{alice 
 xylophone:18}
 [18,19]   {col1:bob hernandez,col2:18}  {bob 
 hernandez:18}
 [18,19]   {col1:bob ichabod,col2:18}{bob 
 ichabod:18}
 {noformat}
 Expected
 {noformat}
 [18,19]   {col1:alice brown,col2:18}{alice brown:18}
 [18,19]   {col1:alice garcia,col2:18}   {alice garcia:18}
 [18,19]   {col1:alice garcia,col2:18}   {alice garcia:18}
 [18,19]   {col1:alice thompson,col2:18} {alice thompson:18}
 [18,19]   {col1:alice xylophone,col2:18}{alice xylophone:18}
 [18,19]   {col1:bob hernandez,col2:18}  {bob hernandez:18}
 [18,19]   {col1:bob ichabod,col2:18}{bob ichabod:18}
 {noformat}
 Run these queries through HiveServer2 to reproduce the issue:
 {noformat}
 create table insert_9 (a arrayint, b structcol1:string,col2:int, c 
 mapstring, int);
  insert overwrite table insert_9 select array(age, age+1), struct(name, age), 
 map(name, age) from studenttab10k;
 select * from insert_9;
 {noformat}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HIVE-8390) CBO produces annoying exception message and wraps exceptions too much

2014-10-21 Thread Vikram Dixit K (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-8390?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14179243#comment-14179243
 ] 

Vikram Dixit K commented on HIVE-8390:
--

+1 for 0.14

 CBO produces annoying exception message and wraps exceptions too much
 -

 Key: HIVE-8390
 URL: https://issues.apache.org/jira/browse/HIVE-8390
 Project: Hive
  Issue Type: Bug
  Components: CBO
Reporter: Sergey Shelukhin
Assignee: Sergey Shelukhin
Priority: Minor
 Fix For: 0.15.0

 Attachments: HIVE-8390.01.patch, HIVE-8390.patch


 {noformat}
 java.lang.RuntimeException: java.lang.RuntimeException: 
 java.lang.reflect.UndeclaredThrowableException
   at 
 org.apache.hadoop.hive.ql.parse.SemanticAnalyzer$OptiqBasedPlanner.getOptimizedAST(SemanticAnalyzer.java:12249)
   at 
 org.apache.hadoop.hive.ql.parse.SemanticAnalyzer$OptiqBasedPlanner.access$200(SemanticAnalyzer.java:12223)
   at 
 org.apache.hadoop.hive.ql.parse.SemanticAnalyzer.analyzeInternal(SemanticAnalyzer.java:9904)
   at 
 org.apache.hadoop.hive.ql.parse.BaseSemanticAnalyzer.analyze(BaseSemanticAnalyzer.java:221)
   at 
 org.apache.hadoop.hive.ql.parse.ExplainSemanticAnalyzer.analyzeInternal(ExplainSemanticAnalyzer.java:74)
   at 
 org.apache.hadoop.hive.ql.parse.BaseSemanticAnalyzer.analyze(BaseSemanticAnalyzer.java:221)
   at org.apache.hadoop.hive.ql.Driver.compile(Driver.java:415)
   at org.apache.hadoop.hive.ql.Driver.compile(Driver.java:303)
   at org.apache.hadoop.hive.ql.Driver.compileInternal(Driver.java:1068)
   at org.apache.hadoop.hive.ql.Driver.runInternal(Driver.java:1130)
   at org.apache.hadoop.hive.ql.Driver.run(Driver.java:1005)
   at org.apache.hadoop.hive.ql.Driver.run(Driver.java:995)
   at 
 org.apache.hadoop.hive.cli.CliDriver.processLocalCmd(CliDriver.java:246)
   at org.apache.hadoop.hive.cli.CliDriver.processCmd(CliDriver.java:198)
   at org.apache.hadoop.hive.cli.CliDriver.processLine(CliDriver.java:408)
   at org.apache.hadoop.hive.cli.CliDriver.processLine(CliDriver.java:344)
   at org.apache.hadoop.hive.ql.QTestUtil.executeClient(QTestUtil.java:832)
   at 
 org.apache.hadoop.hive.cli.TestCliDriver.runTest(TestCliDriver.java:136)
   at 
 org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_filter_join_breaktask(TestCliDriver.java:120)
   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
   at 
 sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
   at 
 sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
   at java.lang.reflect.Method.invoke(Method.java:597)
   at junit.framework.TestCase.runTest(TestCase.java:168)
   at junit.framework.TestCase.runBare(TestCase.java:134)
   at junit.framework.TestResult$1.protect(TestResult.java:110)
   at junit.framework.TestResult.runProtected(TestResult.java:128)
   at junit.framework.TestResult.run(TestResult.java:113)
   at junit.framework.TestCase.run(TestCase.java:124)
   at junit.framework.TestSuite.runTest(TestSuite.java:243)
   at junit.framework.TestSuite.run(TestSuite.java:238)
   at 
 org.junit.internal.runners.JUnit38ClassRunner.run(JUnit38ClassRunner.java:83)
   at 
 org.apache.maven.surefire.junit4.JUnit4Provider.execute(JUnit4Provider.java:264)
   at 
 org.apache.maven.surefire.junit4.JUnit4Provider.executeTestSet(JUnit4Provider.java:153)
   at 
 org.apache.maven.surefire.junit4.JUnit4Provider.invoke(JUnit4Provider.java:124)
   at 
 org.apache.maven.surefire.booter.ForkedBooter.invokeProviderInSameClassLoader(ForkedBooter.java:200)
   at 
 org.apache.maven.surefire.booter.ForkedBooter.runSuitesInProcess(ForkedBooter.java:153)
   at 
 org.apache.maven.surefire.booter.ForkedBooter.main(ForkedBooter.java:103)
 Caused by: java.lang.RuntimeException: 
 java.lang.reflect.UndeclaredThrowableException
   at 
 net.hydromatic.optiq.tools.Frameworks.withPrepare(Frameworks.java:153)
   at 
 net.hydromatic.optiq.tools.Frameworks.withPlanner(Frameworks.java:109)
   at 
 org.apache.hadoop.hive.ql.parse.SemanticAnalyzer$OptiqBasedPlanner.getOptimizedAST(SemanticAnalyzer.java:12243)
   ... 37 more
 Caused by: java.lang.reflect.UndeclaredThrowableException
   at com.sun.proxy.$Proxy28.getDistinctRowCount(Unknown Source)
   at 
 org.eigenbase.rel.metadata.RelMetadataQuery.getDistinctRowCount(RelMetadataQuery.java:326)
   at 
 org.eigenbase.rel.rules.LoptOptimizeJoinRule.computeJoinCardinality(LoptOptimizeJoinRule.java:581)
   at 
 org.eigenbase.rel.rules.LoptOptimizeJoinRule.getBestNextFactor(LoptOptimizeJoinRule.java:774)
   at 
 

[jira] [Commented] (HIVE-6934) PartitionPruner doesn't handle top level constant expression correctly

2014-10-21 Thread Vikram Dixit K (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-6934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14179244#comment-14179244
 ] 

Vikram Dixit K commented on HIVE-6934:
--

+1 for 0.14

 PartitionPruner doesn't handle top level constant expression correctly
 --

 Key: HIVE-6934
 URL: https://issues.apache.org/jira/browse/HIVE-6934
 Project: Hive
  Issue Type: Bug
Reporter: Harish Butani
Assignee: Hari Sankar Sivarama Subramaniyan
Priority: Critical
 Fix For: 0.15.0

 Attachments: HIVE-6934.4.patch, HIVE-6934.5.patch, HIVE-6934.6.patch, 
 HIVE-6934.7.patch, HIVE-6934.8.patch, HIVE-6934.9.patch, HIVE-6934.91.patch, 
 HIVE-6934.92.patch


 You hit this error indirectly, because how we handle invalid constant 
 comparisons. Consider:
 {code}
 create table x(key int, value string) partitioned by (dt int, ts string);
 -- both these queries hit this issue
 select * from x where key = 'abc';
 select * from x where dt = 'abc';
 -- the issue is the comparison get converted to the constant false
 -- and the PartitionPruner doesn't handle top level constant exprs corrcetly
 {code}
 Thanks to [~hsubramaniyan] for uncovering this as part of adding tests for 
 HIVE-5376



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HIVE-8551) NPE in FunctionRegistry (affects CBO in negative tests)

2014-10-21 Thread Vikram Dixit K (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-8551?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14179393#comment-14179393
 ] 

Vikram Dixit K commented on HIVE-8551:
--

+1 for 0.14

 NPE in FunctionRegistry (affects CBO in negative tests)
 ---

 Key: HIVE-8551
 URL: https://issues.apache.org/jira/browse/HIVE-8551
 Project: Hive
  Issue Type: Bug
  Components: CBO
Reporter: Sergey Shelukhin
Assignee: Sergey Shelukhin
 Attachments: HIVE-8551.patch






--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HIVE-8186) Self join may fail if one side have virtual column(s) and other doesn't

2014-10-21 Thread Vikram Dixit K (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-8186?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14179394#comment-14179394
 ] 

Vikram Dixit K commented on HIVE-8186:
--

+1 for 0.14

 Self join may fail if one side have virtual column(s) and other doesn't
 ---

 Key: HIVE-8186
 URL: https://issues.apache.org/jira/browse/HIVE-8186
 Project: Hive
  Issue Type: Sub-task
Reporter: Sergey Shelukhin
Assignee: Navis
 Fix For: 0.15.0

 Attachments: HIVE-8186.1.patch.txt, HIVE-8186.2.patch.txt, 
 HIVE-8186.3.patch.txt, HIVE-8186.4.patch.txt, HIVE-8186.5.patch.txt, 
 HIVE-8186.6.patch.txt


 See comments. This also fails on trunk, although not on original join_vc query



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HIVE-8409) SMB joins fail intermittently on tez

2014-10-21 Thread Vikram Dixit K (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-8409?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vikram Dixit K updated HIVE-8409:
-
Attachment: HIVE-8409.9.patch

 SMB joins fail intermittently on tez
 

 Key: HIVE-8409
 URL: https://issues.apache.org/jira/browse/HIVE-8409
 Project: Hive
  Issue Type: Bug
  Components: Tez
Affects Versions: 0.14.0
Reporter: Vikram Dixit K
Assignee: Vikram Dixit K
Priority: Critical
 Fix For: 0.14.0

 Attachments: HIVE-8409.1.patch, HIVE-8409.2.patch, HIVE-8409.3.patch, 
 HIVE-8409.7.patch, HIVE-8409.8.patch, HIVE-8409.9.patch


 Flakiness with regard to SMB joins in tez. TEZ-1647 is required to complete 
 the fix.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HIVE-7914) Simplify join predicates for CBO to avoid cross products

2014-10-20 Thread Vikram Dixit K (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-7914?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14177174#comment-14177174
 ] 

Vikram Dixit K commented on HIVE-7914:
--

+1 for 0.14

 Simplify join predicates for CBO to avoid cross products
 

 Key: HIVE-7914
 URL: https://issues.apache.org/jira/browse/HIVE-7914
 Project: Hive
  Issue Type: Bug
  Components: CBO
Affects Versions: 0.13.1
Reporter: Mostafa Mokhtar
Assignee: Laljo John Pullokkaran
 Fix For: 0.15.0

 Attachments: HIVE-7914.patch


 Simplify join predicates for disjunctive predicates to avoid cross products.
 For TPC-DS query 13 we generate a cross products.
 The join predicate on (store_sales x customer_demographics) ,  (store_sales x 
 household_demographics) and (store_sales x customer_address) can be pull up 
 to avoid the cross products
 {code}
 select avg(ss_quantity)
,avg(ss_ext_sales_price)
,avg(ss_ext_wholesale_cost)
,sum(ss_ext_wholesale_cost)
  from store_sales
  ,store
  ,customer_demographics
  ,household_demographics
  ,customer_address
  ,date_dim
  where store.s_store_sk = store_sales.ss_store_sk
  and  store_sales.ss_sold_date_sk = date_dim.d_date_sk and date_dim.d_year = 
 2001
  and((store_sales.ss_hdemo_sk=household_demographics.hd_demo_sk
   and customer_demographics.cd_demo_sk = store_sales.ss_cdemo_sk
   and customer_demographics.cd_marital_status = 'M'
   and customer_demographics.cd_education_status = '4 yr Degree'
   and store_sales.ss_sales_price between 100.00 and 150.00
   and household_demographics.hd_dep_count = 3   
  )or
  (store_sales.ss_hdemo_sk=household_demographics.hd_demo_sk
   and customer_demographics.cd_demo_sk = store_sales.ss_cdemo_sk
   and customer_demographics.cd_marital_status = 'D'
   and customer_demographics.cd_education_status = 'Primary'
   and store_sales.ss_sales_price between 50.00 and 100.00   
   and household_demographics.hd_dep_count = 1
  ) or 
  (store_sales.ss_hdemo_sk=household_demographics.hd_demo_sk
   and customer_demographics.cd_demo_sk = ss_cdemo_sk
   and customer_demographics.cd_marital_status = 'U'
   and customer_demographics.cd_education_status = 'Advanced Degree'
   and store_sales.ss_sales_price between 150.00 and 200.00 
   and household_demographics.hd_dep_count = 1  
  ))
  and((store_sales.ss_addr_sk = customer_address.ca_address_sk
   and customer_address.ca_country = 'United States'
   and customer_address.ca_state in ('KY', 'GA', 'NM')
   and store_sales.ss_net_profit between 100 and 200  
  ) or
  (store_sales.ss_addr_sk = customer_address.ca_address_sk
   and customer_address.ca_country = 'United States'
   and customer_address.ca_state in ('MT', 'OR', 'IN')
   and store_sales.ss_net_profit between 150 and 300  
  ) or
  (store_sales.ss_addr_sk = customer_address.ca_address_sk
   and customer_address.ca_country = 'United States'
   and customer_address.ca_state in ('WI', 'MO', 'WV')
   and store_sales.ss_net_profit between 50 and 250  
  ))
 ;
 {code}
 This is the plan currently generated without any predicate simplification 
 {code}
 Warning: Map Join MAPJOIN[59][bigTable=?] in task 'Map 8' is a cross product
 Warning: Map Join MAPJOIN[58][bigTable=?] in task 'Map 8' is a cross product
 Warning: Shuffle Join JOIN[29][tables = [$hdt$_5, $hdt$_6]] in Stage 'Reducer 
 2' is a cross product
 OK
 STAGE DEPENDENCIES:
   Stage-1 is a root stage
   Stage-0 depends on stages: Stage-1
 STAGE PLANS:
   Stage: Stage-1
 Tez
   Edges:
 Map 7 - Map 8 (BROADCAST_EDGE)
 Map 8 - Map 5 (BROADCAST_EDGE), Map 6 (BROADCAST_EDGE)
 Reducer 2 - Map 1 (SIMPLE_EDGE), Map 4 (BROADCAST_EDGE), Map 7 
 (SIMPLE_EDGE)
 Reducer 3 - Reducer 2 (SIMPLE_EDGE)
   DagName: mmokhtar_20140828155050_7059c24b-501b-4683-86c0-4f3c023f0b0e:1
   Vertices:
 Map 1 
 Map Operator Tree:
 TableScan
   alias: customer_address
   Statistics: Num rows: 4000 Data size: 40595195284 Basic 
 stats: COMPLETE Column stats: NONE
   Select Operator
 expressions: ca_address_sk (type: int), ca_state (type: 
 string), ca_country (type: string)
 outputColumnNames: _col0, _col1, _col2
 Statistics: Num rows: 4000 Data size: 40595195284 
 Basic stats: COMPLETE Column stats: NONE
 Reduce Output Operator
   sort order: 
   Statistics: Num rows: 4000 Data size: 40595195284 
 Basic stats: COMPLETE Column stats: NONE
   value expressions: _col0 (type: int), _col1 (type: 
 string), _col2 (type: string)
 Execution mode: 

[jira] [Commented] (HIVE-8387) add retry logic to ZooKeeperStorage in WebHCat

2014-10-20 Thread Vikram Dixit K (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-8387?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14177485#comment-14177485
 ] 

Vikram Dixit K commented on HIVE-8387:
--

+1 for 0.14

 add retry logic to ZooKeeperStorage in WebHCat
 --

 Key: HIVE-8387
 URL: https://issues.apache.org/jira/browse/HIVE-8387
 Project: Hive
  Issue Type: Bug
  Components: WebHCat
Affects Versions: 0.14.0
Reporter: Eugene Koifman
Assignee: Eugene Koifman
Priority: Critical
 Fix For: 0.15.0

 Attachments: HIVE-8387.2.patch, HIVE-8387.patch


 ZK interactions may run into transient errors that should be retried.  
 Currently there is no retry logic in WebHCat for this.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HIVE-8492) Enhance Constant Folding to propagate constants for simple expressions

2014-10-20 Thread Vikram Dixit K (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-8492?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14177489#comment-14177489
 ] 

Vikram Dixit K commented on HIVE-8492:
--

+1 for 0.14

 Enhance Constant Folding to propagate constants for simple expressions 
 ---

 Key: HIVE-8492
 URL: https://issues.apache.org/jira/browse/HIVE-8492
 Project: Hive
  Issue Type: Improvement
  Components: Logical Optimizer
Affects Versions: 0.14.0
Reporter: Ashutosh Chauhan
Assignee: Ashutosh Chauhan
 Fix For: 0.15.0

 Attachments: HIVE-8492.1.patch, HIVE-8492.patch


 currently it propagates only for projects, can be enhanced to propagate for 
 simple expressions like casts, e.g,
 {code}
 SELECT src1.key, src1.key + 1, src2.value FROM src src1 join src src2 ON 
 src1.key = src2.key AND cast(src1.key as double) = 86; 
 {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HIVE-8350) Constant folding should happen before group-by optimization

2014-10-20 Thread Vikram Dixit K (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-8350?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14177493#comment-14177493
 ] 

Vikram Dixit K commented on HIVE-8350:
--

+1 for 0.14

 Constant folding should happen before group-by optimization
 ---

 Key: HIVE-8350
 URL: https://issues.apache.org/jira/browse/HIVE-8350
 Project: Hive
  Issue Type: Improvement
  Components: Logical Optimizer
Affects Versions: 0.14.0
Reporter: Ashutosh Chauhan
Assignee: Ashutosh Chauhan
 Fix For: 0.15.0

 Attachments: HIVE-8350.1.patch, HIVE-8350.patch


 Constant folding should happen as early as possible, so later optimizations 
 can take advantages of it.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HIVE-8358) Constant folding should happen before PCR

2014-10-20 Thread Vikram Dixit K (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-8358?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14177490#comment-14177490
 ] 

Vikram Dixit K commented on HIVE-8358:
--

+1 for 0.14

 Constant folding should happen before PCR
 -

 Key: HIVE-8358
 URL: https://issues.apache.org/jira/browse/HIVE-8358
 Project: Hive
  Issue Type: Improvement
  Components: Logical Optimizer
Affects Versions: 0.14.0
Reporter: Ashutosh Chauhan
Assignee: Ashutosh Chauhan
 Fix For: 0.15.0

 Attachments: HIVE-8358.1.patch, HIVE-8358.2.patch, HIVE-8358.3.patch, 
 HIVE-8358.4.patch, HIVE-8358.patch


 So, that partition pruning and transitive predicate propagation may take 
 advantage of constant folding.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HIVE-8497) StatsNoJobTask doesn't close RecordReader, FSDataInputStream of which keeps open to prevent stale data clean

2014-10-20 Thread Vikram Dixit K (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-8497?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14177512#comment-14177512
 ] 

Vikram Dixit K commented on HIVE-8497:
--

+1 for 0.14

 StatsNoJobTask doesn't close RecordReader, FSDataInputStream of which keeps 
 open to prevent stale data clean
 

 Key: HIVE-8497
 URL: https://issues.apache.org/jira/browse/HIVE-8497
 Project: Hive
  Issue Type: Bug
Affects Versions: 0.14.0
 Environment: Windows
Reporter: Xiaobing Zhou
Assignee: Xiaobing Zhou
Priority: Critical
 Attachments: HIVE-8497.1.patch, HIVE-8497.2.patch


 run the test
 {noformat}
 mvn -Phadoop-2  test -Dtest=TestCliDriver -Dqfile=alter_merge_stats_orc.q
 {noformat}
 to reproduce it. Simply, this query does three data loads which generates 
 three base orc files.
 ANALYZE TABLE...COMPUTE STATISTICS NOSCAN will execute StatsNoJobTask to get 
 stats, where file handle is held so as not able to clean base file. As a 
 result, after running ALTER TABLE..CONCATENATE, follow-up queries go to stale 
 base file and merged file.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HIVE-8501) Fix CBO to use indexes when GenericUDFBridge is applied

2014-10-20 Thread Vikram Dixit K (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-8501?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14177709#comment-14177709
 ] 

Vikram Dixit K commented on HIVE-8501:
--

+1 for 0.14

 Fix CBO to use indexes when GenericUDFBridge is applied 
 

 Key: HIVE-8501
 URL: https://issues.apache.org/jira/browse/HIVE-8501
 Project: Hive
  Issue Type: Improvement
Reporter: Pengcheng Xiong
Assignee: Pengcheng Xiong
Priority: Minor
 Fix For: 0.15.0

 Attachments: HIVE-8501.1.patch, HIVE-8501.2.patch, HIVE-8501.3.patch, 
 HIVE-8501.4.patch


 previous https://issues.apache.org/jira/browse/HIVE-8389 assumes that
 we have predicate: ((UDFToDouble(key)  UDFToDouble(80)) and 
 (UDFToDouble(key)  UDFToDouble(100))) for example.
 This does not work for the case when we have predicate: ((UDFToDouble(key)  
 80.0) and (UDFToDouble(key)  100.0))



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HIVE-8479) Tez sessions cannot change queues once assigned to one within a CLI session

2014-10-20 Thread Vikram Dixit K (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-8479?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vikram Dixit K updated HIVE-8479:
-
Status: Patch Available  (was: Open)

 Tez sessions cannot change queues once assigned to one within a CLI session
 ---

 Key: HIVE-8479
 URL: https://issues.apache.org/jira/browse/HIVE-8479
 Project: Hive
  Issue Type: Bug
  Components: Tez
Affects Versions: 0.14.0
Reporter: Vikram Dixit K
Assignee: Vikram Dixit K
Priority: Critical
 Fix For: 0.14.0

 Attachments: HIVE-8479.1.patch, HIVE-8479.2.patch, HIVE-8479.3.patch


 Observed during execution. Once associated with a queue, the CLI session 
 cannot change queues.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HIVE-8479) Tez sessions cannot change queues once assigned to one within a CLI session

2014-10-20 Thread Vikram Dixit K (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-8479?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vikram Dixit K updated HIVE-8479:
-
Status: Open  (was: Patch Available)

 Tez sessions cannot change queues once assigned to one within a CLI session
 ---

 Key: HIVE-8479
 URL: https://issues.apache.org/jira/browse/HIVE-8479
 Project: Hive
  Issue Type: Bug
  Components: Tez
Affects Versions: 0.14.0
Reporter: Vikram Dixit K
Assignee: Vikram Dixit K
Priority: Critical
 Fix For: 0.14.0

 Attachments: HIVE-8479.1.patch, HIVE-8479.2.patch, HIVE-8479.3.patch


 Observed during execution. Once associated with a queue, the CLI session 
 cannot change queues.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HIVE-8479) Tez sessions cannot change queues once assigned to one within a CLI session

2014-10-20 Thread Vikram Dixit K (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-8479?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vikram Dixit K updated HIVE-8479:
-
Attachment: HIVE-8479.3.patch

Uploading again to trigger run of unit tests.

 Tez sessions cannot change queues once assigned to one within a CLI session
 ---

 Key: HIVE-8479
 URL: https://issues.apache.org/jira/browse/HIVE-8479
 Project: Hive
  Issue Type: Bug
  Components: Tez
Affects Versions: 0.14.0
Reporter: Vikram Dixit K
Assignee: Vikram Dixit K
Priority: Critical
 Fix For: 0.14.0

 Attachments: HIVE-8479.1.patch, HIVE-8479.2.patch, HIVE-8479.3.patch


 Observed during execution. Once associated with a queue, the CLI session 
 cannot change queues.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HIVE-8479) Tez sessions cannot change queues once assigned to one within a CLI session

2014-10-20 Thread Vikram Dixit K (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-8479?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vikram Dixit K updated HIVE-8479:
-
Component/s: (was: Tez)

 Tez sessions cannot change queues once assigned to one within a CLI session
 ---

 Key: HIVE-8479
 URL: https://issues.apache.org/jira/browse/HIVE-8479
 Project: Hive
  Issue Type: Bug
Affects Versions: 0.14.0
Reporter: Vikram Dixit K
Assignee: Vikram Dixit K
Priority: Critical
 Fix For: 0.14.0

 Attachments: HIVE-8479.1.patch, HIVE-8479.2.patch, HIVE-8479.3.patch


 Observed during execution. Once associated with a queue, the CLI session 
 cannot change queues.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HIVE-8409) SMB joins fail intermittently on tez

2014-10-20 Thread Vikram Dixit K (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-8409?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vikram Dixit K updated HIVE-8409:
-
Attachment: HIVE-8409.7.patch

Address review comments.

 SMB joins fail intermittently on tez
 

 Key: HIVE-8409
 URL: https://issues.apache.org/jira/browse/HIVE-8409
 Project: Hive
  Issue Type: Bug
  Components: Tez
Affects Versions: 0.14.0
Reporter: Vikram Dixit K
Assignee: Vikram Dixit K
Priority: Critical
 Fix For: 0.14.0

 Attachments: HIVE-8409.1.patch, HIVE-8409.2.patch, HIVE-8409.3.patch, 
 HIVE-8409.7.patch


 Flakiness with regard to SMB joins in tez. TEZ-1647 is required to complete 
 the fix.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HIVE-8409) SMB joins fail intermittently on tez

2014-10-20 Thread Vikram Dixit K (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-8409?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vikram Dixit K updated HIVE-8409:
-
Attachment: HIVE-8409.8.patch

 SMB joins fail intermittently on tez
 

 Key: HIVE-8409
 URL: https://issues.apache.org/jira/browse/HIVE-8409
 Project: Hive
  Issue Type: Bug
  Components: Tez
Affects Versions: 0.14.0
Reporter: Vikram Dixit K
Assignee: Vikram Dixit K
Priority: Critical
 Fix For: 0.14.0

 Attachments: HIVE-8409.1.patch, HIVE-8409.2.patch, HIVE-8409.3.patch, 
 HIVE-8409.7.patch, HIVE-8409.8.patch


 Flakiness with regard to SMB joins in tez. TEZ-1647 is required to complete 
 the fix.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HIVE-8349) DISTRIBUTE BY should work with tez auto-parallelism enabled

2014-10-17 Thread Vikram Dixit K (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-8349?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14175368#comment-14175368
 ] 

Vikram Dixit K commented on HIVE-8349:
--

+1 for 0.14

 DISTRIBUTE BY should work with tez auto-parallelism enabled
 ---

 Key: HIVE-8349
 URL: https://issues.apache.org/jira/browse/HIVE-8349
 Project: Hive
  Issue Type: Bug
  Components: Physical Optimizer, Tez
Affects Versions: 0.14.0
Reporter: Gopal V
Assignee: Gopal V
 Fix For: 0.14.0

 Attachments: HIVE-8349.1.patch, HIVE-8349.2.patch, HIVE-8349.3.patch, 
 HIVE-8349.4.patch


 Current implementation of DISTRIBUTE BY does not work when tez 
 auto-parallelism is turned on, because of hashCode distribution issues.
 In case of distribute by, the key is actually zero bytes, with only 
 partitioning enabled via hashCode - this adversely affects the uniform 
 hashing implementation.
 In an ideal scenario, the edge should go from the ordered kv input to the 
 unordered partitioned edge, to speed up the processing massively.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HIVE-8489) Add sanity check to dynamic partition pruning

2014-10-17 Thread Vikram Dixit K (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-8489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14175638#comment-14175638
 ] 

Vikram Dixit K commented on HIVE-8489:
--

+1 LGTM. +1 for 0.14 as well.

 Add sanity check to dynamic partition pruning
 -

 Key: HIVE-8489
 URL: https://issues.apache.org/jira/browse/HIVE-8489
 Project: Hive
  Issue Type: Bug
Reporter: Gunther Hagleitner
Assignee: Gunther Hagleitner
 Fix For: 0.14.0

 Attachments: HIVE-8489.1.patch


 For debugging/safety it would help to make sure we received the right number 
 of events from tez tasks. This is always supposed to be the case, but would 
 be really hard to debug if something goes wrong.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HIVE-8452) Cleanup handling of resource configuration for tez

2014-10-17 Thread Vikram Dixit K (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-8452?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14175640#comment-14175640
 ] 

Vikram Dixit K commented on HIVE-8452:
--

+1 LGTM. +1 for 0.14 as well.

 Cleanup handling of resource configuration for tez
 --

 Key: HIVE-8452
 URL: https://issues.apache.org/jira/browse/HIVE-8452
 Project: Hive
  Issue Type: Bug
Reporter: Gunther Hagleitner
Assignee: Gunther Hagleitner
 Fix For: 0.14.0

 Attachments: HIVE-8452.1.patch


 Currently there's a way to configure container size for tez as well as java 
 opts to start containers.
 However, if you only set container size and not java opts, we fall back to MR 
 settings for java opts which will likely fail (killed by yarn).
 vcores is always taken from the MR settings. Need to have a way to overwrite 
 that too.
 Tez 0.5 and onwards will automatically generate the right java opts string 
 when the container/cpus are specified. This is preferable to falling back to 
 MR.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HIVE-8478) Vectorized Reduce-Side Group By doesn't handle Decimal type correctly

2014-10-17 Thread Vikram Dixit K (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-8478?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14175655#comment-14175655
 ] 

Vikram Dixit K commented on HIVE-8478:
--

+1 for 0.14 as well.

 Vectorized Reduce-Side Group By doesn't handle Decimal type correctly
 -

 Key: HIVE-8478
 URL: https://issues.apache.org/jira/browse/HIVE-8478
 Project: Hive
  Issue Type: Bug
  Components: Vectorization
Affects Versions: 0.14.0
Reporter: Matt McCline
Assignee: Matt McCline
Priority: Critical
 Fix For: 0.14.0

 Attachments: HIVE-8478.01.patch, HIVE-8478.02.patch, 
 HIVE-8478.03.patch, HIVE-8478.04.patch


 Note that DecimalColumnVector is different than LongColumnVector because it 
 keeps (an instance) reference to a Decimal128 class whereas the latter stores 
 a long primitive value. So, trouble if you set the reference instead of 
 updating the object.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HIVE-8428) PCR doesnt remove filters involving casts

2014-10-16 Thread Vikram Dixit K (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-8428?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14174122#comment-14174122
 ] 

Vikram Dixit K commented on HIVE-8428:
--

+1 for 0.14

 PCR doesnt remove filters involving casts
 -

 Key: HIVE-8428
 URL: https://issues.apache.org/jira/browse/HIVE-8428
 Project: Hive
  Issue Type: Improvement
  Components: Logical Optimizer
Affects Versions: 0.11.0, 0.12.0, 0.13.0
Reporter: Ashutosh Chauhan
Assignee: Ashutosh Chauhan
 Fix For: 0.15.0

 Attachments: HIVE-8428.1.patch, HIVE-8428.2.patch, HIVE-8428.3.patch, 
 HIVE-8428.patch


 e.g.,
 select key,value from srcpart where hr = cast(11 as double);



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HIVE-8476) JavaDoc updates to HiveEndPoint.newConnection() for secure streaming with Kerberos

2014-10-16 Thread Vikram Dixit K (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-8476?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14174595#comment-14174595
 ] 

Vikram Dixit K commented on HIVE-8476:
--

+1 for 0.14

 JavaDoc updates to HiveEndPoint.newConnection() for secure streaming with 
 Kerberos
 --

 Key: HIVE-8476
 URL: https://issues.apache.org/jira/browse/HIVE-8476
 Project: Hive
  Issue Type: Bug
  Components: HCatalog
Affects Versions: 0.13.1
Reporter: Roshan Naik
Assignee: Roshan Naik
 Fix For: 0.14.0

 Attachments: HIVE-8476.patch


 Add additional notes on using kerberos authenticated streaming connection in 
 HiveEndPoint.newConnection() method



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HIVE-8464) Vectorized reducer nested group by query returns wrong results

2014-10-16 Thread Vikram Dixit K (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-8464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14174596#comment-14174596
 ] 

Vikram Dixit K commented on HIVE-8464:
--

+1 for 0.14

 Vectorized reducer nested group by query returns wrong results
 --

 Key: HIVE-8464
 URL: https://issues.apache.org/jira/browse/HIVE-8464
 Project: Hive
  Issue Type: Bug
Affects Versions: 0.14.0
Reporter: Matt McCline
Assignee: Matt McCline
Priority: Critical
 Fix For: 0.14.0

 Attachments: HIVE-8464.01.patch


 Query has 2 Group By operators in reduce-stage and VectorGroupByOperator can 
 only handle one Group By operator being fed key batches by reduce-shuffle.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HIVE-8460) ORC SARG literal creation for double from float may lead to wrong evaluation of SARG

2014-10-16 Thread Vikram Dixit K (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-8460?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14174597#comment-14174597
 ] 

Vikram Dixit K commented on HIVE-8460:
--

+1 for 0.14

 ORC SARG literal creation for double from float may lead to wrong evaluation 
 of SARG
 

 Key: HIVE-8460
 URL: https://issues.apache.org/jira/browse/HIVE-8460
 Project: Hive
  Issue Type: Bug
  Components: Query Processor
Affects Versions: 0.13.0, 0.13.1
Reporter: Ashutosh Chauhan
Assignee: Prasanth J
 Attachments: HIVE-8460.1.patch, HIVE-8460.2.patch


 e.g., expression like d = 0.22 where 0.22 is of float type, gets converted to 
 double d = 0.21986 in SARG creation. This will cause erroneous evaluation 
 of SARG expression.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HIVE-8462) CBO duplicates columns

2014-10-16 Thread Vikram Dixit K (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-8462?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14174627#comment-14174627
 ] 

Vikram Dixit K commented on HIVE-8462:
--

+1 for 0.14

 CBO duplicates columns
 --

 Key: HIVE-8462
 URL: https://issues.apache.org/jira/browse/HIVE-8462
 Project: Hive
  Issue Type: Bug
  Components: CBO
Reporter: Sergey Shelukhin
Assignee: Sergey Shelukhin
Priority: Critical
 Fix For: 0.15.0

 Attachments: HIVE-8462.01.patch, HIVE-8462.patch


 {noformat}
 select *, rank() over(partition by key order by value) as rr from src1
 {noformat}
 Original plan appears to be incorrect:
 {noformat}
 HiveProjectRel(key=[$0], value=[$1], (tok_function rank (tok_windowspec 
 (tok_partitioningspec (tok_distributeby (tok_table_or_col key)) (tok_orderby 
 (tok_tabsortcolnameasc (tok_table_or_col value))=[$5], rr=[$5])
   HiveProjectRel(key=[$0], value=[$1], block__offset__inside__file=[$2], 
 input__file__name=[$3], row__id=[$4], (tok_function rank (tok_windowspec 
 (tok_partitioningspec (tok_distributeby (tok_table_or_col key)) (tok_orderby 
 (tok_tabsortcolnameasc (tok_table_or_col value))=[rank() OVER (PARTITION 
 BY $0 ORDER BY $1 ROWS BETWEEN 2147483647 FOLLOWING AND 2147483647 
 PRECEDING)])
 HiveTableScanRel(table=[[default.src1]])
 {noformat}
 and final AST has {noformat}
  TOK_SELEXPR
 .
TOK_TABLE_OR_COL
   $hdt$_0
(tok_function rank (tok_windowspec (tok_partitioningspec 
 (tok_distributeby (tok_table_or_col key)) (tok_orderby (tok_tabsortcolnameasc 
 (tok_table_or_col value))
 (tok_function rank (tok_windowspec (tok_partitioningspec 
 (tok_distributeby (tok_table_or_col key)) (tok_orderby (tok_tabsortcolnameasc 
 (tok_table_or_col value))
  TOK_SELEXPR
 .
TOK_TABLE_OR_COL
   $hdt$_0
(tok_function rank (tok_windowspec (tok_partitioningspec 
 (tok_distributeby (tok_table_or_col key)) (tok_orderby (tok_tabsortcolnameasc 
 (tok_table_or_col value))
 rr
 {noformat}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HIVE-7733) Ambiguous column reference error on query

2014-10-15 Thread Vikram Dixit K (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-7733?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14173031#comment-14173031
 ] 

Vikram Dixit K commented on HIVE-7733:
--

+1 for 0.14

 Ambiguous column reference error on query
 -

 Key: HIVE-7733
 URL: https://issues.apache.org/jira/browse/HIVE-7733
 Project: Hive
  Issue Type: Bug
Affects Versions: 0.13.0
Reporter: Jason Dere
Assignee: Navis
 Fix For: 0.15.0

 Attachments: HIVE-7733.1.patch.txt, HIVE-7733.2.patch.txt, 
 HIVE-7733.3.patch.txt, HIVE-7733.4.patch.txt, HIVE-7733.5.patch.txt, 
 HIVE-7733.6.patch.txt, HIVE-7733.7.patch.txt


 {noformat}
 CREATE TABLE agg1 
   ( 
  col0 INT, 
  col1 STRING, 
  col2 DOUBLE 
   ); 
 explain SELECT single_use_subq11.a1 AS a1, 
single_use_subq11.a2 AS a2 
 FROM   (SELECT Sum(agg1.col2) AS a1 
 FROM   agg1 
 GROUP  BY agg1.col0) single_use_subq12 
JOIN (SELECT alias.a2 AS a0, 
 alias.a1 AS a1, 
 alias.a1 AS a2 
  FROM   (SELECT agg1.col1 AS a0, 
 '42'  AS a1, 
 agg1.col0 AS a2 
  FROM   agg1 
  UNION ALL 
  SELECT agg1.col1 AS a0, 
 '41'  AS a1, 
 agg1.col0 AS a2 
  FROM   agg1) alias 
  GROUP  BY alias.a2, 
alias.a1) single_use_subq11 
  ON ( single_use_subq11.a0 = single_use_subq11.a0 );
 {noformat}
 Gets the following error:
 FAILED: SemanticException [Error 10007]: Ambiguous column reference a2
 Looks like this query had been working in 0.12 but starting failing with this 
 error in 0.13



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HIVE-6715) Hive JDBC should include username into open session request for non-sasl connection

2014-10-15 Thread Vikram Dixit K (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-6715?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14173033#comment-14173033
 ] 

Vikram Dixit K commented on HIVE-6715:
--

+1 for 0.14

 Hive JDBC should include username into open session request for non-sasl 
 connection
 ---

 Key: HIVE-6715
 URL: https://issues.apache.org/jira/browse/HIVE-6715
 Project: Hive
  Issue Type: Bug
  Components: JDBC
Reporter: Srinath
Assignee: Prasad Mujumdar
Priority: Critical
 Fix For: 0.14.0

 Attachments: HIVE-6715.1.patch, HIVE-6715.2.patch, HIVE-6715.3.patch


 The only parameter from sessVars that's being set in 
 HiveConnection.openSession() is HS2_PROXY_USER. 
 HIVE_AUTH_USER must also be set.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HIVE-8479) Tez sessions cannot change queues once assigned to one within a CLI session

2014-10-15 Thread Vikram Dixit K (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-8479?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vikram Dixit K updated HIVE-8479:
-
Attachment: HIVE-8479.1.patch

 Tez sessions cannot change queues once assigned to one within a CLI session
 ---

 Key: HIVE-8479
 URL: https://issues.apache.org/jira/browse/HIVE-8479
 Project: Hive
  Issue Type: Bug
  Components: Tez
Affects Versions: 0.14.0
Reporter: Vikram Dixit K
Assignee: Vikram Dixit K
Priority: Critical
 Attachments: HIVE-8479.1.patch


 Observed during execution. Once associated with a queue, the CLI session 
 cannot change queues.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HIVE-8479) Tez sessions cannot change queues once assigned to one within a CLI session

2014-10-15 Thread Vikram Dixit K (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-8479?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vikram Dixit K updated HIVE-8479:
-
Status: Patch Available  (was: Open)

 Tez sessions cannot change queues once assigned to one within a CLI session
 ---

 Key: HIVE-8479
 URL: https://issues.apache.org/jira/browse/HIVE-8479
 Project: Hive
  Issue Type: Bug
  Components: Tez
Affects Versions: 0.14.0
Reporter: Vikram Dixit K
Assignee: Vikram Dixit K
Priority: Critical
 Attachments: HIVE-8479.1.patch


 Observed during execution. Once associated with a queue, the CLI session 
 cannot change queues.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (HIVE-8479) Tez sessions cannot change queues once assigned to one within a CLI session

2014-10-15 Thread Vikram Dixit K (JIRA)
Vikram Dixit K created HIVE-8479:


 Summary: Tez sessions cannot change queues once assigned to one 
within a CLI session
 Key: HIVE-8479
 URL: https://issues.apache.org/jira/browse/HIVE-8479
 Project: Hive
  Issue Type: Bug
  Components: Tez
Affects Versions: 0.14.0
Reporter: Vikram Dixit K
Assignee: Vikram Dixit K
Priority: Critical
 Attachments: HIVE-8479.1.patch

Observed during execution. Once associated with a queue, the CLI session cannot 
change queues.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HIVE-8479) Tez sessions cannot change queues once assigned to one within a CLI session

2014-10-15 Thread Vikram Dixit K (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-8479?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vikram Dixit K updated HIVE-8479:
-
Attachment: HIVE-8479.2.patch

 Tez sessions cannot change queues once assigned to one within a CLI session
 ---

 Key: HIVE-8479
 URL: https://issues.apache.org/jira/browse/HIVE-8479
 Project: Hive
  Issue Type: Bug
  Components: Tez
Affects Versions: 0.14.0
Reporter: Vikram Dixit K
Assignee: Vikram Dixit K
Priority: Critical
 Fix For: 0.14.0

 Attachments: HIVE-8479.1.patch, HIVE-8479.2.patch


 Observed during execution. Once associated with a queue, the CLI session 
 cannot change queues.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HIVE-8415) Vectorized comparison of timestamp and integer needs to treat integer as seconds since epoch

2014-10-14 Thread Vikram Dixit K (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-8415?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14171547#comment-14171547
 ] 

Vikram Dixit K commented on HIVE-8415:
--

+1 for 0.14

 Vectorized comparison of timestamp and integer needs to treat integer as 
 seconds since epoch
 

 Key: HIVE-8415
 URL: https://issues.apache.org/jira/browse/HIVE-8415
 Project: Hive
  Issue Type: Bug
  Components: Vectorization
Affects Versions: 0.14.0
Reporter: Matt McCline
Assignee: Matt McCline
Priority: Critical
 Fix For: 0.14.0

 Attachments: HIVE-8415.01.patch, HIVE-8415.02.patch, 
 HIVE-8415.03.patch, HIVE-8415.04.patch, HIVE-8415.05.patch, HIVE-8415.06.patch


 Current vectorization code treats integers as nanoseconds which doesn't match 
 non-vectorized Hive behavior that interprets integers and doubles as seconds 
 since epoch.
 Currently, the patch includes Jason's changes from patch #2 in HIVE-8391.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HIVE-8389) Fix CBO when indexes are used

2014-10-14 Thread Vikram Dixit K (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-8389?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14171907#comment-14171907
 ] 

Vikram Dixit K commented on HIVE-8389:
--

+1 for 0.14

 Fix CBO when indexes are used
 -

 Key: HIVE-8389
 URL: https://issues.apache.org/jira/browse/HIVE-8389
 Project: Hive
  Issue Type: Bug
Reporter: Pengcheng Xiong
Assignee: Pengcheng Xiong
 Fix For: 0.15.0

 Attachments: HIVE-8389.1.patch, HIVE-8389.2.patch, HIVE-8389.3.patch


 Disable CBO when indexes are used



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HIVE-8413) [CBO] Handle ill-formed queries which have distinct, having in incorrect context

2014-10-13 Thread Vikram Dixit K (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-8413?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14169538#comment-14169538
 ] 

Vikram Dixit K commented on HIVE-8413:
--

+1 for 0.14

 [CBO] Handle ill-formed queries which have distinct, having in incorrect 
 context 
 -

 Key: HIVE-8413
 URL: https://issues.apache.org/jira/browse/HIVE-8413
 Project: Hive
  Issue Type: Bug
  Components: CBO
Affects Versions: 0.14.0
Reporter: Ashutosh Chauhan
Assignee: Ashutosh Chauhan
 Fix For: 0.15.0

 Attachments: HIVE-8413.1.patch, HIVE-8413.2.patch, HIVE-8413.patch


 e.g., select hash (distinct key) from src;



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HIVE-8421) [CBO] Use OptiqSemanticException in error conditions

2014-10-13 Thread Vikram Dixit K (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-8421?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14169546#comment-14169546
 ] 

Vikram Dixit K commented on HIVE-8421:
--

+1 for 0.14

 [CBO] Use OptiqSemanticException in error conditions
 

 Key: HIVE-8421
 URL: https://issues.apache.org/jira/browse/HIVE-8421
 Project: Hive
  Issue Type: Bug
  Components: CBO
Affects Versions: 0.14.0
Reporter: Ashutosh Chauhan
Assignee: Ashutosh Chauhan
 Fix For: 0.15.0

 Attachments: HIVE-8421.patch


 TestNegativeCliDriver



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HIVE-8443) Disable tez_smb_1 for mapreduce and prevent from test hang

2014-10-13 Thread Vikram Dixit K (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-8443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vikram Dixit K updated HIVE-8443:
-
Attachment: HIVE-8443.1.patch

 Disable tez_smb_1 for mapreduce and prevent from test hang
 --

 Key: HIVE-8443
 URL: https://issues.apache.org/jira/browse/HIVE-8443
 Project: Hive
  Issue Type: Bug
  Components: Tests
Affects Versions: 0.15.0
Reporter: Vikram Dixit K
Assignee: Vikram Dixit K
Priority: Critical
 Attachments: HIVE-8443.1.patch






--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (HIVE-8443) Disable tez_smb_1 for mapreduce and prevent from test hang

2014-10-13 Thread Vikram Dixit K (JIRA)
Vikram Dixit K created HIVE-8443:


 Summary: Disable tez_smb_1 for mapreduce and prevent from test hang
 Key: HIVE-8443
 URL: https://issues.apache.org/jira/browse/HIVE-8443
 Project: Hive
  Issue Type: Bug
  Components: Tests
Affects Versions: 0.15.0
Reporter: Vikram Dixit K
Assignee: Vikram Dixit K
Priority: Critical
 Attachments: HIVE-8443.1.patch





--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HIVE-8443) Disable tez_smb_1 for mapreduce and prevent from test hang

2014-10-13 Thread Vikram Dixit K (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-8443?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14169621#comment-14169621
 ] 

Vikram Dixit K commented on HIVE-8443:
--

[~jdere] Can you take a look?

 Disable tez_smb_1 for mapreduce and prevent from test hang
 --

 Key: HIVE-8443
 URL: https://issues.apache.org/jira/browse/HIVE-8443
 Project: Hive
  Issue Type: Bug
  Components: Tests
Affects Versions: 0.15.0
Reporter: Vikram Dixit K
Assignee: Vikram Dixit K
Priority: Critical
 Attachments: HIVE-8443.1.patch






--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HIVE-8443) Disable tez_smb_1 for mapreduce and prevent from test hang

2014-10-13 Thread Vikram Dixit K (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-8443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vikram Dixit K updated HIVE-8443:
-
Status: Patch Available  (was: Open)

 Disable tez_smb_1 for mapreduce and prevent from test hang
 --

 Key: HIVE-8443
 URL: https://issues.apache.org/jira/browse/HIVE-8443
 Project: Hive
  Issue Type: Bug
  Components: Tests
Affects Versions: 0.15.0
Reporter: Vikram Dixit K
Assignee: Vikram Dixit K
Priority: Critical
 Attachments: HIVE-8443.1.patch, HIVE-8443.2.patch






--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HIVE-8443) Disable tez_smb_1 for mapreduce and prevent from test hang

2014-10-13 Thread Vikram Dixit K (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-8443?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14169733#comment-14169733
 ] 

Vikram Dixit K commented on HIVE-8443:
--

Updated.

 Disable tez_smb_1 for mapreduce and prevent from test hang
 --

 Key: HIVE-8443
 URL: https://issues.apache.org/jira/browse/HIVE-8443
 Project: Hive
  Issue Type: Bug
  Components: Tests
Affects Versions: 0.15.0
Reporter: Vikram Dixit K
Assignee: Vikram Dixit K
Priority: Critical
 Attachments: HIVE-8443.1.patch, HIVE-8443.2.patch






--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HIVE-8443) Disable tez_smb_1 for mapreduce and prevent from test hang

2014-10-13 Thread Vikram Dixit K (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-8443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vikram Dixit K updated HIVE-8443:
-
Attachment: HIVE-8443.2.patch

 Disable tez_smb_1 for mapreduce and prevent from test hang
 --

 Key: HIVE-8443
 URL: https://issues.apache.org/jira/browse/HIVE-8443
 Project: Hive
  Issue Type: Bug
  Components: Tests
Affects Versions: 0.15.0
Reporter: Vikram Dixit K
Assignee: Vikram Dixit K
Priority: Critical
 Attachments: HIVE-8443.1.patch, HIVE-8443.2.patch






--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HIVE-8401) OrcFileMergeOperator only close last orc file it opened, which resulted in stale data in table directory

2014-10-13 Thread Vikram Dixit K (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-8401?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14170010#comment-14170010
 ] 

Vikram Dixit K commented on HIVE-8401:
--

+1 for 0.14

 OrcFileMergeOperator only close last orc file it opened, which resulted in 
 stale data in table directory
 

 Key: HIVE-8401
 URL: https://issues.apache.org/jira/browse/HIVE-8401
 Project: Hive
  Issue Type: Bug
Affects Versions: 0.14.0
 Environment: Windows Server
Reporter: Xiaobing Zhou
Assignee: Xiaobing Zhou
Priority: Critical
 Attachments: HIVE-8401.1.patch, alter_merge_2_orc.q.out


 run the test
 {noformat}
 mvn -Phadoop-2  test -Dtest=TestCliDriver -Dqfile=alter_merge_2_orc.q
 {noformat}
 to reproduce it. Simply, this query does three data loads which generates 
 three orc files, ALTER TABLE CONCATENATE tries to merge orc pieces into a 
 single one which is final file to queried.
 Output 
 \hive\itests\qtest\target\qfile-results\clientpositive\alter_merge_2_orc.q.out
  shows # records as 600 that is wrong as opposed to 610 expected.
 Because OrcFileMergeOperator only closes last orc file, the 1st and 2nd orc 
 files still remain in table directory due to failure of deleting unclosed 
 file for old data clean when MoveTask tries to copy merged orc file from 
 scratch dir to table dir. Eventually the query goes to old data(1st and 2nd 
 orc files).



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HIVE-8427) Hive Streaming : secure streaming hangs leading to time outs.

2014-10-13 Thread Vikram Dixit K (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-8427?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14170046#comment-14170046
 ] 

Vikram Dixit K commented on HIVE-8427:
--

+1 for 0.14. 

 Hive Streaming :  secure streaming hangs leading to time outs.
 --

 Key: HIVE-8427
 URL: https://issues.apache.org/jira/browse/HIVE-8427
 Project: Hive
  Issue Type: Bug
  Components: HCatalog
Affects Versions: 0.14.0
Reporter: Roshan Naik
Assignee: Roshan Naik
  Labels: ACID, Streaming
 Fix For: 0.14.0

 Attachments: HIVE-8427.patch


 Need to enable Thrift Sasl setting  for secure mode communcation



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HIVE-8381) Update hive version on trunk to 0.15

2014-10-13 Thread Vikram Dixit K (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-8381?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vikram Dixit K updated HIVE-8381:
-
Attachment: HIVE-8381.4.patch

Itests were missed. There should be an easier way to do this :P

{noformat}
mvn versions:set -DnewVersion=0.15.0-SNAPSHOT -DgenerateBackupPoms=false 
{noformat}

did not change it for the itests directory for some reason.

 Update hive version on trunk to 0.15
 

 Key: HIVE-8381
 URL: https://issues.apache.org/jira/browse/HIVE-8381
 Project: Hive
  Issue Type: Bug
  Components: Build Infrastructure
Affects Versions: 0.15.0
Reporter: Vikram Dixit K
Assignee: Vikram Dixit K
 Attachments: HIVE-8381.1.patch, HIVE-8381.2.patch, HIVE-8381.3.patch, 
 HIVE-8381.4.patch


 In view of the branching for 0.14 done, we need to upgrade the version of 
 hive on trunk to 0.15.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HIVE-8225) CBO trunk merge: union11 test fails due to incorrect plan

2014-10-13 Thread Vikram Dixit K (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-8225?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14170300#comment-14170300
 ] 

Vikram Dixit K commented on HIVE-8225:
--

+1 for 0.14

 CBO trunk merge: union11 test fails due to incorrect plan
 -

 Key: HIVE-8225
 URL: https://issues.apache.org/jira/browse/HIVE-8225
 Project: Hive
  Issue Type: Bug
Reporter: Sergey Shelukhin
Assignee: Pengcheng Xiong
Priority: Critical
 Fix For: 0.15.0

 Attachments: HIVE-8225.1.patch, HIVE-8225.2.patch, HIVE-8225.3.patch, 
 HIVE-8225.4.patch, HIVE-8225.5.patch, HIVE-8225.6.patch, HIVE-8225.7.patch, 
 HIVE-8225.8.patch, HIVE-8225.inprogress.patch, HIVE-8225.inprogress.patch, 
 HIVE-8225.patch


 The result changes to as if the union didn't have count() inside. The issue 
 can be fixed by using srcunion.value outside the subquery in count (replace 
 count(1) with count(srcunion.value)). Otherwise, it looks like count(1) node 
 from union-ed queries is not present in AST at all, which might cause this 
 result.
 -Interestingly, adding group by to each query in a union produces completely 
 weird result (count(1) is 309 for each key, whereas it should be 1 and the 
 logical incorrect value if internal count is lost is 500)- Nm, that groups 
 by table column called key, which is weird but is what Hive does



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HIVE-8409) SMB joins fail intermittently on tez

2014-10-13 Thread Vikram Dixit K (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-8409?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vikram Dixit K updated HIVE-8409:
-
Attachment: HIVE-8409.3.patch

 SMB joins fail intermittently on tez
 

 Key: HIVE-8409
 URL: https://issues.apache.org/jira/browse/HIVE-8409
 Project: Hive
  Issue Type: Bug
  Components: Tez
Affects Versions: 0.14.0
Reporter: Vikram Dixit K
Assignee: Vikram Dixit K
Priority: Critical
 Fix For: 0.14.0

 Attachments: HIVE-8409.1.patch, HIVE-8409.2.patch, HIVE-8409.3.patch


 Flakiness with regard to SMB joins in tez. TEZ-1647 is required to complete 
 the fix.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (HIVE-8453) Fix possible unnecessary deserialization in hive-tez

2014-10-13 Thread Vikram Dixit K (JIRA)
Vikram Dixit K created HIVE-8453:


 Summary: Fix possible unnecessary deserialization in hive-tez
 Key: HIVE-8453
 URL: https://issues.apache.org/jira/browse/HIVE-8453
 Project: Hive
  Issue Type: Bug
  Components: Tez
Affects Versions: 0.14.0
Reporter: Vikram Dixit K
Assignee: Vikram Dixit K


HIVE-8409 requires deserialization of the entire row to merge on the sorted 
columns which is expensive. We could potentially deserialize only the sorted 
column from the serialized data and save on this cost.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HIVE-8292) Reading from partitioned bucketed tables has high overhead in MapOperator.cleanUpInputFileChangedOp

2014-10-10 Thread Vikram Dixit K (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-8292?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14167363#comment-14167363
 ] 

Vikram Dixit K commented on HIVE-8292:
--

LGTM +1. +1 for 0.14 as well.

 Reading from partitioned bucketed tables has high overhead in 
 MapOperator.cleanUpInputFileChangedOp
 ---

 Key: HIVE-8292
 URL: https://issues.apache.org/jira/browse/HIVE-8292
 Project: Hive
  Issue Type: Bug
  Components: Query Processor
Affects Versions: 0.14.0
 Environment: cn105
Reporter: Mostafa Mokhtar
Assignee: Gopal V
 Fix For: 0.14.0

 Attachments: 2014_09_29_14_46_04.jfr, HIVE-8292.1.patch, 
 HIVE-8292.2.patch


 Reading from bucketed partitioned tables has significantly higher overhead 
 compared to non-bucketed non-partitioned files.
 50% of the profile is spent in MapOperator.cleanUpInputFileChangedOp
 5% the CPU in 
 {code}
  Path onepath = normalizePath(onefile);
 {code}
 And 
 45% the CPU in 
 {code}
  onepath.toUri().relativize(fpath.toUri()).equals(fpath.toUri());
 {code}
 From the profiler 
 {code}
 Stack Trace   Sample CountPercentage(%)
 hive.ql.exec.tez.MapRecordSource.processRow(Object)   5,327   62.348
hive.ql.exec.vector.VectorMapOperator.process(Writable)5,326   62.336
   hive.ql.exec.Operator.cleanUpInputFileChanged() 4,851   56.777
  hive.ql.exec.MapOperator.cleanUpInputFileChangedOp() 4,849   56.753
  java.net.URI.relativize(URI) 3,903   45.681
 java.net.URI.relativize(URI, URI) 3,903   
 45.681
java.net.URI.normalize(String) 2,169   
 25.386
java.net.URI.equal(String, String) 
 526 6.156
java.net.URI.equalIgnoringCase(String, 
 String) 1   0.012
java.lang.String.substring(int)
 1   0.012
 hive.ql.exec.MapOperator.normalizePath(String)506 5.922
 org.apache.commons.logging.impl.Log4JLogger.info(Object)  32  
 0.375
  java.net.URI.equals(Object)  12  0.14
  java.util.HashMap$KeySet.iterator()  5   
 0.059
  java.util.HashMap.get(Object)4   
 0.047
  java.util.LinkedHashMap.get(Object)  3   
 0.035
  hive.ql.exec.Operator.cleanUpInputFileChanged()  1   0.012
   hive.ql.exec.Operator.forward(Object, ObjectInspector)  473 5.536
   hive.ql.exec.mr.ExecMapperContext.inputFileChanged()1   0.012
 {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HIVE-8420) TestHadoop20SAuthBridge broken with hadoop-1

2014-10-10 Thread Vikram Dixit K (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-8420?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14167376#comment-14167376
 ] 

Vikram Dixit K commented on HIVE-8420:
--

Yes. We should get this into 0.14 as well. +1 for 0.14

 TestHadoop20SAuthBridge broken with hadoop-1
 

 Key: HIVE-8420
 URL: https://issues.apache.org/jira/browse/HIVE-8420
 Project: Hive
  Issue Type: Test
  Components: Tests
Affects Versions: 0.14.0
Reporter: Szehon Ho
Assignee: Szehon Ho
Priority: Blocker
 Attachments: HIVE-8420.patch


 Looks like TestHadoop20SAuthBridge had new dependencies on hadoop-2 methods, 
 with upgrade to hadoop2.5.
 {code}
 [ERROR] Failed to execute goal 
 org.apache.maven.plugins:maven-compiler-plugin:3.1:testCompile 
 (default-testCompile) on project hive-it-unit: Compilation failure: 
 Compilation failure:
 [ERROR] 
 /data/hive-ptest/working/apache-svn-trunk-source/itests/hive-unit/src/test/java/org/apache/hadoop/hive/thrift/TestHadoop20SAuthBridge.java:[50,44]
  cannot find symbol
 [ERROR] symbol:   class DefaultImpersonationProvider
 [ERROR] location: package org.apache.hadoop.security.authorize
 [ERROR] 
 /data/hive-ptest/working/apache-svn-trunk-source/itests/hive-unit/src/test/java/org/apache/hadoop/hive/thrift/TestHadoop20SAuthBridge.java:[133,21]
  cannot find symbol
 [ERROR] symbol:   variable DefaultImpersonationProvider
 [ERROR] location: class org.apache.hadoop.hive.thrift.TestHadoop20SAuthBridge
 [ERROR] 
 /data/hive-ptest/working/apache-svn-trunk-source/itests/hive-unit/src/test/java/org/apache/hadoop/hive/thrift/TestHadoop20SAuthBridge.java:[296,7]
  cannot find symbol
 [ERROR] symbol:   variable DefaultImpersonationProvider
 [ERROR] location: class org.apache.hadoop.hive.thrift.TestHadoop20SAuthBridge
 {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HIVE-8328) MapJoin implementation in Tez should not reload hashtables

2014-10-10 Thread Vikram Dixit K (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-8328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14167418#comment-14167418
 ] 

Vikram Dixit K commented on HIVE-8328:
--

+1 LGTM. +1 for 0.14 as well.

 MapJoin implementation in Tez should not reload hashtables 
 ---

 Key: HIVE-8328
 URL: https://issues.apache.org/jira/browse/HIVE-8328
 Project: Hive
  Issue Type: Bug
  Components: Tez
Affects Versions: 0.14.0
Reporter: Gopal V
Assignee: Gopal V
Priority: Blocker
  Labels: Regression
 Fix For: 0.14.0

 Attachments: HIVE-8328.1.patch, HIVE-8328.WIP.patch


 {code}
  private void loadHashTable() throws HiveException {
 if ((this.getExecContext() != null)
  ((this.getExecContext().getLocalWork() == null) || 
 (!this.getExecContext()
 .getLocalWork().getInputFileChangeSensitive( {
   if (hashTblInitedOnce) {
 return;
   } else {
 hashTblInitedOnce = true;
   }
 }
 {code}
 This is not exiting in Tez, even if hashTblInitedOnce=true;



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HIVE-8381) Update hive version on trunk to 0.15

2014-10-10 Thread Vikram Dixit K (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-8381?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vikram Dixit K updated HIVE-8381:
-
Attachment: HIVE-8381.3.patch

 Update hive version on trunk to 0.15
 

 Key: HIVE-8381
 URL: https://issues.apache.org/jira/browse/HIVE-8381
 Project: Hive
  Issue Type: Bug
  Components: Build Infrastructure
Affects Versions: 0.15.0
Reporter: Vikram Dixit K
Assignee: Vikram Dixit K
 Attachments: HIVE-8381.1.patch, HIVE-8381.2.patch, HIVE-8381.3.patch


 In view of the branching for 0.14 done, we need to upgrade the version of 
 hive on trunk to 0.15.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (HIVE-8432) Remove tez_smb_1 from tests. (Hopefully reduces test run times)

2014-10-10 Thread Vikram Dixit K (JIRA)
Vikram Dixit K created HIVE-8432:


 Summary: Remove tez_smb_1 from tests. (Hopefully reduces test run 
times)
 Key: HIVE-8432
 URL: https://issues.apache.org/jira/browse/HIVE-8432
 Project: Hive
  Issue Type: Bug
  Components: Tez
Affects Versions: 0.15.0
Reporter: Vikram Dixit K
Assignee: Vikram Dixit K






--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HIVE-8432) Remove tez_smb_1 from tests. (Hopefully reduces test run times)

2014-10-10 Thread Vikram Dixit K (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-8432?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vikram Dixit K updated HIVE-8432:
-
Attachment: HIVE-8432.1.patch

Will commit this. Hopefully reduces the wait times on queues.

 Remove tez_smb_1 from tests. (Hopefully reduces test run times)
 ---

 Key: HIVE-8432
 URL: https://issues.apache.org/jira/browse/HIVE-8432
 Project: Hive
  Issue Type: Bug
  Components: Tez
Affects Versions: 0.15.0
Reporter: Vikram Dixit K
Assignee: Vikram Dixit K
 Attachments: HIVE-8432.1.patch






--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HIVE-8432) Remove tez_smb_1 from tests. (Hopefully reduces test run times)

2014-10-10 Thread Vikram Dixit K (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-8432?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vikram Dixit K updated HIVE-8432:
-
Status: Patch Available  (was: Open)

 Remove tez_smb_1 from tests. (Hopefully reduces test run times)
 ---

 Key: HIVE-8432
 URL: https://issues.apache.org/jira/browse/HIVE-8432
 Project: Hive
  Issue Type: Bug
  Components: Tez
Affects Versions: 0.15.0
Reporter: Vikram Dixit K
Assignee: Vikram Dixit K
 Attachments: HIVE-8432.1.patch






--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HIVE-8432) Remove tez_smb_1 from tests. (Hopefully reduces test run times)

2014-10-10 Thread Vikram Dixit K (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-8432?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vikram Dixit K updated HIVE-8432:
-
Resolution: Fixed
Status: Resolved  (was: Patch Available)

 Remove tez_smb_1 from tests. (Hopefully reduces test run times)
 ---

 Key: HIVE-8432
 URL: https://issues.apache.org/jira/browse/HIVE-8432
 Project: Hive
  Issue Type: Bug
  Components: Tez
Affects Versions: 0.15.0
Reporter: Vikram Dixit K
Assignee: Vikram Dixit K
 Attachments: HIVE-8432.1.patch






--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HIVE-7932) It may cause NP exception when add accessed columns to ReadEntity

2014-10-09 Thread Vikram Dixit K (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-7932?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14164885#comment-14164885
 ] 

Vikram Dixit K commented on HIVE-7932:
--

+1 for 0.14

 It may cause NP exception when add accessed columns to ReadEntity
 -

 Key: HIVE-7932
 URL: https://issues.apache.org/jira/browse/HIVE-7932
 Project: Hive
  Issue Type: Bug
Reporter: Xiaomeng Huang
Assignee: Xiaomeng Huang
 Fix For: 0.15.0

 Attachments: HIVE-7932.001.patch, HIVE-7932.002.patch


 {code}
 case TABLE:
entity.getAccessedColumns().addAll(
   tableToColumnAccessMap.get(entity.getTable().getCompleteName()));
 {code}
 if  tableToColumnAccessMap.get(entity.getTable().getCompleteName()) is null, 
 addAll(null) will throw null pointer exception.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HIVE-8280) CBO : When filter is applied on dimension table PK/FK code path is not in effect.

2014-10-09 Thread Vikram Dixit K (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-8280?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14164892#comment-14164892
 ] 

Vikram Dixit K commented on HIVE-8280:
--

+1 for 0.14

 CBO : When filter is applied on dimension table PK/FK code path is not in 
 effect.
 -

 Key: HIVE-8280
 URL: https://issues.apache.org/jira/browse/HIVE-8280
 Project: Hive
  Issue Type: Bug
  Components: CBO
Affects Versions: 0.14.0
Reporter: Mostafa Mokhtar
Assignee: Harish Butani
 Fix For: 0.14.0

 Attachments: HIVE-8280.1.patch, HIVE-8280.2.patch, HIVE-8280.3.patch


 When a filter is applied on PK side joins don't qualify as PK/FK join.
 In getUniqueKeys when a filter is applied on the table the child is no  
 longer a table scan.
 {code}
   public SetBitSet getUniqueKeys(ProjectRelBase rel, boolean ignoreNulls) {
 RelNode child = rel.getChild();
 if (!(child instanceof HiveTableScanRel)) {
   FunctionRelNode, Metadata fn = RelMdUniqueKeys.SOURCE.apply(
   rel.getClass(), BuiltInMetadata.UniqueKeys.class);
   return ((BuiltInMetadata.UniqueKeys) fn.apply(rel))
   .getUniqueKeys(ignoreNulls);
 } 
 {code}
 Repro 
 {code}
 with ss as 
 (select 
 ss_customer_sk, ss_item_sk, ss_ticket_number
 from
 store_sales,
 store
 where
 s_store_sk = ss_store_sk
 and s_market_id = 4), 
 sr as
 (select sr_customer_sk,sr_item_sk ,sr_ticket_number from store_returns, store 
 where s_store_sk = sr_store_sk and s_market_id=4) 
 select 
 count(*)
 from
 ss,
 sr
 where
 ss_customer_sk = sr_customer_sk
 and ss_item_sk = sr_item_sk
 and ss_ticket_number = sr_ticket_number;
 {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HIVE-8409) SMB joins fail intermittently on tez

2014-10-09 Thread Vikram Dixit K (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-8409?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vikram Dixit K updated HIVE-8409:
-
Attachment: HIVE-8409.2.patch

 SMB joins fail intermittently on tez
 

 Key: HIVE-8409
 URL: https://issues.apache.org/jira/browse/HIVE-8409
 Project: Hive
  Issue Type: Bug
  Components: Tez
Affects Versions: 0.14.0
Reporter: Vikram Dixit K
Assignee: Vikram Dixit K
Priority: Critical
 Fix For: 0.14.0

 Attachments: HIVE-8409.1.patch, HIVE-8409.2.patch


 Flakiness with regard to SMB joins in tez. TEZ-1647 is required to complete 
 the fix.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HIVE-8380) NanoTime class serializes and deserializes Timestamp incorrectly

2014-10-09 Thread Vikram Dixit K (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-8380?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14165499#comment-14165499
 ] 

Vikram Dixit K commented on HIVE-8380:
--

+1 for 0.14.

 NanoTime class serializes and deserializes Timestamp incorrectly
 

 Key: HIVE-8380
 URL: https://issues.apache.org/jira/browse/HIVE-8380
 Project: Hive
  Issue Type: Bug
Affects Versions: 0.14.0
Reporter: Brock Noland
Assignee: Brock Noland
Priority: Blocker
 Fix For: 0.14.0

 Attachments: HIVE-8380.patch, HIVE-8380.patch


 Same as PARQUET-114



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HIVE-8407) [CBO] Handle filters with non-boolean return type

2014-10-09 Thread Vikram Dixit K (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-8407?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14165905#comment-14165905
 ] 

Vikram Dixit K commented on HIVE-8407:
--

+1 for 0.14

 [CBO] Handle filters with non-boolean return type
 -

 Key: HIVE-8407
 URL: https://issues.apache.org/jira/browse/HIVE-8407
 Project: Hive
  Issue Type: Bug
  Components: CBO
Affects Versions: 0.14.0
Reporter: Ashutosh Chauhan
Assignee: Ashutosh Chauhan
 Fix For: 0.15.0

 Attachments: HIVE-8407.patch


 e.g. select * from src where 'foo';



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HIVE-8315) CBO : Negate condition underestimates selectivity which results in an in-efficient plan

2014-10-08 Thread Vikram Dixit K (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-8315?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14163650#comment-14163650
 ] 

Vikram Dixit K commented on HIVE-8315:
--

+1 for 0.14

 CBO : Negate condition underestimates selectivity which results in an 
 in-efficient plan
 ---

 Key: HIVE-8315
 URL: https://issues.apache.org/jira/browse/HIVE-8315
 Project: Hive
  Issue Type: Bug
  Components: CBO
Affects Versions: 0.14.0
Reporter: Mostafa Mokhtar
Assignee: Harish Butani
 Fix For: 0.15.0

 Attachments: HIVE-8315.1.patch


 For TPC-DS Q64 the predicate cd1.cd_marital_status  cd2.cd_marital_status 
 under estimate the join selectivity by a huge margin and results in 
 in-efficient join order.
 This is a subset of the logical plan showing that item was joined very last
 {code}
 HiveJoinRel(condition=[=($0, $37)], 
 joinType=[inner]): rowcount = 1.0, cumulative cost = {6.386017602518958E8 
 rows, 0.0 cpu, 0.0 io}, id = 3790
   HiveJoinRel(condition=[=($0, $33)], 
 joinType=[inner]): rowcount = 1.0, cumulative cost = {6.386017582518958E8 
 rows, 0.0 cpu, 0.0 io}, id = 3067
 HiveFilterRel(condition=[($30, $32)]): 
 rowcount = 1.8252236387887635, cumulative cost = {6.386017554266721E8 rows, 
 0.0 cpu, 0.0 io}, id = 1153
   HiveProjectRel(ss_item_sk=[$2], 
 ss_customer_sk=[$3], ss_cdemo_sk=[$4], ss_hdemo_sk=[$5], ss_addr_sk=[$6], 
 ss_store_sk=[$7], ss_promo_sk=[$8], ss_ticket_number=[$9], 
 ss_wholesale_cost=[$10], ss_list_price=[$11], ss_coupon_amt=[$12], 
 ss_sold_date_sk=[$13], sr_item_sk=[$0], sr_ticket_number=[$1], 
 c_customer_sk=[$23], c_current_cdemo_sk=[$24], c_current_hdemo_sk=[$25], 
 c_current_addr_sk=[$26], c_first_shipto_date_sk=[$27], 
 c_first_sales_date_sk=[$28], d_date_sk=[$14], d_year=[$15], d_date_sk0=[$29], 
 d_year0=[$30], d_date_sk1=[$31], d_year1=[$32], s_store_sk=[$18], 
 s_store_name=[$19], s_zip=[$20], cd_demo_sk=[$16], cd_marital_status=[$17], 
 cd_demo_sk0=[$21], cd_marital_status0=[$22]): rowcount = 
 3.6246005783468924E7, cumulative cost = {6.386017554266721E8 rows, 0.0 cpu, 
 0.0 io}, id = 2312
 HiveJoinRel(condition=[AND(=($2, $0), 
 =($9, $1))], joinType=[inner]): rowcount = 3.6246005783468924E7, cumulative 
 cost = {6.386017554266721E8 rows, 0.0 cpu, 0.0 io}, id = 2310
   HiveProjectRel(sr_item_sk=[$1], 
 sr_ticket_number=[$8]): rowcount = 5.5578005E7, cumulative cost = {0.0 rows, 
 0.0 cpu, 0.0 io}, id = 912
 
 HiveTableScanRel(table=[[tpcds_bin_partitioned_orc_200_orig.store_returns]]): 
 rowcount = 5.5578005E7, cumulative cost = {0}, id = 62
   HiveJoinRel(condition=[=($1, $21)], 
 joinType=[inner]): rowcount = 1.2950939439433252E7, cumulative cost = 
 {5.700728109872389E8 rows, 0.0 cpu, 0.0 io}, id = 2308
 HiveJoinRel(condition=[=($5, 
 $16)], joinType=[inner]): rowcount = 5491530.921341597, cumulative cost = 
 {5.629812800658973E8 rows, 0.0 cpu, 0.0 io}, id = 2301
   HiveJoinRel(condition=[=($2, 
 $14)], joinType=[inner]): rowcount = 5491530.921341597, cumulative cost = 
 {5.574895371445558E8 rows, 0.0 cpu, 0.0 io}, id = 2299
 HiveJoinRel(condition=[=($11, 
 $12)], joinType=[inner]): rowcount = 5491530.921341597, cumulative cost = 
 {5.500772062232143E8 rows, 0.0 cpu, 0.0 io}, id = 1898
   
 HiveProjectRel(ss_item_sk=[$1], ss_customer_sk=[$2], ss_cdemo_sk=[$3], 
 ss_hdemo_sk=[$4], ss_addr_sk=[$5], ss_store_sk=[$6], ss_promo_sk=[$7], 
 ss_ticket_number=[$8], ss_wholesale_cost=[$10], ss_list_price=[$11], 
 ss_coupon_amt=[$18], ss_sold_date_sk=[$22]): rowcount = 5.50076554E8, 
 cumulative cost = {0.0 rows, 0.0 cpu, 0.0 io}, id = 909
 
 HiveTableScanRel(table=[[tpcds_bin_partitioned_orc_200_orig.store_sales]]): 
 rowcount = 5.50076554E8, cumulative cost = {0}, id = 55{code}
 Query 
 {code}
 select cs1.product_name ,cs1.store_name ,cs1.store_zip ,cs1.b_street_number 
 ,cs1.b_streen_name ,cs1.b_city
  ,cs1.b_zip ,cs1.c_street_number ,cs1.c_street_name ,cs1.c_city 
 ,cs1.c_zip ,cs1.syear ,cs1.cnt
  ,cs1.s1 ,cs1.s2 ,cs1.s3
  ,cs2.s1 ,cs2.s2 ,cs2.s3 ,cs2.syear ,cs2.cnt
 from
 (select i_product_name as product_name ,i_item_sk as item_sk ,s_store_name as 
 store_name
  ,s_zip as store_zip ,ad1.ca_street_number as b_street_number 
 ,ad1.ca_street_name as b_streen_name
  

[jira] [Commented] (HIVE-8344) Hive on Tez sets mapreduce.framework.name to yarn-tez

2014-10-08 Thread Vikram Dixit K (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-8344?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14163931#comment-14163931
 ] 

Vikram Dixit K commented on HIVE-8344:
--

+1 LGTM. +1 for 0.14 as well.

 Hive on Tez sets mapreduce.framework.name to yarn-tez
 -

 Key: HIVE-8344
 URL: https://issues.apache.org/jira/browse/HIVE-8344
 Project: Hive
  Issue Type: Bug
Reporter: Gunther Hagleitner
Assignee: Gunther Hagleitner
 Attachments: HIVE-8344.1.patch, HIVE-8344.2.patch, HIVE-8344.3.patch


 This was done to run MR jobs when in Tez mode (emulate MR on Tez). However, 
 we don't switch back when the user specifies MR as exec engine.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HIVE-7917) Hive max reducers count has regressed from a prime number to 999 (re-apply HIVE-7158)

2014-10-08 Thread Vikram Dixit K (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-7917?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14164114#comment-14164114
 ] 

Vikram Dixit K commented on HIVE-7917:
--

+1 for 0.14

 Hive max reducers count has regressed from a prime number to 999  (re-apply 
 HIVE-7158)
 --

 Key: HIVE-7917
 URL: https://issues.apache.org/jira/browse/HIVE-7917
 Project: Hive
  Issue Type: Bug
  Components: Configuration
Reporter: Gopal V
Assignee: Gopal V
 Attachments: HIVE-7917.1.patch, HIVE-7917.2.patch


 HIVE-6037 overwrote configuration changes made by HIVE-7158 for Configuration 
 parameters related to reducer parallelism.
 The changes need to be reapplied.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HIVE-8363) AccumuloStorageHandler compile failure hadoop-1

2014-10-08 Thread Vikram Dixit K (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-8363?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14164215#comment-14164215
 ] 

Vikram Dixit K commented on HIVE-8363:
--

+1 for 0.14

 AccumuloStorageHandler compile failure hadoop-1
 ---

 Key: HIVE-8363
 URL: https://issues.apache.org/jira/browse/HIVE-8363
 Project: Hive
  Issue Type: Bug
  Components: StorageHandler
Reporter: Szehon Ho
Assignee: Josh Elser
Priority: Blocker
 Fix For: 0.14.0

 Attachments: HIVE-8363.1.patch


 There's an error about AccumuloStorageHandler compiling on hadoop-1.  It 
 seems the signature of split() is not the same.  Looks like we can should use 
 another utils to fix this.
 {code}
 [ERROR] Failed to execute goal 
 org.apache.maven.plugins:maven-compiler-plugin:3.1:compile (default-compile) 
 on project hive-accumulo-handler: Compilation failure
 [ERROR] 
 /data/hive-ptest/working/apache-svn-trunk-source/accumulo-handler/src/java/org/apache/hadoop/hive/accumulo/columns/ColumnMapper.java:[57,52]
  no suitable method found for split(java.lang.String,char)
 [ERROR] method 
 org.apache.hadoop.util.StringUtils.split(java.lang.String,char,char) is not 
 applicable
 {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (HIVE-8409) SMB joins fail intermittently on tez

2014-10-08 Thread Vikram Dixit K (JIRA)
Vikram Dixit K created HIVE-8409:


 Summary: SMB joins fail intermittently on tez
 Key: HIVE-8409
 URL: https://issues.apache.org/jira/browse/HIVE-8409
 Project: Hive
  Issue Type: Bug
  Components: Tez
Affects Versions: 0.14.0
Reporter: Vikram Dixit K
Assignee: Vikram Dixit K
Priority: Critical
 Fix For: 0.14.0


Flakiness with regard to SMB joins in tez. TEZ-1647 is required to complete the 
fix.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HIVE-8409) SMB joins fail intermittently on tez

2014-10-08 Thread Vikram Dixit K (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-8409?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vikram Dixit K updated HIVE-8409:
-
Attachment: HIVE-8409.1.patch

 SMB joins fail intermittently on tez
 

 Key: HIVE-8409
 URL: https://issues.apache.org/jira/browse/HIVE-8409
 Project: Hive
  Issue Type: Bug
  Components: Tez
Affects Versions: 0.14.0
Reporter: Vikram Dixit K
Assignee: Vikram Dixit K
Priority: Critical
 Fix For: 0.14.0

 Attachments: HIVE-8409.1.patch


 Flakiness with regard to SMB joins in tez. TEZ-1647 is required to complete 
 the fix.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HIVE-8409) SMB joins fail intermittently on tez

2014-10-08 Thread Vikram Dixit K (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-8409?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vikram Dixit K updated HIVE-8409:
-
Status: Patch Available  (was: Open)

 SMB joins fail intermittently on tez
 

 Key: HIVE-8409
 URL: https://issues.apache.org/jira/browse/HIVE-8409
 Project: Hive
  Issue Type: Bug
  Components: Tez
Affects Versions: 0.14.0
Reporter: Vikram Dixit K
Assignee: Vikram Dixit K
Priority: Critical
 Fix For: 0.14.0

 Attachments: HIVE-8409.1.patch


 Flakiness with regard to SMB joins in tez. TEZ-1647 is required to complete 
 the fix.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HIVE-8409) SMB joins fail intermittently on tez

2014-10-08 Thread Vikram Dixit K (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-8409?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vikram Dixit K updated HIVE-8409:
-
Status: Open  (was: Patch Available)

 SMB joins fail intermittently on tez
 

 Key: HIVE-8409
 URL: https://issues.apache.org/jira/browse/HIVE-8409
 Project: Hive
  Issue Type: Bug
  Components: Tez
Affects Versions: 0.14.0
Reporter: Vikram Dixit K
Assignee: Vikram Dixit K
Priority: Critical
 Fix For: 0.14.0

 Attachments: HIVE-8409.1.patch


 Flakiness with regard to SMB joins in tez. TEZ-1647 is required to complete 
 the fix.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HIVE-8409) SMB joins fail intermittently on tez

2014-10-08 Thread Vikram Dixit K (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-8409?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vikram Dixit K updated HIVE-8409:
-
Attachment: HIVE-8409.1.patch

 SMB joins fail intermittently on tez
 

 Key: HIVE-8409
 URL: https://issues.apache.org/jira/browse/HIVE-8409
 Project: Hive
  Issue Type: Bug
  Components: Tez
Affects Versions: 0.14.0
Reporter: Vikram Dixit K
Assignee: Vikram Dixit K
Priority: Critical
 Fix For: 0.14.0

 Attachments: HIVE-8409.1.patch


 Flakiness with regard to SMB joins in tez. TEZ-1647 is required to complete 
 the fix.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HIVE-8409) SMB joins fail intermittently on tez

2014-10-08 Thread Vikram Dixit K (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-8409?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vikram Dixit K updated HIVE-8409:
-
Attachment: (was: HIVE-8409.1.patch)

 SMB joins fail intermittently on tez
 

 Key: HIVE-8409
 URL: https://issues.apache.org/jira/browse/HIVE-8409
 Project: Hive
  Issue Type: Bug
  Components: Tez
Affects Versions: 0.14.0
Reporter: Vikram Dixit K
Assignee: Vikram Dixit K
Priority: Critical
 Fix For: 0.14.0

 Attachments: HIVE-8409.1.patch


 Flakiness with regard to SMB joins in tez. TEZ-1647 is required to complete 
 the fix.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HIVE-8409) SMB joins fail intermittently on tez

2014-10-08 Thread Vikram Dixit K (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-8409?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vikram Dixit K updated HIVE-8409:
-
Status: Patch Available  (was: Open)

 SMB joins fail intermittently on tez
 

 Key: HIVE-8409
 URL: https://issues.apache.org/jira/browse/HIVE-8409
 Project: Hive
  Issue Type: Bug
  Components: Tez
Affects Versions: 0.14.0
Reporter: Vikram Dixit K
Assignee: Vikram Dixit K
Priority: Critical
 Fix For: 0.14.0

 Attachments: HIVE-8409.1.patch


 Flakiness with regard to SMB joins in tez. TEZ-1647 is required to complete 
 the fix.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HIVE-8364) We're not waiting for all inputs in MapRecordProcessor on Tez

2014-10-08 Thread Vikram Dixit K (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-8364?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14164596#comment-14164596
 ] 

Vikram Dixit K commented on HIVE-8364:
--

Superseded by HIVE-8409.

 We're not waiting for all inputs in MapRecordProcessor on Tez
 -

 Key: HIVE-8364
 URL: https://issues.apache.org/jira/browse/HIVE-8364
 Project: Hive
  Issue Type: Bug
Reporter: Gunther Hagleitner
Assignee: Vikram Dixit K
 Fix For: 0.14.0

 Attachments: HIVE-8364.1.patch, HIVE-8364.2.patch


 Seems like this could be a race condition: We're blocking for some inputs to 
 become available, but the main MR input is just assumed ready...



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HIVE-8364) We're not waiting for all inputs in MapRecordProcessor on Tez

2014-10-08 Thread Vikram Dixit K (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-8364?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vikram Dixit K updated HIVE-8364:
-
Resolution: Duplicate
Status: Resolved  (was: Patch Available)

 We're not waiting for all inputs in MapRecordProcessor on Tez
 -

 Key: HIVE-8364
 URL: https://issues.apache.org/jira/browse/HIVE-8364
 Project: Hive
  Issue Type: Bug
Reporter: Gunther Hagleitner
Assignee: Vikram Dixit K
 Fix For: 0.14.0

 Attachments: HIVE-8364.1.patch, HIVE-8364.2.patch


 Seems like this could be a race condition: We're blocking for some inputs to 
 become available, but the main MR input is just assumed ready...



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HIVE-8376) Umbrella Jira for HiveServer2 dynamic service discovery

2014-10-07 Thread Vikram Dixit K (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-8376?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14162171#comment-14162171
 ] 

Vikram Dixit K commented on HIVE-8376:
--

+1 for 0.14

 Umbrella Jira for HiveServer2 dynamic service discovery
 ---

 Key: HIVE-8376
 URL: https://issues.apache.org/jira/browse/HIVE-8376
 Project: Hive
  Issue Type: New Feature
  Components: HiveServer2, JDBC
Affects Versions: 0.14.0
Reporter: Vaibhav Gumashta
Assignee: Vaibhav Gumashta
  Labels: TODOC14
 Fix For: 0.14.0


 Creating an ☂ Jira for documentation purpose. I'll add a detailed doc for the 
 implementation  usage here.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HIVE-8366) CBO fails if there is a table sample in subquery

2014-10-07 Thread Vikram Dixit K (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-8366?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14162169#comment-14162169
 ] 

Vikram Dixit K commented on HIVE-8366:
--

+1 for 0.14 as well.

 CBO fails if there is a table sample in subquery
 

 Key: HIVE-8366
 URL: https://issues.apache.org/jira/browse/HIVE-8366
 Project: Hive
  Issue Type: Bug
  Components: CBO, Logical Optimizer
Affects Versions: 0.14.0
Reporter: Ashutosh Chauhan
Assignee: Ashutosh Chauhan
 Fix For: 0.15.0

 Attachments: HIVE-8366.patch


 Bail out from cbo in such cases.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HIVE-8261) CBO : Predicate pushdown is removed by Optiq

2014-10-07 Thread Vikram Dixit K (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-8261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14162172#comment-14162172
 ] 

Vikram Dixit K commented on HIVE-8261:
--

+1 for 0.14

 CBO : Predicate pushdown is removed by Optiq 
 -

 Key: HIVE-8261
 URL: https://issues.apache.org/jira/browse/HIVE-8261
 Project: Hive
  Issue Type: Bug
  Components: CBO
Affects Versions: 0.14.0, 0.13.1
Reporter: Mostafa Mokhtar
Assignee: Harish Butani
 Fix For: 0.14.0

 Attachments: HIVE-8261.1.patch


 Plan for TPC-DS Q64 wasn't optimal upon looking at the logical plan I 
 realized that predicate pushdown is not applied on date_dim d1.
 Interestingly before optiq we have the predicate pushed :
 {code}
 HiveFilterRel(condition=[=($5, $1)])
 HiveJoinRel(condition=[=($3, $6)], joinType=[inner])
   HiveProjectRel(_o__col0=[$0], _o__col1=[$2], _o__col2=[$3], 
 _o__col3=[$1])
 HiveFilterRel(condition=[=($0, 2000)])
   HiveAggregateRel(group=[{0, 1}], agg#0=[count()], agg#1=[sum($2)])
 HiveProjectRel($f0=[$4], $f1=[$5], $f2=[$2])
   HiveJoinRel(condition=[=($1, $8)], joinType=[inner])
 HiveJoinRel(condition=[=($1, $5)], joinType=[inner])
   HiveJoinRel(condition=[=($0, $3)], joinType=[inner])
 HiveProjectRel(ss_sold_date_sk=[$0], ss_item_sk=[$2], 
 ss_wholesale_cost=[$11])
   
 HiveTableScanRel(table=[[tpcds_bin_partitioned_orc_200.store_sales]])
 HiveProjectRel(d_date_sk=[$0], d_year=[$6])
   
 HiveTableScanRel(table=[[tpcds_bin_partitioned_orc_200.date_dim]])
   HiveFilterRel(condition=[AND(in($2, 'maroon', 'burnished', 
 'dim', 'steel', 'navajo', 'chocolate'), between(false, $1, 35, +(35, 10)), 
 between(false, $1, +(35, 1), +(35, 15)))])
 HiveProjectRel(i_item_sk=[$0], i_current_price=[$5], 
 i_color=[$17])
   
 HiveTableScanRel(table=[[tpcds_bin_partitioned_orc_200.item]])
 HiveProjectRel(_o__col0=[$0])
   HiveAggregateRel(group=[{0}])
 HiveProjectRel($f0=[$0])
   HiveJoinRel(condition=[AND(=($0, $2), =($1, $3))], 
 joinType=[inner])
 HiveProjectRel(cs_item_sk=[$15], 
 cs_order_number=[$17])
   
 HiveTableScanRel(table=[[tpcds_bin_partitioned_orc_200.catalog_sales]])
 HiveProjectRel(cr_item_sk=[$2], cr_order_number=[$16])
   
 HiveTableScanRel(table=[[tpcds_bin_partitioned_orc_200.catalog_returns]])
   HiveProjectRel(_o__col0=[$0], _o__col1=[$2], _o__col3=[$1])
 HiveFilterRel(condition=[=($0, +(2000, 1))])
   HiveAggregateRel(group=[{0, 1}], agg#0=[count()])
 HiveProjectRel($f0=[$4], $f1=[$5], $f2=[$2])
   HiveJoinRel(condition=[=($1, $8)], joinType=[inner])
 HiveJoinRel(condition=[=($1, $5)], joinType=[inner])
   HiveJoinRel(condition=[=($0, $3)], joinType=[inner])
 HiveProjectRel(ss_sold_date_sk=[$0], ss_item_sk=[$2], 
 ss_wholesale_cost=[$11])
   
 HiveTableScanRel(table=[[tpcds_bin_partitioned_orc_200.store_sales]])
 HiveProjectRel(d_date_sk=[$0], d_year=[$6])
   
 HiveTableScanRel(table=[[tpcds_bin_partitioned_orc_200.date_dim]])
   HiveFilterRel(condition=[AND(in($2, 'maroon', 'burnished', 
 'dim', 'steel', 'navajo', 'chocolate'), between(false, $1, 35, +(35, 10)), 
 between(false, $1, +(35, 1), +(35, 15)))])
 HiveProjectRel(i_item_sk=[$0], i_current_price=[$5], 
 i_color=[$17])
   
 HiveTableScanRel(table=[[tpcds_bin_partitioned_orc_200.item]])
 HiveProjectRel(_o__col0=[$0])
   HiveAggregateRel(group=[{0}])
 HiveProjectRel($f0=[$0])
   HiveJoinRel(condition=[AND(=($0, $2), =($1, $3))], 
 joinType=[inner])
 HiveProjectRel(cs_item_sk=[$15], 
 cs_order_number=[$17])
   
 HiveTableScanRel(table=[[tpcds_bin_partitioned_orc_200.catalog_sales]])
 HiveProjectRel(cr_item_sk=[$2], cr_order_number=[$16])
   
 HiveTableScanRel(table=[[tpcds_bin_partitioned_orc_200.catalog_returns]])
 {code}
 While after Optiq the filter on date_dim gets pulled up the plan 
 {code}
   HiveFilterRel(condition=[=($5, $1)]): rowcount = 1.0, cumulative cost = 
 {5.50188454E8 rows, 0.0 cpu, 0.0 io}, id = 6895
 HiveProjectRel(_o__col0=[$0], _o__col1=[$1], _o__col2=[$2], 
 _o__col3=[$3], _o__col00=[$4], _o__col10=[$5], _o__col30=[$6]): rowcount = 
 1.0, cumulative cost = 

[jira] [Commented] (HIVE-8258) Compactor cleaners can be starved on a busy table or partition.

2014-10-07 Thread Vikram Dixit K (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-8258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14162174#comment-14162174
 ] 

Vikram Dixit K commented on HIVE-8258:
--

+1 for 0.14

 Compactor cleaners can be starved on a busy table or partition.
 ---

 Key: HIVE-8258
 URL: https://issues.apache.org/jira/browse/HIVE-8258
 Project: Hive
  Issue Type: Bug
  Components: Transactions
Affects Versions: 0.13.1
Reporter: Alan Gates
Assignee: Alan Gates
Priority: Critical
 Fix For: 0.14.0

 Attachments: HIVE-8258.2.patch, HIVE-8258.3.patch, HIVE-8258.4.patch, 
 HIVE-8258.patch


 Currently the cleaning thread in the compactor does not run on a table or 
 partition while any locks are held on this partition.  This leaves it open to 
 starvation in the case of a busy table or partition.  It only needs to wait 
 until all locks on the table/partition at the time of the compaction have 
 expired.  Any jobs initiated after that (and thus any locks obtained) will be 
 for the new versions of the files.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (HIVE-8381) Update hive version on trunk to 0.15

2014-10-07 Thread Vikram Dixit K (JIRA)
Vikram Dixit K created HIVE-8381:


 Summary: Update hive version on trunk to 0.15
 Key: HIVE-8381
 URL: https://issues.apache.org/jira/browse/HIVE-8381
 Project: Hive
  Issue Type: Bug
  Components: Build Infrastructure
Affects Versions: 0.15.0
Reporter: Vikram Dixit K
Assignee: Vikram Dixit K


In view of the branching for 0.14 done, we need to upgrade the version of hive 
on trunk to 0.15.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HIVE-8381) Update hive version on trunk to 0.15

2014-10-07 Thread Vikram Dixit K (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-8381?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vikram Dixit K updated HIVE-8381:
-
Attachment: HIVE-8381.1.patch

[~brocknoland] Updated version of trunk.

 Update hive version on trunk to 0.15
 

 Key: HIVE-8381
 URL: https://issues.apache.org/jira/browse/HIVE-8381
 Project: Hive
  Issue Type: Bug
  Components: Build Infrastructure
Affects Versions: 0.15.0
Reporter: Vikram Dixit K
Assignee: Vikram Dixit K
 Attachments: HIVE-8381.1.patch


 In view of the branching for 0.14 done, we need to upgrade the version of 
 hive on trunk to 0.15.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HIVE-8381) Update hive version on trunk to 0.15

2014-10-07 Thread Vikram Dixit K (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-8381?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vikram Dixit K updated HIVE-8381:
-
Status: Patch Available  (was: Open)

 Update hive version on trunk to 0.15
 

 Key: HIVE-8381
 URL: https://issues.apache.org/jira/browse/HIVE-8381
 Project: Hive
  Issue Type: Bug
  Components: Build Infrastructure
Affects Versions: 0.15.0
Reporter: Vikram Dixit K
Assignee: Vikram Dixit K
 Attachments: HIVE-8381.1.patch


 In view of the branching for 0.14 done, we need to upgrade the version of 
 hive on trunk to 0.15.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HIVE-8360) Add cross cluster support for webhcat E2E tests

2014-10-07 Thread Vikram Dixit K (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-8360?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14162437#comment-14162437
 ] 

Vikram Dixit K commented on HIVE-8360:
--

+1 for 0.14.

 Add cross cluster support for webhcat E2E tests
 ---

 Key: HIVE-8360
 URL: https://issues.apache.org/jira/browse/HIVE-8360
 Project: Hive
  Issue Type: Test
  Components: Tests, WebHCat
 Environment: Secure cluster
Reporter: Aswathy Chellammal Sreekumar
Priority: Critical
 Fix For: 0.14.0

 Attachments: AD-MIT.patch


 In current Webhcat E2E test setup, cross domain secure cluster runs will fail 
 since the realm name for user principles are not included in the kinit 
 command. This patch concatenates the realm name to the user principal there 
 by resulting in a successful kinit.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HIVE-8393) Handle SIGINT on Tez

2014-10-07 Thread Vikram Dixit K (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-8393?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14162925#comment-14162925
 ] 

Vikram Dixit K commented on HIVE-8393:
--

+1 LGTM pending tests. +1 for 0.14 as well.

 Handle SIGINT on Tez
 

 Key: HIVE-8393
 URL: https://issues.apache.org/jira/browse/HIVE-8393
 Project: Hive
  Issue Type: Bug
Reporter: Gunther Hagleitner
Assignee: Gunther Hagleitner
 Attachments: HIVE-8393.1.patch


 The signal handler in hive currently only interrupts MR jobs. The same logic 
 can be used to interrupt Tez dags as well.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HIVE-6972) jdbc HTTP configuration options should be part of sessionConf part of connection string

2014-10-07 Thread Vikram Dixit K (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-6972?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14162937#comment-14162937
 ] 

Vikram Dixit K commented on HIVE-6972:
--

+1 for 0.14

 jdbc HTTP configuration options should be part of sessionConf part of 
 connection string
 ---

 Key: HIVE-6972
 URL: https://issues.apache.org/jira/browse/HIVE-6972
 Project: Hive
  Issue Type: Bug
  Components: HiveServer2, JDBC
Affects Versions: 0.13.0
Reporter: Thejas M Nair
Assignee: Vaibhav Gumashta
 Fix For: 0.14.0

 Attachments: HIVE-6972.1.patch, HIVE-6972.2.patch, HIVE-6972.3.patch


 The http connection parameters are currently part of the HiveConf section of 
 the connection string. It should ideally be part of SessionConf section, as 
 that is where rest of the connection parameters are.
 HTTP transport parameters are not part of the hiveconfiguration parameters 
 that need to be set.
 ie
 instead of 
 jdbc:hive2://host:port/db?hive.server2.transport.mode=http;hive.server2.thrift.http.path=http_endpoint
 it should be -
 jdbc:hive2://host:port/db;transportMode=http;httpPath=http_endpoint



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HIVE-8272) Query with particular decimal expression causes NPE during execution initialization

2014-10-07 Thread Vikram Dixit K (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-8272?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14162964#comment-14162964
 ] 

Vikram Dixit K commented on HIVE-8272:
--

+1 for 0.14

 Query with particular decimal expression causes NPE during execution 
 initialization
 ---

 Key: HIVE-8272
 URL: https://issues.apache.org/jira/browse/HIVE-8272
 Project: Hive
  Issue Type: Bug
  Components: Logical Optimizer
Reporter: Matt McCline
Assignee: Jason Dere
Priority: Critical
 Fix For: 0.15.0

 Attachments: HIVE-8272.1.patch


 Query:
 {code}
 select 
   cast(sum(dc)*100 as decimal(11,3)) as c1
   from somedecimaltable
   order by c1
   limit 100;
 {code}
 Fails during execution initialization due to *null* ExprNodeDesc.
 Noticed while trying to simplify a Vectorization issue and realized it was a 
 more general issue.
 {code}
 Caused by: java.lang.RuntimeException: Map operator initialization failed
   at 
 org.apache.hadoop.hive.ql.exec.mr.ExecMapper.configure(ExecMapper.java:154)
   ... 22 more
 Caused by: java.lang.RuntimeException: java.lang.NullPointerException
   at 
 org.apache.hadoop.hive.ql.exec.ReduceSinkOperator.initializeOp(ReduceSinkOperator.java:215)
   at org.apache.hadoop.hive.ql.exec.Operator.initialize(Operator.java:380)
   at org.apache.hadoop.hive.ql.exec.Operator.initialize(Operator.java:464)
   at 
 org.apache.hadoop.hive.ql.exec.Operator.initializeChildren(Operator.java:420)
   at 
 org.apache.hadoop.hive.ql.exec.GroupByOperator.initializeOp(GroupByOperator.java:427)
   at org.apache.hadoop.hive.ql.exec.Operator.initialize(Operator.java:380)
   at org.apache.hadoop.hive.ql.exec.Operator.initialize(Operator.java:464)
   at 
 org.apache.hadoop.hive.ql.exec.Operator.initializeChildren(Operator.java:420)
   at 
 org.apache.hadoop.hive.ql.exec.SelectOperator.initializeOp(SelectOperator.java:65)
   at org.apache.hadoop.hive.ql.exec.Operator.initialize(Operator.java:380)
   at org.apache.hadoop.hive.ql.exec.Operator.initialize(Operator.java:464)
   at 
 org.apache.hadoop.hive.ql.exec.Operator.initializeChildren(Operator.java:420)
   at 
 org.apache.hadoop.hive.ql.exec.TableScanOperator.initializeOp(TableScanOperator.java:193)
   at org.apache.hadoop.hive.ql.exec.Operator.initialize(Operator.java:380)
   at 
 org.apache.hadoop.hive.ql.exec.MapOperator.initializeOp(MapOperator.java:425)
   at org.apache.hadoop.hive.ql.exec.Operator.initialize(Operator.java:380)
   at 
 org.apache.hadoop.hive.ql.exec.mr.ExecMapper.configure(ExecMapper.java:133)
   ... 22 more
 Caused by: java.lang.NullPointerException
   at 
 org.apache.hadoop.hive.ql.plan.ExprNodeGenericFuncDesc.getExprString(ExprNodeGenericFuncDesc.java:154)
   at 
 org.apache.hadoop.hive.ql.plan.ExprNodeGenericFuncDesc.getExprString(ExprNodeGenericFuncDesc.java:154)
   at 
 org.apache.hadoop.hive.ql.exec.ReduceSinkOperator.initializeOp(ReduceSinkOperator.java:148)
   ... 38 more
 {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HIVE-8381) Update hive version on trunk to 0.15

2014-10-07 Thread Vikram Dixit K (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-8381?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vikram Dixit K updated HIVE-8381:
-
Attachment: HIVE-8381.2.patch

 Update hive version on trunk to 0.15
 

 Key: HIVE-8381
 URL: https://issues.apache.org/jira/browse/HIVE-8381
 Project: Hive
  Issue Type: Bug
  Components: Build Infrastructure
Affects Versions: 0.15.0
Reporter: Vikram Dixit K
Assignee: Vikram Dixit K
 Attachments: HIVE-8381.1.patch, HIVE-8381.2.patch


 In view of the branching for 0.14 done, we need to upgrade the version of 
 hive on trunk to 0.15.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HIVE-8381) Update hive version on trunk to 0.15

2014-10-07 Thread Vikram Dixit K (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-8381?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14163114#comment-14163114
 ] 

Vikram Dixit K commented on HIVE-8381:
--

My bad. Partial patch. Version 2 updates all poms.

 Update hive version on trunk to 0.15
 

 Key: HIVE-8381
 URL: https://issues.apache.org/jira/browse/HIVE-8381
 Project: Hive
  Issue Type: Bug
  Components: Build Infrastructure
Affects Versions: 0.15.0
Reporter: Vikram Dixit K
Assignee: Vikram Dixit K
 Attachments: HIVE-8381.1.patch, HIVE-8381.2.patch


 In view of the branching for 0.14 done, we need to upgrade the version of 
 hive on trunk to 0.15.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HIVE-8336) Update pom, now that Optiq is renamed to Calcite

2014-10-06 Thread Vikram Dixit K (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-8336?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14160622#comment-14160622
 ] 

Vikram Dixit K commented on HIVE-8336:
--

+1 for 0.14

 Update pom, now that Optiq is renamed to Calcite
 

 Key: HIVE-8336
 URL: https://issues.apache.org/jira/browse/HIVE-8336
 Project: Hive
  Issue Type: Bug
Reporter: Julian Hyde
Assignee: Gunther Hagleitner
 Fix For: 0.14.0

 Attachments: HIVE-8336.1.patch


 Apache Optiq is in the process of renaming to Apache Calcite. See INFRA-8413 
 and OPTIQ-430.
 There is not yet a snapshot of {groupId: 'org.apache.calcite', artifactId: 
 'calcite-*'} deployed to nexus. When there is, I'll post a patch to pom.xml.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HIVE-8348) Fix Hive to match changes introduced by TEZ-1510

2014-10-06 Thread Vikram Dixit K (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-8348?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14161237#comment-14161237
 ] 

Vikram Dixit K commented on HIVE-8348:
--

+1 for 0.14

 Fix Hive to match changes introduced by TEZ-1510
 

 Key: HIVE-8348
 URL: https://issues.apache.org/jira/browse/HIVE-8348
 Project: Hive
  Issue Type: Bug
  Components: Tez
Affects Versions: 0.14.0
Reporter: Gopal V
Assignee: Gopal V
Priority: Critical
 Fix For: 0.14.0

 Attachments: HIVE-8348.1.patch


 TEZ-1510 changes the way configuration objects are handled in tez, by 
 avoiding polluting the {{new Configuration();}} object within HiveServer2.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HIVE-8372) Potential NPE in Tez MergeFileRecordProcessor

2014-10-06 Thread Vikram Dixit K (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-8372?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14161355#comment-14161355
 ] 

Vikram Dixit K commented on HIVE-8372:
--

Pending tests pass.

 Potential NPE in Tez MergeFileRecordProcessor
 -

 Key: HIVE-8372
 URL: https://issues.apache.org/jira/browse/HIVE-8372
 Project: Hive
  Issue Type: Bug
  Components: Tez
Affects Versions: 0.14.0
Reporter: Prasanth J
Assignee: Prasanth J
 Attachments: HIVE-8372.1.patch


 MergeFileRecordProcessor retrieves map work from cache. This map work can be 
 instance of merge file work. When the merge file work already exists in the 
 cache casting the map work to merge file work is missing which will result in 
 NullPointerException.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HIVE-8372) Potential NPE in Tez MergeFileRecordProcessor

2014-10-06 Thread Vikram Dixit K (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-8372?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14161354#comment-14161354
 ] 

Vikram Dixit K commented on HIVE-8372:
--

+1 LGTM. +1 for 0.14 as well.

 Potential NPE in Tez MergeFileRecordProcessor
 -

 Key: HIVE-8372
 URL: https://issues.apache.org/jira/browse/HIVE-8372
 Project: Hive
  Issue Type: Bug
  Components: Tez
Affects Versions: 0.14.0
Reporter: Prasanth J
Assignee: Prasanth J
 Attachments: HIVE-8372.1.patch


 MergeFileRecordProcessor retrieves map work from cache. This map work can be 
 instance of merge file work. When the merge file work already exists in the 
 cache casting the map work to merge file work is missing which will result in 
 NullPointerException.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HIVE-8364) We're not waiting for all inputs in MapRecordProcessor on Tez

2014-10-06 Thread Vikram Dixit K (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-8364?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vikram Dixit K updated HIVE-8364:
-
Attachment: HIVE-8364.2.patch

Improves on the proposed patch.

 We're not waiting for all inputs in MapRecordProcessor on Tez
 -

 Key: HIVE-8364
 URL: https://issues.apache.org/jira/browse/HIVE-8364
 Project: Hive
  Issue Type: Bug
Reporter: Gunther Hagleitner
Assignee: Vikram Dixit K
 Fix For: 0.14.0

 Attachments: HIVE-8364.1.patch, HIVE-8364.2.patch


 Seems like this could be a race condition: We're blocking for some inputs to 
 become available, but the main MR input is just assumed ready...



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HIVE-8322) VectorReduceSinkOperator: ClassCastException: ~StandardUnionObjectInspector$StandardUnion cannot be cast to ~IntWritable

2014-10-03 Thread Vikram Dixit K (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-8322?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14158232#comment-14158232
 ] 

Vikram Dixit K commented on HIVE-8322:
--

+1 for 0.14

 VectorReduceSinkOperator: ClassCastException: 
 ~StandardUnionObjectInspector$StandardUnion cannot be cast to ~IntWritable
 

 Key: HIVE-8322
 URL: https://issues.apache.org/jira/browse/HIVE-8322
 Project: Hive
  Issue Type: Bug
  Components: Tez, Vectorization
Affects Versions: 0.14.0
Reporter: Matt McCline
Assignee: Matt McCline
Priority: Critical
 Fix For: 0.14.0

 Attachments: HIVE-8322.01.patch, HIVE-8322.02.patch, 
 HIVE-8322.03.patch, HIVE-8322.04.patch


 Some queries with count(distinct(..)) fail now in VectorReduceSinkOperator.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HIVE-7960) Upgrade to Hadoop 2.5

2014-10-03 Thread Vikram Dixit K (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-7960?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14158369#comment-14158369
 ] 

Vikram Dixit K commented on HIVE-7960:
--

+1 for 0.14

 Upgrade to Hadoop 2.5
 -

 Key: HIVE-7960
 URL: https://issues.apache.org/jira/browse/HIVE-7960
 Project: Hive
  Issue Type: Task
Reporter: Brock Noland
Assignee: Gunther Hagleitner
 Attachments: HIVE-7960.1.patch


 Tracking JIRA for upgrading to 2.5



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HIVE-8332) Reading an ACID table with vectorization on results in NPE

2014-10-02 Thread Vikram Dixit K (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-8332?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14156960#comment-14156960
 ] 

Vikram Dixit K commented on HIVE-8332:
--

+1 for 0.14.

 Reading an ACID table with vectorization on results in NPE
 --

 Key: HIVE-8332
 URL: https://issues.apache.org/jira/browse/HIVE-8332
 Project: Hive
  Issue Type: Bug
  Components: Transactions, Vectorization
Affects Versions: 0.14.0
Reporter: Alan Gates
Assignee: Alan Gates
Priority: Critical
 Fix For: 0.14.0

 Attachments: HIVE-8332.patch


 On a transactional table, insert some data, then with vectorization turned on 
 do a select.  The result is:
 {code}
 Caused by: java.lang.NullPointerException at 
 org.apache.hadoop.hive.ql.io.orc.OrcInputFormat$1.getObjectInspector(OrcInputFormat.java:1137)
  at 
 org.apache.hadoop.hive.ql.io.orc.VectorizedOrcAcidRowReader.init(VectorizedOrcAcidRowReader.java:61)
  at 
 org.apache.hadoop.hive.ql.io.orc.OrcInputFormat.getRecordReader(OrcInputFormat.java:1041)
  at 
 org.apache.hadoop.hive.ql.io.HiveInputFormat.getRecordReader(HiveInputFormat.java:246)
   ... 25 more
 {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HIVE-7695) hive stats issue when insert query is appending data into table

2014-10-02 Thread Vikram Dixit K (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-7695?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14157154#comment-14157154
 ] 

Vikram Dixit K commented on HIVE-7695:
--

+1 for 0.14

 hive stats issue when insert query is appending data into table
 ---

 Key: HIVE-7695
 URL: https://issues.apache.org/jira/browse/HIVE-7695
 Project: Hive
  Issue Type: Bug
  Components: Logical Optimizer, Statistics
Affects Versions: 0.13.0, 0.13.1
Reporter: Thejas M Nair
Assignee: Ashutosh Chauhan
 Fix For: 0.15.0

 Attachments: HIVE-7695-test.patch, HIVE-7695.1.patch, 
 HIVE-7695.2.patch, HIVE-7695.3.patch, HIVE-7695.4.patch, tests.patch


 When 'insert' is used to append (not overrite), the stats don't seem to get 
 updated.
 {code}
 set hive.compute.query.using.stats=true;
  set hive.stats.autogather=true; 
 //insert 4 rows
 insert into table t2 select * from t1;
 select count(*) from t2;
 4
 //insert 4 again (appending instead of overrite)
 insert into table t2 select * from t1;
 select count(*) from t2;
 4
 select * from t2;
 1
 2
 3
 4
 1
 2
 3
 4
 {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HIVE-8318) Null Scan optimizer throws exception when no partitions are selected

2014-10-02 Thread Vikram Dixit K (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-8318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14157157#comment-14157157
 ] 

Vikram Dixit K commented on HIVE-8318:
--

+1 for 0.14

 Null Scan optimizer throws exception when no partitions are selected
 

 Key: HIVE-8318
 URL: https://issues.apache.org/jira/browse/HIVE-8318
 Project: Hive
  Issue Type: Bug
  Components: Physical Optimizer
Affects Versions: 0.14.0
Reporter: Mostafa Mokhtar
Assignee: Ashutosh Chauhan
Priority: Critical
 Fix For: 0.15.0

 Attachments: HIVE-8318.patch


 Run query that involves a join and constant folding that is always false 
 {code}
 select  s_store_id as store_id,
   sum(ss_ext_sales_price) as sales
   from store_sales, 
  date_dim,
  store
  where ss_sold_date_sk = d_date_sk
and d_date between cast('1998-08-04' as date) 
   and (cast('1998-09-04' as date))
and ss_store_sk = s_store_sk and 1 = 2
  group by s_store_id
 {code}
 If vectorization is enable we hit this exception 
 {code}
 , TaskAttempt 3 failed, info=[Error: Failure while running 
 task:java.lang.RuntimeException: java.lang.RuntimeException: 
 org.apache.hadoop.hive.ql.metadata.HiveException: Hive Runtime Error while 
 processing row
   at 
 org.apache.hadoop.hive.ql.exec.tez.TezProcessor.initializeAndRunProcessor(TezProcessor.java:187)
   at 
 org.apache.hadoop.hive.ql.exec.tez.TezProcessor.run(TezProcessor.java:142)
   at 
 org.apache.tez.runtime.LogicalIOProcessorRuntimeTask.run(LogicalIOProcessorRuntimeTask.java:324)
   at 
 org.apache.tez.runtime.task.TezTaskRunner$TaskRunnerCallable$1.run(TezTaskRunner.java:180)
   at 
 org.apache.tez.runtime.task.TezTaskRunner$TaskRunnerCallable$1.run(TezTaskRunner.java:172)
   at java.security.AccessController.doPrivileged(Native Method)
   at javax.security.auth.Subject.doAs(Subject.java:415)
   at 
 org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1548)
   at 
 org.apache.tez.runtime.task.TezTaskRunner$TaskRunnerCallable.call(TezTaskRunner.java:172)
   at 
 org.apache.tez.runtime.task.TezTaskRunner$TaskRunnerCallable.call(TezTaskRunner.java:167)
   at java.util.concurrent.FutureTask.run(FutureTask.java:262)
   at 
 java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
   at 
 java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
   at java.lang.Thread.run(Thread.java:744)
 Caused by: java.lang.RuntimeException: 
 org.apache.hadoop.hive.ql.metadata.HiveException: Hive Runtime Error while 
 processing row
   at 
 org.apache.hadoop.hive.ql.exec.tez.MapRecordSource.processRow(MapRecordSource.java:93)
   at 
 org.apache.hadoop.hive.ql.exec.tez.MapRecordSource.pushRecord(MapRecordSource.java:70)
   at 
 org.apache.hadoop.hive.ql.exec.tez.MapRecordProcessor.run(MapRecordProcessor.java:272)
   at 
 org.apache.hadoop.hive.ql.exec.tez.TezProcessor.initializeAndRunProcessor(TezProcessor.java:164)
   ... 13 more
 Caused by: org.apache.hadoop.hive.ql.metadata.HiveException: Hive Runtime 
 Error while processing row
   at 
 org.apache.hadoop.hive.ql.exec.vector.VectorMapOperator.process(VectorMapOperator.java:45)
   at 
 org.apache.hadoop.hive.ql.exec.tez.MapRecordSource.processRow(MapRecordSource.java:85)
   ... 16 more
 Caused by: java.lang.ClassCastException: org.apache.hadoop.io.NullWritable 
 cannot be cast to org.apache.hadoop.hive.ql.exec.vector.VectorizedRowBatch
   at 
 org.apache.hadoop.hive.ql.exec.vector.VectorFilterOperator.processOp(VectorFilterOperator.java:86)
   at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:801)
   at 
 org.apache.hadoop.hive.ql.exec.TableScanOperator.processOp(TableScanOperator.java:95)
   at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:801)
   at 
 org.apache.hadoop.hive.ql.exec.vector.VectorMapOperator.process(VectorMapOperator.java:43)
   ... 17 more
 {code}
 If vectorization is disabled we hit this exception
 {code}
 Caused by: java.lang.RuntimeException: cannot find field ss_store_sk from 
 [org.apache.hadoop.hive.serde2.objectinspector.UnionStructObjectInspector$MyField@52c64dde]
   at 
 org.apache.hadoop.hive.serde2.objectinspector.ObjectInspectorUtils.getStandardStructFieldRef(ObjectInspectorUtils.java:410)
   at 
 org.apache.hadoop.hive.serde2.objectinspector.UnionStructObjectInspector.getStructFieldRef(UnionStructObjectInspector.java:112)
   at 
 org.apache.hadoop.hive.ql.exec.ExprNodeColumnEvaluator.initialize(ExprNodeColumnEvaluator.java:55)
   at 
 org.apache.hadoop.hive.ql.exec.Operator.initEvaluators(Operator.java:940)
   at 

[jira] [Commented] (HIVE-8240) VectorColumnAssignFactory throws Incompatible Bytes vector column and primitive category VARCHAR

2014-10-02 Thread Vikram Dixit K (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-8240?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14157331#comment-14157331
 ] 

Vikram Dixit K commented on HIVE-8240:
--

+1 for 0.14

 VectorColumnAssignFactory throws Incompatible Bytes vector column and 
 primitive category VARCHAR
 --

 Key: HIVE-8240
 URL: https://issues.apache.org/jira/browse/HIVE-8240
 Project: Hive
  Issue Type: Bug
  Components: Vectorization
Affects Versions: 0.14.0
Reporter: Matt McCline
Assignee: Matt McCline
Priority: Critical
 Fix For: 0.14.0

 Attachments: HIVE-8240.01.patch, HIVE-8240.02.patch, 
 HIVE-8240.04.patch, HIVE-8240.05.patch, HIVE-8240.06.patch, HIVE-8240.07.patch






--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


<    1   2   3   4   5   6   7   8   9   10   >