[jira] [Assigned] (HIVE-7541) Support union all on Spark
[ https://issues.apache.org/jira/browse/HIVE-7541?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Na Yang reassigned HIVE-7541: - Assignee: Na Yang Support union all on Spark -- Key: HIVE-7541 URL: https://issues.apache.org/jira/browse/HIVE-7541 Project: Hive Issue Type: Sub-task Components: Spark Reporter: Xuefu Zhang Assignee: Na Yang For the union all operator, we will use Spark's union transformation. Refer to the design doc on the wiki for more information. -- This message was sent by Atlassian JIRA (v6.2#6252)
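For readers unfamiliar with the transformation mentioned above, here is a minimal, self-contained sketch of Spark's union in the Java API (the class and variable names are illustrative, not Hive's actual plan code):
{code:java}
import java.util.Arrays;
import org.apache.spark.api.java.JavaRDD;
import org.apache.spark.api.java.JavaSparkContext;

public class UnionAllSketch {
  public static void main(String[] args) {
    JavaSparkContext sc = new JavaSparkContext("local", "union-all-sketch");
    // Two inputs standing in for the two branches of a UNION ALL query.
    JavaRDD<String> left = sc.parallelize(Arrays.asList("a", "b"));
    JavaRDD<String> right = sc.parallelize(Arrays.asList("b", "c"));
    // union() concatenates partitions and keeps duplicates, which matches
    // UNION ALL semantics; a plain SQL UNION would need an extra distinct step.
    JavaRDD<String> unioned = left.union(right);
    System.out.println(unioned.collect()); // [a, b, b, c]
    sc.stop();
  }
}
{code}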
[jira] [Commented] (HIVE-7436) Load Spark configuration into Hive driver
[ https://issues.apache.org/jira/browse/HIVE-7436?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14078962#comment-14078962 ] Lefty Leverenz commented on HIVE-7436: -- bq. Is HADOOP_CLASSPATH documented anywhere for Hive? Grepping the Hive wiki reveals three docs that mention HADOOP_CLASSPATH, but none for Hive: * [HCatalog InputOutput -- Running MapReduce with HCatalog (see first example) | https://cwiki.apache.org/confluence/display/Hive/HCatalog+InputOutput#HCatalogInputOutput-RunningMapReducewithHCatalog] * [Install WebHCat -- Hadoop Distributed Cache (see templeton.override.jars, which is the last config in the section) | https://cwiki.apache.org/confluence/display/Hive/WebHCat+InstallWebHCat#WebHCatInstallWebHCat-HadoopDistributedCache] * [WebHCat Configuration -- Configuration Variables (see templeton.override.jars, which is 5th in the table) | https://cwiki.apache.org/confluence/display/Hive/WebHCat+Configure#WebHCatConfigure-ConfigurationVariables] Load Spark configuration into Hive driver - Key: HIVE-7436 URL: https://issues.apache.org/jira/browse/HIVE-7436 Project: Hive Issue Type: Sub-task Components: Spark Reporter: Chengxiang Li Assignee: Chengxiang Li Fix For: spark-branch Attachments: HIVE-7436-Spark.1.patch, HIVE-7436-Spark.2.patch, HIVE-7436-Spark.3.patch Load Spark configuration into the Hive driver. There are 3 ways to set up Spark configurations: # Java properties. # Properties in the Spark configuration file (spark-defaults.conf). # The Hive configuration file (hive-site.xml). Sources later in this list take priority and overwrite earlier ones for the same property name. Please refer to [http://spark.apache.org/docs/latest/configuration.html] for all configurable properties of Spark. You can configure Spark in Hive in the following ways: # Through the Spark configuration file. #* Create spark-defaults.conf and place it in the /etc/spark/conf configuration directory, with properties in Java properties format. #* Set the $SPARK_CONF_DIR environment variable to the location of spark-defaults.conf: export SPARK_CONF_DIR=/etc/spark/conf #* Add $SPARK_CONF_DIR to the $HADOOP_CLASSPATH environment variable: export HADOOP_CLASSPATH=$SPARK_CONF_DIR:$HADOOP_CLASSPATH # Through the Hive configuration file. #* Edit hive-site.xml in the Hive conf directory and set the same properties in XML format. Hive driver default Spark properties: ||name||default value||description|| |spark.master|local|Spark master URL.| |spark.app.name|Hive on Spark|Default Spark application name.| NO PRECOMMIT TESTS. This is for spark-branch only. -- This message was sent by Atlassian JIRA (v6.2#6252)
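As a concrete illustration of the hive-site.xml option described above, the entries might look like the following sketch. The two property names come from the defaults table in the issue; any other spark.* property from the Spark configuration page could be set the same way.
{code:xml}
<!-- Illustrative hive-site.xml entries for Hive on Spark. -->
<property>
  <name>spark.master</name>
  <value>local</value>
</property>
<property>
  <name>spark.app.name</name>
  <value>Hive on Spark</value>
</property>
{code}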
[jira] [Commented] (HIVE-7497) Fix some default values in HiveConf
[ https://issues.apache.org/jira/browse/HIVE-7497?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14078973#comment-14078973 ] Lefty Leverenz commented on HIVE-7497: -- Good, that makes sense. Thanks [~dongc]. (I'd fix your env smiley but it's fun -- let the parenthesis remain open.) Fix some default values in HiveConf --- Key: HIVE-7497 URL: https://issues.apache.org/jira/browse/HIVE-7497 Project: Hive Issue Type: Task Reporter: Brock Noland Assignee: Dong Chen Labels: TODOC14 Fix For: 0.14.0 Attachments: HIVE-7497.1.patch, HIVE-7497.patch HIVE-5160 resolves an env variable at runtime by calling System.getenv(). If the variable is not defined when you run the build, null is returned and the path is not placed in hive-default.template. However, if it is defined, it will populate hive-default.template with a path that differs based on the user running the build. We should use $\{system:HIVE_CONF_DIR\} instead. -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Commented] (HIVE-7029) Vectorize ReduceWork
[ https://issues.apache.org/jira/browse/HIVE-7029?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14078977#comment-14078977 ] Hive QA commented on HIVE-7029: --- {color:red}Overall{color}: -1 at least one tests failed Here are the results of testing the latest attachment: https://issues.apache.org/jira/secure/attachment/12658529/HIVE-7029.7.patch {color:red}ERROR:{color} -1 due to 2 failed/errored test(s), 5835 tests executed *Failed tests:* {noformat} org.apache.hadoop.hive.cli.TestMiniTezCliDriver.testCliDriver_dynpart_sort_opt_vectorization org.apache.hadoop.hive.cli.TestMinimrCliDriver.testCliDriver_ql_rewrite_gbtoidx {noformat} Test results: http://ec2-174-129-184-35.compute-1.amazonaws.com/jenkins/job/PreCommit-HIVE-TRUNK-Build/97/testReport Console output: http://ec2-174-129-184-35.compute-1.amazonaws.com/jenkins/job/PreCommit-HIVE-TRUNK-Build/97/console Test logs: http://ec2-174-129-184-35.compute-1.amazonaws.com/logs/PreCommit-HIVE-TRUNK-Build-97/ Messages: {noformat} Executing org.apache.hive.ptest.execution.PrepPhase Executing org.apache.hive.ptest.execution.ExecutionPhase Executing org.apache.hive.ptest.execution.ReportingPhase Tests exited with: TestsFailedException: 2 tests failed {noformat} This message is automatically generated. ATTACHMENT ID: 12658529 Vectorize ReduceWork Key: HIVE-7029 URL: https://issues.apache.org/jira/browse/HIVE-7029 Project: Hive Issue Type: Sub-task Reporter: Matt McCline Assignee: Matt McCline Attachments: HIVE-7029.1.patch, HIVE-7029.2.patch, HIVE-7029.3.patch, HIVE-7029.4.patch, HIVE-7029.5.patch, HIVE-7029.6.patch, HIVE-7029.7.patch This will enable vectorization team to independently work on vectorization on reduce side even before vectorized shuffle is ready. NOTE: Tez only (i.e. TezTask only) -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Created] (HIVE-7553) avoid the scheduling maintenance window for every jar change
Ferdinand Xu created HIVE-7553: -- Summary: avoid the scheduling maintenance window for every jar change Key: HIVE-7553 URL: https://issues.apache.org/jira/browse/HIVE-7553 Project: Hive Issue Type: Bug Components: HiveServer2 Reporter: Ferdinand Xu Assignee: Ferdinand Xu When a user needs to refresh an existing jar or add a new one to HS2, HS2 must be restarted. As HS2 is a service exposed to clients, this requires scheduling a maintenance window for every jar change. It would be great if we could avoid that. -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Commented] (HIVE-7436) Load Spark configuration into Hive driver
[ https://issues.apache.org/jira/browse/HIVE-7436?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14079007#comment-14079007 ] Chengxiang Li commented on HIVE-7436: - [~xuefuz] HADOOP_CONF_DIR is added to HADOOP_CLASSPATH in hadoop-config.sh, as is HIVE_CONF_DIR in hive-config.sh. If we only load the Spark configuration file from the classpath, there are 2 choices: # Export SPARK_CONF_DIR and add it to HADOOP_CLASSPATH manually. # Commit a patch which adds SPARK_CONF_DIR to HADOOP_CLASSPATH in the Hive scripts (such as hive-config.sh), then export SPARK_CONF_DIR. My concern about supporting loading the Spark configuration file from SPARK_CONF_DIR at the implementation level is: # HADOOP/HIVE/HIVE on TEZ actually only load configuration files from the classpath. # It may introduce more complexity, e.g. what should we do if different Spark configuration files are available in both SPARK_CONF_DIR and HADOOP_CLASSPATH? The way Hive on Tez is configured is similar to the current Hive on Spark approach. [Hive on Tez Configuration|http://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.1.2/bk_installing_manually_book/content/rpm-chap-tez_configure_tez.html] Load Spark configuration into Hive driver - Key: HIVE-7436 URL: https://issues.apache.org/jira/browse/HIVE-7436 Project: Hive Issue Type: Sub-task Components: Spark Reporter: Chengxiang Li Assignee: Chengxiang Li Fix For: spark-branch Attachments: HIVE-7436-Spark.1.patch, HIVE-7436-Spark.2.patch, HIVE-7436-Spark.3.patch Load Spark configuration into the Hive driver. There are 3 ways to set up Spark configurations: # Java properties. # Properties in the Spark configuration file (spark-defaults.conf). # The Hive configuration file (hive-site.xml). Sources later in this list take priority and overwrite earlier ones for the same property name. Please refer to [http://spark.apache.org/docs/latest/configuration.html] for all configurable properties of Spark. You can configure Spark in Hive in the following ways: # Through the Spark configuration file. #* Create spark-defaults.conf and place it in the /etc/spark/conf configuration directory, with properties in Java properties format. #* Set the $SPARK_CONF_DIR environment variable to the location of spark-defaults.conf: export SPARK_CONF_DIR=/etc/spark/conf #* Add $SPARK_CONF_DIR to the $HADOOP_CLASSPATH environment variable: export HADOOP_CLASSPATH=$SPARK_CONF_DIR:$HADOOP_CLASSPATH # Through the Hive configuration file. #* Edit hive-site.xml in the Hive conf directory and set the same properties in XML format. Hive driver default Spark properties: ||name||default value||description|| |spark.master|local|Spark master URL.| |spark.app.name|Hive on Spark|Default Spark application name.| NO PRECOMMIT TESTS. This is for spark-branch only. -- This message was sent by Atlassian JIRA (v6.2#6252)
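To make the classpath-based loading being discussed concrete, the sketch below shows how a driver process could pick up spark-defaults.conf once its directory (e.g. $SPARK_CONF_DIR) is on the classpath. This illustrates the mechanism only; the class name is hypothetical and this is not the actual patch:
{code:java}
import java.io.IOException;
import java.io.InputStream;
import java.util.Properties;

public class SparkConfFromClasspath {
  public static Properties load() throws IOException {
    Properties props = new Properties();
    // The file is found because its directory is on the classpath,
    // mirroring how Hadoop/Hive locate their *-site.xml files.
    try (InputStream in = Thread.currentThread().getContextClassLoader()
        .getResourceAsStream("spark-defaults.conf")) {
      if (in != null) {
        props.load(in); // spark-defaults.conf uses Java properties format
      }
    }
    return props;
  }
}
{code}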
[jira] [Commented] (HIVE-7519) Refactor QTestUtil to remove its duplication with QFileClient for qtest setup and teardown
[ https://issues.apache.org/jira/browse/HIVE-7519?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14079025#comment-14079025 ] Hive QA commented on HIVE-7519: --- {color:red}Overall{color}: -1 at least one tests failed Here are the results of testing the latest attachment: https://issues.apache.org/jira/secure/attachment/12658531/HIVE-7519.1.patch {color:red}ERROR:{color} -1 due to 31 failed/errored test(s), 5838 tests executed *Failed tests:* {noformat} org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_authorization_role_grant2 org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_dynpart_sort_opt_vectorization org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_print_header org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_sample2 org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_sample4 org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_sample5 org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_sample6 org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_sample7 org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_sample9 org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_stats9 org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_vector_non_string_partition org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_vectorization_part org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_vectorization_part_project org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_vectorized_bucketmapjoin1 org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_vectorized_context org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_vectorized_parquet org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_vectorized_timestamp_funcs org.apache.hadoop.hive.cli.TestMiniTezCliDriver.testCliDriver_dynpart_sort_optimization org.apache.hadoop.hive.cli.TestMinimrCliDriver.testCliDriver_ql_rewrite_gbtoidx org.apache.hadoop.hive.ql.parse.TestParse.testParse_case_sensitivity org.apache.hadoop.hive.ql.parse.TestParse.testParse_input5 org.apache.hadoop.hive.ql.parse.TestParse.testParse_input_testsequencefile org.apache.hadoop.hive.ql.parse.TestParse.testParse_input_testxpath org.apache.hadoop.hive.ql.parse.TestParse.testParse_input_testxpath2 org.apache.hadoop.hive.ql.parse.TestParse.testParse_sample2 org.apache.hadoop.hive.ql.parse.TestParse.testParse_sample3 org.apache.hadoop.hive.ql.parse.TestParse.testParse_sample4 org.apache.hadoop.hive.ql.parse.TestParse.testParse_sample5 org.apache.hadoop.hive.ql.parse.TestParse.testParse_sample6 org.apache.hadoop.hive.ql.parse.TestParse.testParse_sample7 org.apache.hive.hcatalog.pig.TestOrcHCatLoader.testReadDataPrimitiveTypes {noformat} Test results: http://ec2-174-129-184-35.compute-1.amazonaws.com/jenkins/job/PreCommit-HIVE-TRUNK-Build/98/testReport Console output: http://ec2-174-129-184-35.compute-1.amazonaws.com/jenkins/job/PreCommit-HIVE-TRUNK-Build/98/console Test logs: http://ec2-174-129-184-35.compute-1.amazonaws.com/logs/PreCommit-HIVE-TRUNK-Build-98/ Messages: {noformat} Executing org.apache.hive.ptest.execution.PrepPhase Executing org.apache.hive.ptest.execution.ExecutionPhase Executing org.apache.hive.ptest.execution.ReportingPhase Tests exited with: TestsFailedException: 31 tests failed {noformat} This message is automatically generated. 
ATTACHMENT ID: 12658531 Refactor QTestUtil to remove its duplication with QFileClient for qtest setup and teardown --- Key: HIVE-7519 URL: https://issues.apache.org/jira/browse/HIVE-7519 Project: Hive Issue Type: Improvement Reporter: Ashish Kumar Singh Assignee: Ashish Kumar Singh Attachments: HIVE-7519.1.patch, HIVE-7519.patch QTestUtil hard codes creation and dropping of source tables for qtests. QFileClient does the same thing but in a better way, uses q_test_init.sql and q_test_cleanup.sql scripts. As QTestUtil is growing quite large it makes sense to refactor it to use QFileClient's approach. This will also remove duplication of code addressing same purpose. -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Updated] (HIVE-7544) Changes related to TEZ-1288 (FastTezSerialization)
[ https://issues.apache.org/jira/browse/HIVE-7544?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rajesh Balamohan updated HIVE-7544: --- Attachment: HIVE-7544.1.patch Changes related to TEZ-1288 (FastTezSerialization) -- Key: HIVE-7544 URL: https://issues.apache.org/jira/browse/HIVE-7544 Project: Hive Issue Type: Sub-task Components: Tez Affects Versions: 0.14.0 Reporter: Rajesh Balamohan Assignee: Rajesh Balamohan Attachments: HIVE-7544.1.patch Add ability to make use of TezBytesWritableSerialization. -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Assigned] (HIVE-4934) ntile function has to be the last thing in the select list
[ https://issues.apache.org/jira/browse/HIVE-4934?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lars Francke reassigned HIVE-4934: -- Assignee: Lars Francke ntile function has to be the last thing in the select list -- Key: HIVE-4934 URL: https://issues.apache.org/jira/browse/HIVE-4934 Project: Hive Issue Type: Bug Reporter: Lars Francke Assignee: Lars Francke Priority: Minor {code} CREATE TABLE test (foo INT); SELECT ntile(10), foo OVER (PARTITION BY foo) FROM test; FAILED: SemanticException org.apache.hadoop.hive.ql.metadata.HiveException: Only COMPLETE mode supported for NTile function SELECT foo, ntile(10) OVER (PARTITION BY foo) FROM test; ...works... {code} I'm not sure if that is a bug or necessary. Either way the error message is not helpful as it's not documented anywhere what {{COMPLETE}} mode is. A cursory glance at the code didn't help me either. -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Resolved] (HIVE-4934) ntile function has to be the last thing in the select list
[ https://issues.apache.org/jira/browse/HIVE-4934?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lars Francke resolved HIVE-4934. Resolution: Fixed This was a misunderstanding on my part. I'll add a sentence to the documentation to clear this up for others. ntile function has to be the last thing in the select list -- Key: HIVE-4934 URL: https://issues.apache.org/jira/browse/HIVE-4934 Project: Hive Issue Type: Bug Reporter: Lars Francke Assignee: Lars Francke Priority: Minor {code} CREATE TABLE test (foo INT); SELECT ntile(10), foo OVER (PARTITION BY foo) FROM test; FAILED: SemanticException org.apache.hadoop.hive.ql.metadata.HiveException: Only COMPLETE mode supported for NTile function SELECT foo, ntile(10) OVER (PARTITION BY foo) FROM test; ...works... {code} I'm not sure if that is a bug or necessary. Either way the error message is not helpful as it's not documented anywhere what {{COMPLETE}} mode is. A cursory glance at the code didn't help me either. -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Updated] (HIVE-4934) Improve documentation of OVER clause
[ https://issues.apache.org/jira/browse/HIVE-4934?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lars Francke updated HIVE-4934: --- Description: {code} CREATE TABLE test (foo INT); SELECT ntile(10), foo OVER (PARTITION BY foo) FROM test; FAILED: SemanticException org.apache.hadoop.hive.ql.metadata.HiveException: Only COMPLETE mode supported for NTile function SELECT foo, ntile(10) OVER (PARTITION BY foo) FROM test; ...works... {code} I'm not sure if that is a bug or necessary. Either way the error message is not helpful as it's not documented anywhere what {{COMPLETE}} mode is. A cursory glance at the code didn't help me either. Edit: It is not a bug, it wasn't clear to me that the OVER clause only applies to the directly preceding function. was: {code} CREATE TABLE test (foo INT); SELECT ntile(10), foo OVER (PARTITION BY foo) FROM test; FAILED: SemanticException org.apache.hadoop.hive.ql.metadata.HiveException: Only COMPLETE mode supported for NTile function SELECT foo, ntile(10) OVER (PARTITION BY foo) FROM test; ...works... {code} I'm not sure if that is a bug or necessary. Either way the error message is not helpful as it's not documented anywhere what {{COMPLETE}} mode is. A cursory glance at the code didn't help me either. Improve documentation of OVER clause Key: HIVE-4934 URL: https://issues.apache.org/jira/browse/HIVE-4934 Project: Hive Issue Type: Bug Reporter: Lars Francke Assignee: Lars Francke Priority: Minor {code} CREATE TABLE test (foo INT); SELECT ntile(10), foo OVER (PARTITION BY foo) FROM test; FAILED: SemanticException org.apache.hadoop.hive.ql.metadata.HiveException: Only COMPLETE mode supported for NTile function SELECT foo, ntile(10) OVER (PARTITION BY foo) FROM test; ...works... {code} I'm not sure if that is a bug or necessary. Either way the error message is not helpful as it's not documented anywhere what {{COMPLETE}} mode is. A cursory glance at the code didn't help me either. Edit: It is not a bug, it wasn't clear to me that the OVER clause only applies to the directly preceding function. -- This message was sent by Atlassian JIRA (v6.2#6252)
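To spell out the rule added to the description above: an OVER clause binds only to the function immediately preceding it, so each windowed expression needs its own OVER clause. A sketch reusing the test table from the issue (the rank() column is an illustrative addition):
{code:sql}
SELECT foo,
       ntile(10) OVER (PARTITION BY foo) AS n,
       rank() OVER (PARTITION BY foo ORDER BY foo) AS r
FROM test;
{code}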
[jira] [Updated] (HIVE-4934) Improve documentation of OVER clause
[ https://issues.apache.org/jira/browse/HIVE-4934?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lars Francke updated HIVE-4934: --- Summary: Improve documentation of OVER clause (was: ntile function has to be the last thing in the select list) Improve documentation of OVER clause Key: HIVE-4934 URL: https://issues.apache.org/jira/browse/HIVE-4934 Project: Hive Issue Type: Bug Reporter: Lars Francke Assignee: Lars Francke Priority: Minor {code} CREATE TABLE test (foo INT); SELECT ntile(10), foo OVER (PARTITION BY foo) FROM test; FAILED: SemanticException org.apache.hadoop.hive.ql.metadata.HiveException: Only COMPLETE mode supported for NTile function SELECT foo, ntile(10) OVER (PARTITION BY foo) FROM test; ...works... {code} I'm not sure if that is a bug or necessary. Either way the error message is not helpful as it's not documented anywhere what {{COMPLETE}} mode is. A cursory glance at the code didn't help me either. -- This message was sent by Atlassian JIRA (v6.2#6252)
Re: Review Request 23799: HIVE-7390: refactor csv output format with in RFC mode and add one more option to support formatting as the csv format in hive cli
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/23799/ --- (Updated July 30, 2014, 8:30 a.m.) Review request for hive. Changes --- 1. use hadoop.io.utils to close stream 2. change integrated test due to code changes 3. add quotedCsv format instead of option according to discussion 4. add one constructor parameter to specify the status of quoted Bugs: HIVE-7390 https://issues.apache.org/jira/browse/HIVE-7390 Repository: hive-git Description --- HIVE-7390: refactor csv output format with in RFC mode and add one more option to support formatting as the csv format in hive cli Diffs (updated) - beeline/pom.xml 6ec1d1aff3f35c097aa6054aae84faf2d63854f1 beeline/src/java/org/apache/hive/beeline/BeeLine.java 528a98e29c23421f9352bdf7c5edd3a9fae0e3ea beeline/src/java/org/apache/hive/beeline/SeparatedValuesOutputFormat.java 7853c3f38f3c3fb9ae0b9939c714f1dc940ba053 beeline/src/main/resources/BeeLine.properties 390d062b8dc52dfa790c7351f3db44c1e0dd7e37 itests/hive-unit/src/test/java/org/apache/hive/beeline/TestBeeLineWithArgs.java bd97aff5959fd9040fc0f0a1f6b782f2aa6f pom.xml b5a5697e6a3b689c2b244ba0338be541261eaa3d Diff: https://reviews.apache.org/r/23799/diff/ Testing --- Thanks, cheng xu
[jira] [Commented] (HIVE-7432) Remove deprecated Avro's Schema.parse usages
[ https://issues.apache.org/jira/browse/HIVE-7432?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14079067#comment-14079067 ] Hive QA commented on HIVE-7432: --- {color:red}Overall{color}: -1 at least one tests failed Here are the results of testing the latest attachment: https://issues.apache.org/jira/secure/attachment/12658545/HIVE-7432.patch {color:red}ERROR:{color} -1 due to 66 failed/errored test(s), 5838 tests executed *Failed tests:* {noformat} org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_avro_change_schema org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_avro_decimal org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_avro_decimal_native org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_avro_evolved_schemas org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_avro_joins org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_avro_joins_native org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_avro_native org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_avro_nullable_fields org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_avro_partitioned org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_avro_partitioned_native org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_avro_sanity_test org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_avro_schema_evolution_native org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_avro_schema_literal org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_index_serde org.apache.hadoop.hive.cli.TestMiniTezCliDriver.testCliDriver_dynpart_sort_optimization org.apache.hadoop.hive.cli.TestMinimrCliDriver.testCliDriver_ql_rewrite_gbtoidx org.apache.hadoop.hive.serde2.avro.TestAvroDeserializer.canDeserializeArrays org.apache.hadoop.hive.serde2.avro.TestAvroDeserializer.canDeserializeBytes org.apache.hadoop.hive.serde2.avro.TestAvroDeserializer.canDeserializeEnums org.apache.hadoop.hive.serde2.avro.TestAvroDeserializer.canDeserializeFixed org.apache.hadoop.hive.serde2.avro.TestAvroDeserializer.canDeserializeMapWithNullablePrimitiveValues org.apache.hadoop.hive.serde2.avro.TestAvroDeserializer.canDeserializeMapsWithPrimitiveKeys org.apache.hadoop.hive.serde2.avro.TestAvroDeserializer.canDeserializeNullableEnums org.apache.hadoop.hive.serde2.avro.TestAvroDeserializer.canDeserializeNullableTypes org.apache.hadoop.hive.serde2.avro.TestAvroDeserializer.canDeserializeRecords org.apache.hadoop.hive.serde2.avro.TestAvroDeserializer.canDeserializeUnions org.apache.hadoop.hive.serde2.avro.TestAvroDeserializer.canDeserializeVoidType org.apache.hadoop.hive.serde2.avro.TestAvroDeserializer.verifyCaching org.apache.hadoop.hive.serde2.avro.TestAvroObjectInspectorGenerator.convertsNullableEnum org.apache.hadoop.hive.serde2.avro.TestAvroObjectInspectorGenerator.objectInspectorsAreCached org.apache.hadoop.hive.serde2.avro.TestAvroSerde.initializeDoesNotReuseSchemasFromConf org.apache.hadoop.hive.serde2.avro.TestAvroSerdeUtils.determineSchemaCanReadSchemaFromHDFS org.apache.hadoop.hive.serde2.avro.TestAvroSerdeUtils.getTypeFromNullableTypePositiveCase org.apache.hadoop.hive.serde2.avro.TestAvroSerdeUtils.isNullableTypeAcceptsNullableUnions org.apache.hadoop.hive.serde2.avro.TestAvroSerdeUtils.noneOptionWorksForSpecifyingSchemas org.apache.hadoop.hive.serde2.avro.TestAvroSerializer.canSerializeArraysWithNullableComplexElements org.apache.hadoop.hive.serde2.avro.TestAvroSerializer.canSerializeArraysWithNullablePrimitiveElements 
org.apache.hadoop.hive.serde2.avro.TestAvroSerializer.canSerializeBooleans org.apache.hadoop.hive.serde2.avro.TestAvroSerializer.canSerializeBytes org.apache.hadoop.hive.serde2.avro.TestAvroSerializer.canSerializeDecimals org.apache.hadoop.hive.serde2.avro.TestAvroSerializer.canSerializeDoubles org.apache.hadoop.hive.serde2.avro.TestAvroSerializer.canSerializeEnums org.apache.hadoop.hive.serde2.avro.TestAvroSerializer.canSerializeFixed org.apache.hadoop.hive.serde2.avro.TestAvroSerializer.canSerializeFloats org.apache.hadoop.hive.serde2.avro.TestAvroSerializer.canSerializeInts org.apache.hadoop.hive.serde2.avro.TestAvroSerializer.canSerializeListOfDecimals org.apache.hadoop.hive.serde2.avro.TestAvroSerializer.canSerializeLists org.apache.hadoop.hive.serde2.avro.TestAvroSerializer.canSerializeMapOfDecimals org.apache.hadoop.hive.serde2.avro.TestAvroSerializer.canSerializeMaps org.apache.hadoop.hive.serde2.avro.TestAvroSerializer.canSerializeMapsWithNullableComplexValues org.apache.hadoop.hive.serde2.avro.TestAvroSerializer.canSerializeMapsWithNullablePrimitiveValues org.apache.hadoop.hive.serde2.avro.TestAvroSerializer.canSerializeNullableBytes org.apache.hadoop.hive.serde2.avro.TestAvroSerializer.canSerializeNullableDecimals org.apache.hadoop.hive.serde2.avro.TestAvroSerializer.canSerializeNullableEnums org.apache.hadoop.hive.serde2.avro.TestAvroSerializer.canSerializeNullableFixed
[jira] [Commented] (HIVE-7509) Fast stripe level merging for ORC
[ https://issues.apache.org/jira/browse/HIVE-7509?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14079068#comment-14079068 ] Hive QA commented on HIVE-7509: --- {color:red}Overall{color}: -1 no tests executed Here are the results of testing the latest attachment: https://issues.apache.org/jira/secure/attachment/12658568/HIVE-7509.4.patch Test results: http://ec2-174-129-184-35.compute-1.amazonaws.com/jenkins/job/PreCommit-HIVE-TRUNK-Build/100/testReport Console output: http://ec2-174-129-184-35.compute-1.amazonaws.com/jenkins/job/PreCommit-HIVE-TRUNK-Build/100/console Test logs: http://ec2-174-129-184-35.compute-1.amazonaws.com/logs/PreCommit-HIVE-TRUNK-Build-100/ Messages: {noformat} Executing org.apache.hive.ptest.execution.PrepPhase Tests exited with: NonZeroExitCodeException Command 'bash /data/hive-ptest/working/scratch/source-prep.sh' failed with exit status 1 and output '+ [[ -n /usr/java/jdk1.7.0_45-cloudera ]] + export JAVA_HOME=/usr/java/jdk1.7.0_45-cloudera + JAVA_HOME=/usr/java/jdk1.7.0_45-cloudera + export PATH=/usr/java/jdk1.7.0_45-cloudera/bin/:/usr/java/jdk1.6.0_34/bin:/usr/local/apache-maven-3.0.5/bin:/usr/local/apache-maven-3.0.5/bin:/usr/java/jdk1.6.0_34/bin:/usr/local/apache-ant-1.9.1/bin:/usr/local/bin:/bin:/usr/bin:/usr/local/sbin:/usr/sbin:/sbin:/home/hiveptest/bin + PATH=/usr/java/jdk1.7.0_45-cloudera/bin/:/usr/java/jdk1.6.0_34/bin:/usr/local/apache-maven-3.0.5/bin:/usr/local/apache-maven-3.0.5/bin:/usr/java/jdk1.6.0_34/bin:/usr/local/apache-ant-1.9.1/bin:/usr/local/bin:/bin:/usr/bin:/usr/local/sbin:/usr/sbin:/sbin:/home/hiveptest/bin + export 'ANT_OPTS=-Xmx1g -XX:MaxPermSize=256m ' + ANT_OPTS='-Xmx1g -XX:MaxPermSize=256m ' + export 'M2_OPTS=-Xmx1g -XX:MaxPermSize=256m -Dhttp.proxyHost=localhost -Dhttp.proxyPort=3128' + M2_OPTS='-Xmx1g -XX:MaxPermSize=256m -Dhttp.proxyHost=localhost -Dhttp.proxyPort=3128' + cd /data/hive-ptest/working/ + tee /data/hive-ptest/logs/PreCommit-HIVE-TRUNK-Build-100/source-prep.txt + [[ false == \t\r\u\e ]] + mkdir -p maven ivy + [[ svn = \s\v\n ]] + [[ -n '' ]] + [[ -d apache-svn-trunk-source ]] + [[ ! -d apache-svn-trunk-source/.svn ]] + [[ ! -d apache-svn-trunk-source ]] + cd apache-svn-trunk-source + svn revert -R . 
Reverted 'serde/src/test/org/apache/hadoop/hive/serde2/avro/TestAvroSerializer.java' Reverted 'serde/src/test/org/apache/hadoop/hive/serde2/avro/TestSchemaReEncoder.java' Reverted 'serde/src/test/org/apache/hadoop/hive/serde2/avro/TestAvroDeserializer.java' Reverted 'serde/src/test/org/apache/hadoop/hive/serde2/avro/TestAvroSerde.java' Reverted 'serde/src/test/org/apache/hadoop/hive/serde2/avro/TestAvroSerdeUtils.java' Reverted 'serde/src/test/org/apache/hadoop/hive/serde2/avro/TestThatEvolvedSchemasActAsWeWant.java' Reverted 'serde/src/test/org/apache/hadoop/hive/serde2/avro/TestAvroObjectInspectorGenerator.java' Reverted 'serde/src/test/org/apache/hadoop/hive/serde2/avro/TestGenericAvroRecordWritable.java' Reverted 'serde/src/java/org/apache/hadoop/hive/serde2/avro/SchemaResolutionProblem.java' Reverted 'serde/src/java/org/apache/hadoop/hive/serde2/avro/AvroSerdeUtils.java' Reverted 'serde/src/java/org/apache/hadoop/hive/serde2/avro/AvroGenericRecordWritable.java' Reverted 'ql/src/java/org/apache/hadoop/hive/ql/io/avro/AvroGenericRecordReader.java' ++ egrep -v '^X|^Performing status on external' ++ awk '{print $2}' ++ svn status --no-ignore + rm -rf target datanucleus.log ant/target shims/target shims/0.20/target shims/0.20S/target shims/0.23/target shims/aggregator/target shims/common/target shims/common-secure/target packaging/target hbase-handler/target testutils/target jdbc/target metastore/target itests/target itests/hcatalog-unit/target itests/test-serde/target itests/qtest/target itests/hive-unit-hadoop2/target itests/hive-minikdc/target itests/hive-unit/target itests/custom-serde/target itests/util/target hcatalog/target hcatalog/core/target hcatalog/streaming/target hcatalog/server-extensions/target hcatalog/webhcat/svr/target hcatalog/webhcat/java-client/target hcatalog/hcatalog-pig-adapter/target hwi/target common/target common/src/gen service/target contrib/target serde/target beeline/target odbc/target cli/target ql/dependency-reduced-pom.xml ql/target + svn update Fetching external item into 'hcatalog/src/test/e2e/harness' External at revision 1614583. At revision 1614583. + patchCommandPath=/data/hive-ptest/working/scratch/smart-apply-patch.sh + patchFilePath=/data/hive-ptest/working/scratch/build.patch + [[ -f /data/hive-ptest/working/scratch/build.patch ]] + chmod +x /data/hive-ptest/working/scratch/smart-apply-patch.sh + /data/hive-ptest/working/scratch/smart-apply-patch.sh /data/hive-ptest/working/scratch/build.patch The patch does not appear to apply with p0, p1, or p2 + exit 1 ' {noformat} This message is automatically generated. ATTACHMENT ID: 12658568 Fast stripe level merging for ORC
[jira] [Updated] (HIVE-7390) Make quote character optional and configurable in BeeLine CSV/TSV output
[ https://issues.apache.org/jira/browse/HIVE-7390?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ferdinand Xu updated HIVE-7390: --- Attachment: HIVE-7390.4.patch code changes according to the discussion Make quote character optional and configurable in BeeLine CSV/TSV output Key: HIVE-7390 URL: https://issues.apache.org/jira/browse/HIVE-7390 Project: Hive Issue Type: New Feature Components: Clients Affects Versions: 0.13.1 Reporter: Jim Halfpenny Assignee: Ferdinand Xu Attachments: HIVE-7390.1.patch, HIVE-7390.2.patch, HIVE-7390.3.patch, HIVE-7390.4.patch, HIVE-7390.patch Currently when either the CSV or TSV output formats are used in beeline each column is wrapped in single quotes. Quote wrapping of columns should be optional and the user should be able to choose the character used to wrap the columns. -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Updated] (HIVE-4933) Document how aliases work with the OVER clause
[ https://issues.apache.org/jira/browse/HIVE-4933?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lars Francke updated HIVE-4933: --- Summary: Document how aliases work with the OVER clause (was: Can't use alias directly before OVER clause) Document how aliases work with the OVER clause -- Key: HIVE-4933 URL: https://issues.apache.org/jira/browse/HIVE-4933 Project: Hive Issue Type: Bug Affects Versions: 0.11.0 Reporter: Lars Francke Priority: Minor {code} CREATE TABLE test (foo INT); hive SELECT SUM(foo) AS bar OVER (PARTITION BY foo) FROM test; MismatchedTokenException(175!=110) at org.antlr.runtime.BaseRecognizer.recoverFromMismatchedToken(BaseRecognizer.java:617) at org.antlr.runtime.BaseRecognizer.match(BaseRecognizer.java:115) at org.apache.hadoop.hive.ql.parse.HiveParser_FromClauseParser.fromClause(HiveParser_FromClauseParser.java:1424) at org.apache.hadoop.hive.ql.parse.HiveParser.fromClause(HiveParser.java:35998) at org.apache.hadoop.hive.ql.parse.HiveParser.selectStatement(HiveParser.java:33974) at org.apache.hadoop.hive.ql.parse.HiveParser.regular_body(HiveParser.java:33882) at org.apache.hadoop.hive.ql.parse.HiveParser.queryStatement(HiveParser.java:33389) at org.apache.hadoop.hive.ql.parse.HiveParser.queryStatementExpression(HiveParser.java:33169) at org.apache.hadoop.hive.ql.parse.HiveParser.execStatement(HiveParser.java:1284) at org.apache.hadoop.hive.ql.parse.HiveParser.statement(HiveParser.java:983) at org.apache.hadoop.hive.ql.parse.ParseDriver.parse(ParseDriver.java:190) at org.apache.hadoop.hive.ql.Driver.compile(Driver.java:434) at org.apache.hadoop.hive.ql.Driver.compile(Driver.java:352) at org.apache.hadoop.hive.ql.Driver.compileInternal(Driver.java:995) at org.apache.hadoop.hive.ql.Driver.runInternal(Driver.java:1038) at org.apache.hadoop.hive.ql.Driver.run(Driver.java:931) at org.apache.hadoop.hive.ql.Driver.run(Driver.java:921) at org.apache.hadoop.hive.cli.CliDriver.processLocalCmd(CliDriver.java:268) at org.apache.hadoop.hive.cli.CliDriver.processCmd(CliDriver.java:220) at org.apache.hadoop.hive.cli.CliDriver.processLine(CliDriver.java:422) at org.apache.hadoop.hive.cli.CliDriver.executeDriver(CliDriver.java:790) at org.apache.hadoop.hive.cli.CliDriver.run(CliDriver.java:684) at org.apache.hadoop.hive.cli.CliDriver.main(CliDriver.java:623) at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57) at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) at java.lang.reflect.Method.invoke(Method.java:606) at org.apache.hadoop.util.RunJar.main(RunJar.java:212) FAILED: ParseException line 1:20 mismatched input 'OVER' expecting FROM near 'bar' in from clause{code} The same happens without the {{AS}} but it works when leaving out the alias entirely. -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Commented] (HIVE-4933) Document how aliases work with the OVER clause
[ https://issues.apache.org/jira/browse/HIVE-4933?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14079077#comment-14079077 ] Lars Francke commented on HIVE-4933: The proper usage turns out to be {code:sql} SELECT SUM(foo) OVER (PARTITION BY foo) AS bar FROM test; {code} I have added documentation to the Wiki for this. Document how aliases work with the OVER clause -- Key: HIVE-4933 URL: https://issues.apache.org/jira/browse/HIVE-4933 Project: Hive Issue Type: Bug Affects Versions: 0.11.0 Reporter: Lars Francke Assignee: Lars Francke Priority: Minor {code} CREATE TABLE test (foo INT); hive SELECT SUM(foo) AS bar OVER (PARTITION BY foo) FROM test; MismatchedTokenException(175!=110) at org.antlr.runtime.BaseRecognizer.recoverFromMismatchedToken(BaseRecognizer.java:617) at org.antlr.runtime.BaseRecognizer.match(BaseRecognizer.java:115) at org.apache.hadoop.hive.ql.parse.HiveParser_FromClauseParser.fromClause(HiveParser_FromClauseParser.java:1424) at org.apache.hadoop.hive.ql.parse.HiveParser.fromClause(HiveParser.java:35998) at org.apache.hadoop.hive.ql.parse.HiveParser.selectStatement(HiveParser.java:33974) at org.apache.hadoop.hive.ql.parse.HiveParser.regular_body(HiveParser.java:33882) at org.apache.hadoop.hive.ql.parse.HiveParser.queryStatement(HiveParser.java:33389) at org.apache.hadoop.hive.ql.parse.HiveParser.queryStatementExpression(HiveParser.java:33169) at org.apache.hadoop.hive.ql.parse.HiveParser.execStatement(HiveParser.java:1284) at org.apache.hadoop.hive.ql.parse.HiveParser.statement(HiveParser.java:983) at org.apache.hadoop.hive.ql.parse.ParseDriver.parse(ParseDriver.java:190) at org.apache.hadoop.hive.ql.Driver.compile(Driver.java:434) at org.apache.hadoop.hive.ql.Driver.compile(Driver.java:352) at org.apache.hadoop.hive.ql.Driver.compileInternal(Driver.java:995) at org.apache.hadoop.hive.ql.Driver.runInternal(Driver.java:1038) at org.apache.hadoop.hive.ql.Driver.run(Driver.java:931) at org.apache.hadoop.hive.ql.Driver.run(Driver.java:921) at org.apache.hadoop.hive.cli.CliDriver.processLocalCmd(CliDriver.java:268) at org.apache.hadoop.hive.cli.CliDriver.processCmd(CliDriver.java:220) at org.apache.hadoop.hive.cli.CliDriver.processLine(CliDriver.java:422) at org.apache.hadoop.hive.cli.CliDriver.executeDriver(CliDriver.java:790) at org.apache.hadoop.hive.cli.CliDriver.run(CliDriver.java:684) at org.apache.hadoop.hive.cli.CliDriver.main(CliDriver.java:623) at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57) at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) at java.lang.reflect.Method.invoke(Method.java:606) at org.apache.hadoop.util.RunJar.main(RunJar.java:212) FAILED: ParseException line 1:20 mismatched input 'OVER' expecting FROM near 'bar' in from clause{code} The same happens without the {{AS}} but it works when leaving out the alias entirely. -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Assigned] (HIVE-4933) Document how aliases work with the OVER clause
[ https://issues.apache.org/jira/browse/HIVE-4933?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lars Francke reassigned HIVE-4933: -- Assignee: Lars Francke Document how aliases work with the OVER clause -- Key: HIVE-4933 URL: https://issues.apache.org/jira/browse/HIVE-4933 Project: Hive Issue Type: Bug Affects Versions: 0.11.0 Reporter: Lars Francke Assignee: Lars Francke Priority: Minor {code} CREATE TABLE test (foo INT); hive SELECT SUM(foo) AS bar OVER (PARTITION BY foo) FROM test; MismatchedTokenException(175!=110) at org.antlr.runtime.BaseRecognizer.recoverFromMismatchedToken(BaseRecognizer.java:617) at org.antlr.runtime.BaseRecognizer.match(BaseRecognizer.java:115) at org.apache.hadoop.hive.ql.parse.HiveParser_FromClauseParser.fromClause(HiveParser_FromClauseParser.java:1424) at org.apache.hadoop.hive.ql.parse.HiveParser.fromClause(HiveParser.java:35998) at org.apache.hadoop.hive.ql.parse.HiveParser.selectStatement(HiveParser.java:33974) at org.apache.hadoop.hive.ql.parse.HiveParser.regular_body(HiveParser.java:33882) at org.apache.hadoop.hive.ql.parse.HiveParser.queryStatement(HiveParser.java:33389) at org.apache.hadoop.hive.ql.parse.HiveParser.queryStatementExpression(HiveParser.java:33169) at org.apache.hadoop.hive.ql.parse.HiveParser.execStatement(HiveParser.java:1284) at org.apache.hadoop.hive.ql.parse.HiveParser.statement(HiveParser.java:983) at org.apache.hadoop.hive.ql.parse.ParseDriver.parse(ParseDriver.java:190) at org.apache.hadoop.hive.ql.Driver.compile(Driver.java:434) at org.apache.hadoop.hive.ql.Driver.compile(Driver.java:352) at org.apache.hadoop.hive.ql.Driver.compileInternal(Driver.java:995) at org.apache.hadoop.hive.ql.Driver.runInternal(Driver.java:1038) at org.apache.hadoop.hive.ql.Driver.run(Driver.java:931) at org.apache.hadoop.hive.ql.Driver.run(Driver.java:921) at org.apache.hadoop.hive.cli.CliDriver.processLocalCmd(CliDriver.java:268) at org.apache.hadoop.hive.cli.CliDriver.processCmd(CliDriver.java:220) at org.apache.hadoop.hive.cli.CliDriver.processLine(CliDriver.java:422) at org.apache.hadoop.hive.cli.CliDriver.executeDriver(CliDriver.java:790) at org.apache.hadoop.hive.cli.CliDriver.run(CliDriver.java:684) at org.apache.hadoop.hive.cli.CliDriver.main(CliDriver.java:623) at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57) at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) at java.lang.reflect.Method.invoke(Method.java:606) at org.apache.hadoop.util.RunJar.main(RunJar.java:212) FAILED: ParseException line 1:20 mismatched input 'OVER' expecting FROM near 'bar' in from clause{code} The same happens without the {{AS}} but it works when leaving out the alias entirely. -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Resolved] (HIVE-7327) Refactoring: make Hive map side data processing reusable
[ https://issues.apache.org/jira/browse/HIVE-7327?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xuefu Zhang resolved HIVE-7327. --- Resolution: Won't Fix Closing as Won't Fix. Will reopen if the need comes back. Refactoring: make Hive map side data processing reusable Key: HIVE-7327 URL: https://issues.apache.org/jira/browse/HIVE-7327 Project: Hive Issue Type: Sub-task Affects Versions: 0.13.0 Reporter: Xuefu Zhang Assignee: Xuefu Zhang ExecMapper is Hive's mapper implementation for MapReduce. Table rows are read by the MR framework and processed by the ExecMapper.map() method, which invokes Hive's map-side operator tree starting from MapOperator. This task is to extract the map-side data processing offered by the operator tree so that it can be used by other execution engines such as Spark. This is purely refactoring the existing code. -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Resolved] (HIVE-7328) Refactoring: make Hive reduce side data processing reusable
[ https://issues.apache.org/jira/browse/HIVE-7328?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xuefu Zhang resolved HIVE-7328. --- Resolution: Won't Fix Closing as Won't Fix. Will reopen if the need comes back. Refactoring: make Hive reduce side data processing reusable --- Key: HIVE-7328 URL: https://issues.apache.org/jira/browse/HIVE-7328 Project: Hive Issue Type: Sub-task Affects Versions: 0.13.0 Reporter: Xuefu Zhang Assignee: Xuefu Zhang ExecReducer is Hive's reducer implementation for MapReduce. Table rows are shuffled by the MR framework to ExecReducer and further processed by the ExecReducer.reduce() method, which invokes Hive's reduce-side operator tree. This task is to extract the reduce-side data processing offered by the operator tree so that it can be reused by other execution engines such as Spark. This is purely refactoring the existing code. -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Updated] (HIVE-7552) Collect spark job statistic through spark metrics[Spark Branch]
[ https://issues.apache.org/jira/browse/HIVE-7552?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chengxiang Li updated HIVE-7552: Description: MR/Tez use counters to collect job statistics, while Spark does not use accumulators to do the same thing. Instead, Spark stores task metrics in TaskMetrics objects and sends them back to the scheduler. We could get Spark job statistics by combining all TaskMetrics in a SparkListener. NO PRECOMMIT TESTS. This is for spark-branch only. was: MR/Tez use counters to collect job statistics, while Spark has a configurable metrics system based on the Coda Hale Metrics Library. We could collect Spark job statistics through the Spark metrics system on the Hive driver side. NO PRECOMMIT TESTS. This is for spark-branch only. Collect spark job statistic through spark metrics[Spark Branch] --- Key: HIVE-7552 URL: https://issues.apache.org/jira/browse/HIVE-7552 Project: Hive Issue Type: New Feature Components: Spark Reporter: Chengxiang Li MR/Tez use counters to collect job statistics, while Spark does not use accumulators to do the same thing. Instead, Spark stores task metrics in TaskMetrics objects and sends them back to the scheduler. We could get Spark job statistics by combining all TaskMetrics in a SparkListener. NO PRECOMMIT TESTS. This is for spark-branch only. -- This message was sent by Atlassian JIRA (v6.2#6252)
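A rough sketch of the approach described above follows. It assumes a Java-friendly listener base class (later Spark releases ship org.apache.spark.JavaSparkListener for exactly this purpose); the class name and the choice of metrics are illustrative, and the null check guards tasks that finish without metrics:
{code:java}
import java.util.concurrent.atomic.AtomicLong;
import org.apache.spark.JavaSparkListener;
import org.apache.spark.scheduler.SparkListenerTaskEnd;

// Hypothetical statistics collector: registered on the driver via
// SparkContext.addSparkListener(), it combines per-task TaskMetrics.
public class JobStatsListener extends JavaSparkListener {
  private final AtomicLong totalRunTimeMs = new AtomicLong();
  private final AtomicLong totalResultBytes = new AtomicLong();

  @Override
  public void onTaskEnd(SparkListenerTaskEnd taskEnd) {
    if (taskEnd.taskMetrics() != null) {
      totalRunTimeMs.addAndGet(taskEnd.taskMetrics().executorRunTime());
      totalResultBytes.addAndGet(taskEnd.taskMetrics().resultSize());
    }
  }

  public long getTotalRunTimeMs() { return totalRunTimeMs.get(); }
  public long getTotalResultBytes() { return totalResultBytes.get(); }
}
{code}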
[jira] [Commented] (HIVE-7436) Load Spark configuration into Hive driver
[ https://issues.apache.org/jira/browse/HIVE-7436?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14079131#comment-14079131 ] Xuefu Zhang commented on HIVE-7436: --- [~chengxiang li], I guess expecting spark-defaults.conf on the Hadoop classpath is fine for now, though we might need to revisit and brainstorm this again later. Note that we don't have to follow exactly what Tez did on every aspect, but I agree it can serve as a good reference point, giving users a similar experience. Load Spark configuration into Hive driver - Key: HIVE-7436 URL: https://issues.apache.org/jira/browse/HIVE-7436 Project: Hive Issue Type: Sub-task Components: Spark Reporter: Chengxiang Li Assignee: Chengxiang Li Fix For: spark-branch Attachments: HIVE-7436-Spark.1.patch, HIVE-7436-Spark.2.patch, HIVE-7436-Spark.3.patch Load Spark configuration into the Hive driver. There are 3 ways to set up Spark configurations: # Java properties. # Properties in the Spark configuration file (spark-defaults.conf). # The Hive configuration file (hive-site.xml). Sources later in this list take priority and overwrite earlier ones for the same property name. Please refer to [http://spark.apache.org/docs/latest/configuration.html] for all configurable properties of Spark. You can configure Spark in Hive in the following ways: # Through the Spark configuration file. #* Create spark-defaults.conf and place it in the /etc/spark/conf configuration directory, with properties in Java properties format. #* Set the $SPARK_CONF_DIR environment variable to the location of spark-defaults.conf: export SPARK_CONF_DIR=/etc/spark/conf #* Add $SPARK_CONF_DIR to the $HADOOP_CLASSPATH environment variable: export HADOOP_CLASSPATH=$SPARK_CONF_DIR:$HADOOP_CLASSPATH # Through the Hive configuration file. #* Edit hive-site.xml in the Hive conf directory and set the same properties in XML format. Hive driver default Spark properties: ||name||default value||description|| |spark.master|local|Spark master URL.| |spark.app.name|Hive on Spark|Default Spark application name.| NO PRECOMMIT TESTS. This is for spark-branch only. -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Commented] (HIVE-7436) Load Spark configuration into Hive driver
[ https://issues.apache.org/jira/browse/HIVE-7436?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14079150#comment-14079150 ] Xuefu Zhang commented on HIVE-7436: --- One more question: where did you see that tez-site.xml is read from the classpath by Hive -- in the code, or in documentation somewhere? I wasn't able to find it in either. Load Spark configuration into Hive driver - Key: HIVE-7436 URL: https://issues.apache.org/jira/browse/HIVE-7436 Project: Hive Issue Type: Sub-task Components: Spark Reporter: Chengxiang Li Assignee: Chengxiang Li Fix For: spark-branch Attachments: HIVE-7436-Spark.1.patch, HIVE-7436-Spark.2.patch, HIVE-7436-Spark.3.patch Load Spark configuration into the Hive driver. There are 3 ways to set up Spark configurations: # Java properties. # Properties in the Spark configuration file (spark-defaults.conf). # The Hive configuration file (hive-site.xml). Sources later in this list take priority and overwrite earlier ones for the same property name. Please refer to [http://spark.apache.org/docs/latest/configuration.html] for all configurable properties of Spark. You can configure Spark in Hive in the following ways: # Through the Spark configuration file. #* Create spark-defaults.conf and place it in the /etc/spark/conf configuration directory, with properties in Java properties format. #* Set the $SPARK_CONF_DIR environment variable to the location of spark-defaults.conf: export SPARK_CONF_DIR=/etc/spark/conf #* Add $SPARK_CONF_DIR to the $HADOOP_CLASSPATH environment variable: export HADOOP_CLASSPATH=$SPARK_CONF_DIR:$HADOOP_CLASSPATH # Through the Hive configuration file. #* Edit hive-site.xml in the Hive conf directory and set the same properties in XML format. Hive driver default Spark properties: ||name||default value||description|| |spark.master|local|Spark master URL.| |spark.app.name|Hive on Spark|Default Spark application name.| NO PRECOMMIT TESTS. This is for spark-branch only. -- This message was sent by Atlassian JIRA (v6.2#6252)
Re: Review Request 23799: HIVE-7390: refactor csv output format with in RFC mode and add one more option to support formatting as the csv format in hive cli
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/23799/#review49091 --- In general this feels a bit awkward. I think better CSV/TSV support is a good idea but quotedCsv seems misleading as the old csv and tsv now quote as well if the separator is contained in the column value. beeline/src/java/org/apache/hive/beeline/BeeLine.java https://reviews.apache.org/r/23799/#comment85924 Missing space here and next line beeline/src/java/org/apache/hive/beeline/SeparatedValuesOutputFormat.java https://reviews.apache.org/r/23799/#comment85920 remove this and call to getSeparator, can just be separator. beeline/src/java/org/apache/hive/beeline/SeparatedValuesOutputFormat.java https://reviews.apache.org/r/23799/#comment85915 Can be converted to a variable arity function (e.g. String... vals) beeline/src/java/org/apache/hive/beeline/SeparatedValuesOutputFormat.java https://reviews.apache.org/r/23799/#comment85916 Rename to writer? beeline/src/java/org/apache/hive/beeline/SeparatedValuesOutputFormat.java https://reviews.apache.org/r/23799/#comment85917 Same as above: Can be converted to variable arity method beeline/src/java/org/apache/hive/beeline/SeparatedValuesOutputFormat.java https://reviews.apache.org/r/23799/#comment85918 ...variable arity beeline/src/java/org/apache/hive/beeline/SeparatedValuesOutputFormat.java https://reviews.apache.org/r/23799/#comment85919 Remove this and probably replace the call to isSingleQuoted with just singleQuoted, no need to go through a simple getter beeline/src/java/org/apache/hive/beeline/SeparatedValuesOutputFormat.java https://reviews.apache.org/r/23799/#comment85923 Missing spaces around the else beeline/src/java/org/apache/hive/beeline/SeparatedValuesOutputFormat.java https://reviews.apache.org/r/23799/#comment85922 I'd either remove the getter and setters entirely or they need changing so that things are properly updated when separator/singleQuoted/csvPreference are changed. Example: Someone passes in a CsvPreference with a different separator than the one set in here. I think part of this patch needs to be the removal of all these simple (getter/)setters. If you don't want that then you need some verification logic that things make sense. beeline/src/java/org/apache/hive/beeline/SeparatedValuesOutputFormat.java https://reviews.apache.org/r/23799/#comment85921 This is not a getter but a setter. - Lars Francke On July 30, 2014, 8:30 a.m., cheng xu wrote: --- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/23799/ --- (Updated July 30, 2014, 8:30 a.m.) Review request for hive. Bugs: HIVE-7390 https://issues.apache.org/jira/browse/HIVE-7390 Repository: hive-git Description --- HIVE-7390: refactor csv output format with in RFC mode and add one more option to support formatting as the csv format in hive cli Diffs - beeline/pom.xml 6ec1d1aff3f35c097aa6054aae84faf2d63854f1 beeline/src/java/org/apache/hive/beeline/BeeLine.java 528a98e29c23421f9352bdf7c5edd3a9fae0e3ea beeline/src/java/org/apache/hive/beeline/SeparatedValuesOutputFormat.java 7853c3f38f3c3fb9ae0b9939c714f1dc940ba053 beeline/src/main/resources/BeeLine.properties 390d062b8dc52dfa790c7351f3db44c1e0dd7e37 itests/hive-unit/src/test/java/org/apache/hive/beeline/TestBeeLineWithArgs.java bd97aff5959fd9040fc0f0a1f6b782f2aa6f pom.xml b5a5697e6a3b689c2b244ba0338be541261eaa3d Diff: https://reviews.apache.org/r/23799/diff/ Testing --- Thanks, cheng xu
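For the "variable arity" suggestions in the review above, the requested change is the standard Java varargs refactor sketched below (generic names for illustration; the actual methods live in SeparatedValuesOutputFormat):
{code:java}
public class VarargsSketch {
  // Before: String join(String[] vals, char sep) forces callers to build an
  // explicit array, e.g. join(new String[] {"a", "b"}, ',').
  // After: a variable-arity parameter accepts either an array or a plain
  // argument list, so call sites become join(',', "a", "b").
  static String join(char separator, String... vals) {
    StringBuilder sb = new StringBuilder();
    for (int i = 0; i < vals.length; i++) {
      if (i > 0) {
        sb.append(separator);
      }
      sb.append(vals[i]);
    }
    return sb.toString();
  }

  public static void main(String[] args) {
    System.out.println(join(',', "a", "b", "c")); // prints a,b,c
  }
}
{code}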
[jira] [Commented] (HIVE-7532) allow disabling direct sql per query with external metastore
[ https://issues.apache.org/jira/browse/HIVE-7532?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14079180#comment-14079180 ] Hive QA commented on HIVE-7532: --- {color:red}Overall{color}: -1 at least one tests failed Here are the results of testing the latest attachment: https://issues.apache.org/jira/secure/attachment/12658566/HIVE-7532.2.patch.txt {color:red}ERROR:{color} -1 due to 2 failed/errored test(s), 5823 tests executed *Failed tests:* {noformat} org.apache.hadoop.hive.cli.TestMinimrCliDriver.testCliDriver_ql_rewrite_gbtoidx org.apache.hive.jdbc.miniHS2.TestHiveServer2.testConnection {noformat} Test results: http://ec2-174-129-184-35.compute-1.amazonaws.com/jenkins/job/PreCommit-HIVE-TRUNK-Build/101/testReport Console output: http://ec2-174-129-184-35.compute-1.amazonaws.com/jenkins/job/PreCommit-HIVE-TRUNK-Build/101/console Test logs: http://ec2-174-129-184-35.compute-1.amazonaws.com/logs/PreCommit-HIVE-TRUNK-Build-101/ Messages: {noformat} Executing org.apache.hive.ptest.execution.PrepPhase Executing org.apache.hive.ptest.execution.ExecutionPhase Executing org.apache.hive.ptest.execution.ReportingPhase Tests exited with: TestsFailedException: 2 tests failed {noformat} This message is automatically generated. ATTACHMENT ID: 12658566 allow disabling direct sql per query with external metastore Key: HIVE-7532 URL: https://issues.apache.org/jira/browse/HIVE-7532 Project: Hive Issue Type: Improvement Reporter: Sergey Shelukhin Assignee: Navis Attachments: HIVE-7532.1.patch.txt, HIVE-7532.2.patch.txt Currently with external metastore, direct sql can only be disabled via metastore config globally. Perhaps it makes sense to have the ability to propagate the setting per query from client to override the metastore setting, e.g. if one particular query causes it to fail. -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Commented] (HIVE-7390) Make quote character optional and configurable in BeeLine CSV/TSV output
[ https://issues.apache.org/jira/browse/HIVE-7390?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14079216#comment-14079216 ] Lars Francke commented on HIVE-7390: As noted in my review, I'm not too sure about adding another format, especially if it's called quotedCSV, because that implies that the others aren't using quoting -- but they actually are when needed. The old way sometimes produces invalid CSV (when quoting or delimiter chars exist in the data) so I think it's a good idea to fix this (and super-csv seems to solve that). I'm not sure if preserving the old functionality is worth anything. And if you do, then maybe deprecate it and name it `deprecatedCSV` or something like that. I'd be in favor of two options instead (similar to what was suggested originally): * Delimiter * Quoting character Maybe even a third: quoting mode. I'm in favor of always adding quotes as it makes parsing easier (no need to check for quoted/unquoted columns etc.). If not adding that, I'd vote in favor of changing the current quoting mode to the AlwaysQuote mode. Make quote character optional and configurable in BeeLine CSV/TSV output Key: HIVE-7390 URL: https://issues.apache.org/jira/browse/HIVE-7390 Project: Hive Issue Type: New Feature Components: Clients Affects Versions: 0.13.1 Reporter: Jim Halfpenny Assignee: Ferdinand Xu Attachments: HIVE-7390.1.patch, HIVE-7390.2.patch, HIVE-7390.3.patch, HIVE-7390.4.patch, HIVE-7390.patch Currently, when either the CSV or TSV output format is used in beeline, each column is wrapped in single quotes. Quote wrapping of columns should be optional and the user should be able to choose the character used to wrap the columns. -- This message was sent by Atlassian JIRA (v6.2#6252)
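For reference, the always-quote behavior argued for above is available in super-csv 2.x through a quote mode on the writer preferences. A minimal sketch, assuming the super-csv API the patch already builds on (the class name is illustrative):
{code:java}
import java.io.IOException;
import java.io.StringWriter;
import org.supercsv.io.CsvListWriter;
import org.supercsv.prefs.CsvPreference;
import org.supercsv.quote.AlwaysQuoteMode;

public class QuoteModeSketch {
  public static void main(String[] args) throws IOException {
    StringWriter out = new StringWriter();
    // AlwaysQuoteMode wraps every column in the quote character, so parsers
    // never have to probe for quoted vs. unquoted fields.
    CsvPreference prefs = new CsvPreference.Builder('"', ',', "\n")
        .useQuoteMode(new AlwaysQuoteMode())
        .build();
    CsvListWriter writer = new CsvListWriter(out, prefs);
    writer.write("plain", "with,comma", "with\"quote");
    writer.close();
    System.out.print(out); // "plain","with,comma","with""quote"
  }
}
{code}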
[jira] [Commented] (HIVE-7547) Add ipAddress and userName to ExecHook
[ https://issues.apache.org/jira/browse/HIVE-7547?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14079292#comment-14079292 ] Hive QA commented on HIVE-7547: --- {color:red}Overall{color}: -1 at least one tests failed Here are the results of testing the latest attachment: https://issues.apache.org/jira/secure/attachment/12658571/HIVE-7547.2.patch {color:red}ERROR:{color} -1 due to 2 failed/errored test(s), 5825 tests executed *Failed tests:* {noformat} org.apache.hadoop.hive.cli.TestMinimrCliDriver.testCliDriver_ql_rewrite_gbtoidx org.apache.hive.hcatalog.pig.TestOrcHCatLoader.testReadDataPrimitiveTypes {noformat} Test results: http://ec2-174-129-184-35.compute-1.amazonaws.com/jenkins/job/PreCommit-HIVE-TRUNK-Build/102/testReport Console output: http://ec2-174-129-184-35.compute-1.amazonaws.com/jenkins/job/PreCommit-HIVE-TRUNK-Build/102/console Test logs: http://ec2-174-129-184-35.compute-1.amazonaws.com/logs/PreCommit-HIVE-TRUNK-Build-102/ Messages: {noformat} Executing org.apache.hive.ptest.execution.PrepPhase Executing org.apache.hive.ptest.execution.ExecutionPhase Executing org.apache.hive.ptest.execution.ReportingPhase Tests exited with: TestsFailedException: 2 tests failed {noformat} This message is automatically generated. ATTACHMENT ID: 12658571 Add ipAddress and userName to ExecHook -- Key: HIVE-7547 URL: https://issues.apache.org/jira/browse/HIVE-7547 Project: Hive Issue Type: New Feature Components: Diagnosability Reporter: Szehon Ho Assignee: Szehon Ho Attachments: HIVE-7547.2.patch, HIVE-7547.patch Auditing tools should be able to know about the ipAddress and userName of the user executing operations. These could be made available through the Hive execution-hooks. -- This message was sent by Atlassian JIRA (v6.2#6252)
hive udf cannot recognize generic method
Hi there, I am writing a Hive UDF. The input could be string, int, double, etc., and the return type is based on the input type. I was trying to use a generic method, but Hive does not seem to recognize it. Here is the piece of code I have as an example: public <T> T evaluate(final T s, final String column_name, final int bitmap) throws Exception { if (s instanceof Double) return (T) new Double(-1.0); else if (s instanceof Integer) return (T) new Integer(-1); ….. } Does anyone know if Hive supports generic methods? Or do I have to override the evaluate method for each type of input? Thanks Dan
[jira] [Commented] (HIVE-6437) DefaultHiveAuthorizationProvider should not initialize a new HiveConf
[ https://issues.apache.org/jira/browse/HIVE-6437?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14079334#comment-14079334 ] Hive QA commented on HIVE-6437: --- {color:red}Overall{color}: -1 at least one tests failed Here are the results of testing the latest attachment: https://issues.apache.org/jira/secure/attachment/12658572/HIVE-6437.6.patch.txt {color:red}ERROR:{color} -1 due to 2 failed/errored test(s), 5838 tests executed *Failed tests:* {noformat} org.apache.hadoop.hive.cli.TestMinimrCliDriver.testCliDriver_ql_rewrite_gbtoidx org.apache.hive.hcatalog.pig.TestOrcHCatLoader.testReadDataPrimitiveTypes {noformat} Test results: http://ec2-174-129-184-35.compute-1.amazonaws.com/jenkins/job/PreCommit-HIVE-TRUNK-Build/103/testReport Console output: http://ec2-174-129-184-35.compute-1.amazonaws.com/jenkins/job/PreCommit-HIVE-TRUNK-Build/103/console Test logs: http://ec2-174-129-184-35.compute-1.amazonaws.com/logs/PreCommit-HIVE-TRUNK-Build-103/ Messages: {noformat} Executing org.apache.hive.ptest.execution.PrepPhase Executing org.apache.hive.ptest.execution.ExecutionPhase Executing org.apache.hive.ptest.execution.ReportingPhase Tests exited with: TestsFailedException: 2 tests failed {noformat} This message is automatically generated. ATTACHMENT ID: 12658572 DefaultHiveAuthorizationProvider should not initialize a new HiveConf - Key: HIVE-6437 URL: https://issues.apache.org/jira/browse/HIVE-6437 Project: Hive Issue Type: Bug Components: Configuration Affects Versions: 0.13.0 Reporter: Harsh J Assignee: Navis Priority: Trivial Attachments: HIVE-6437.1.patch.txt, HIVE-6437.2.patch.txt, HIVE-6437.3.patch.txt, HIVE-6437.4.patch.txt, HIVE-6437.5.patch.txt, HIVE-6437.6.patch.txt During a HS2 connection, every SessionState initializes a new DefaultHiveAuthorizationProvider object (on stock configs). In turn, DefaultHiveAuthorizationProvider carries a {{new HiveConf(…)}} that may prove too expensive and unnecessary, since SessionState itself passes in a fully applied HiveConf in the first place. -- This message was sent by Atlassian JIRA (v6.2#6252)
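To make the fix direction implied by the description concrete, a hedged sketch of what reusing the passed-in conf could look like (the setConf method and field names here are illustrative, not the actual HIVE-6437 patch):
{code}
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.hive.conf.HiveConf;

// Illustrative sketch: avoid constructing a fresh HiveConf per connection
// by reusing the fully applied conf that SessionState already provides.
public void setConf(Configuration conf) {
  this.conf = (conf instanceof HiveConf)
      ? (HiveConf) conf                 // reuse; no re-read of hive-site.xml
      : new HiveConf(conf, getClass()); // fall back only when necessary
}
{code}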
[jira] [Created] (HIVE-7554) Parquet Hive should resolve column names in case insensitive manner
Brock Noland created HIVE-7554: -- Summary: Parquet Hive should resolve column names in case insensitive manner Key: HIVE-7554 URL: https://issues.apache.org/jira/browse/HIVE-7554 Project: Hive Issue Type: Improvement Reporter: Brock Noland Assignee: Brock Noland -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Updated] (HIVE-7554) Parquet Hive should resolve column names in case insensitive manner
[ https://issues.apache.org/jira/browse/HIVE-7554?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Brock Noland updated HIVE-7554: --- Attachment: HIVE-7554.patch Parquet Hive should resolve column names in case insensitive manner --- Key: HIVE-7554 URL: https://issues.apache.org/jira/browse/HIVE-7554 Project: Hive Issue Type: Improvement Reporter: Brock Noland Assignee: Brock Noland Attachments: HIVE-7554.patch -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Commented] (HIVE-7554) Parquet Hive should resolve column names in case insensitive manner
[ https://issues.apache.org/jira/browse/HIVE-7554?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14079398#comment-14079398 ] Brock Noland commented on HIVE-7554: Patch cleans up ws. Parquet Hive should resolve column names in case insensitive manner --- Key: HIVE-7554 URL: https://issues.apache.org/jira/browse/HIVE-7554 Project: Hive Issue Type: Improvement Reporter: Brock Noland Assignee: Brock Noland Attachments: HIVE-7554.patch -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Commented] (HIVE-7446) Add support to ALTER TABLE .. ADD COLUMN to Avro backed tables
[ https://issues.apache.org/jira/browse/HIVE-7446?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14079418#comment-14079418 ] Hive QA commented on HIVE-7446: --- {color:red}Overall{color}: -1 at least one tests failed Here are the results of testing the latest attachment: https://issues.apache.org/jira/secure/attachment/12658576/HIVE-7446.patch {color:red}ERROR:{color} -1 due to 3 failed/errored test(s), 5840 tests executed *Failed tests:* {noformat} org.apache.hadoop.hive.cli.TestMiniTezCliDriver.testCliDriver_dynpart_sort_optimization org.apache.hadoop.hive.cli.TestMinimrCliDriver.testCliDriver_ql_rewrite_gbtoidx org.apache.hive.jdbc.miniHS2.TestHiveServer2.testConnection {noformat} Test results: http://ec2-174-129-184-35.compute-1.amazonaws.com/jenkins/job/PreCommit-HIVE-TRUNK-Build/104/testReport Console output: http://ec2-174-129-184-35.compute-1.amazonaws.com/jenkins/job/PreCommit-HIVE-TRUNK-Build/104/console Test logs: http://ec2-174-129-184-35.compute-1.amazonaws.com/logs/PreCommit-HIVE-TRUNK-Build-104/ Messages: {noformat} Executing org.apache.hive.ptest.execution.PrepPhase Executing org.apache.hive.ptest.execution.ExecutionPhase Executing org.apache.hive.ptest.execution.ReportingPhase Tests exited with: TestsFailedException: 3 tests failed {noformat} This message is automatically generated. ATTACHMENT ID: 12658576 Add support to ALTER TABLE .. ADD COLUMN to Avro backed tables -- Key: HIVE-7446 URL: https://issues.apache.org/jira/browse/HIVE-7446 Project: Hive Issue Type: New Feature Reporter: Ashish Kumar Singh Assignee: Ashish Kumar Singh Attachments: HIVE-7446.patch HIVE-6806 adds native support for creating hive table stored as Avro. It would be good to add support to ALTER TABLE .. ADD COLUMN to Avro backed tables. -- This message was sent by Atlassian JIRA (v6.2#6252)
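For context, the statement this feature targets is the existing ADD COLUMNS DDL; a small usage sketch (table and column names are made up, and it is HIVE-7446 that extends this to Avro-backed tables):
{code}
ALTER TABLE avro_backed_table ADD COLUMNS (new_col STRING COMMENT 'added after creation');
{code}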
[jira] [Commented] (HIVE-6437) DefaultHiveAuthorizationProvider should not initialize a new HiveConf
[ https://issues.apache.org/jira/browse/HIVE-6437?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14079544#comment-14079544 ] Thejas M Nair commented on HIVE-6437: - [~navis] The latest patch also has this change in SQLStdHiveAccessController.java to make the admin role comparison case sensitive. But role names are not case sensitive in sql std auth mode (also documented in the wiki). {code} -if (!HiveMetaStore.ADMIN.equalsIgnoreCase(role.getRoleName())) { +if (!HiveMetaStore.ADMIN.equals(role.getRoleName())) { {code} DefaultHiveAuthorizationProvider should not initialize a new HiveConf - Key: HIVE-6437 URL: https://issues.apache.org/jira/browse/HIVE-6437 Project: Hive Issue Type: Bug Components: Configuration Affects Versions: 0.13.0 Reporter: Harsh J Assignee: Navis Priority: Trivial Attachments: HIVE-6437.1.patch.txt, HIVE-6437.2.patch.txt, HIVE-6437.3.patch.txt, HIVE-6437.4.patch.txt, HIVE-6437.5.patch.txt, HIVE-6437.6.patch.txt During a HS2 connection, every SessionState initializes a new DefaultHiveAuthorizationProvider object (on stock configs). In turn, DefaultHiveAuthorizationProvider carries a {{new HiveConf(…)}} that may prove too expensive and unnecessary, since SessionState itself passes in a fully applied HiveConf in the first place. -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Resolved] (HIVE-7545) Tableau connecting with MapR ODBC driver cannot get more than 43 columns
[ https://issues.apache.org/jira/browse/HIVE-7545?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Venkata krishnan Sowrirajan resolved HIVE-7545. --- Resolution: Invalid Tableau connecting with MapR ODBC driver cannot get more than 43 columns Key: HIVE-7545 URL: https://issues.apache.org/jira/browse/HIVE-7545 Project: Hive Issue Type: Bug Environment: Tableau connecting using MapR ODBC driver - Windows Reporter: Venkata krishnan Sowrirajan Fix For: 0.13.1 Hive table with 170 columns and 1 million rows. When I queried all 170 columns of the table from Tableau using the MapR ODBC driver, it could not query more than 43 columns. Beyond that it gives an error saying [MapR][HiveODBC] (35) Error from Hive: error code: '10007' error message: 'Error while compiling statement: FAILED: SemanticException [Error 10007]: Ambiguous column reference c_43'. -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Commented] (HIVE-7549) Code cleanup of Task.java and HiveInputFormat.java
[ https://issues.apache.org/jira/browse/HIVE-7549?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14079564#comment-14079564 ] Hive QA commented on HIVE-7549: --- {color:red}Overall{color}: -1 at least one tests failed Here are the results of testing the latest attachment: https://issues.apache.org/jira/secure/attachment/12658575/HIVE-7549.patch {color:red}ERROR:{color} -1 due to 3 failed/errored test(s), 5838 tests executed *Failed tests:* {noformat} org.apache.hadoop.hive.cli.TestMiniTezCliDriver.testCliDriver_tez_join_hash org.apache.hadoop.hive.cli.TestMinimrCliDriver.testCliDriver_ql_rewrite_gbtoidx org.apache.hive.hcatalog.pig.TestOrcHCatLoader.testReadDataPrimitiveTypes {noformat} Test results: http://ec2-174-129-184-35.compute-1.amazonaws.com/jenkins/job/PreCommit-HIVE-TRUNK-Build/105/testReport Console output: http://ec2-174-129-184-35.compute-1.amazonaws.com/jenkins/job/PreCommit-HIVE-TRUNK-Build/105/console Test logs: http://ec2-174-129-184-35.compute-1.amazonaws.com/logs/PreCommit-HIVE-TRUNK-Build-105/ Messages: {noformat} Executing org.apache.hive.ptest.execution.PrepPhase Executing org.apache.hive.ptest.execution.ExecutionPhase Executing org.apache.hive.ptest.execution.ReportingPhase Tests exited with: TestsFailedException: 3 tests failed {noformat} This message is automatically generated. ATTACHMENT ID: 12658575 Code cleanup of Task.java and HiveInputFormat.java -- Key: HIVE-7549 URL: https://issues.apache.org/jira/browse/HIVE-7549 Project: Hive Issue Type: Improvement Reporter: Brock Noland Assignee: Brock Noland Priority: Minor Attachments: HIVE-7549.patch While working on Hive + Spark I noticed some ugly code which I've seen before but neglected. -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Updated] (HIVE-7509) Fast stripe level merging for ORC
[ https://issues.apache.org/jira/browse/HIVE-7509?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Prasanth J updated HIVE-7509: - Attachment: HIVE-7509.5.patch Thanks [~leftylev] for your comments. I fixed them in the .5 patch. Fast stripe level merging for ORC - Key: HIVE-7509 URL: https://issues.apache.org/jira/browse/HIVE-7509 Project: Hive Issue Type: Bug Affects Versions: 0.14.0 Reporter: Prasanth J Assignee: Prasanth J Labels: orcfile Attachments: HIVE-7509.1.patch, HIVE-7509.2.patch, HIVE-7509.3.patch, HIVE-7509.4.patch, HIVE-7509.5.patch Similar to HIVE-1950, add support for fast stripe-level merging of ORC files through the CONCATENATE command and conditional merge task. This fast merging is ideal for merging many small ORC files into a larger file without decompressing and decoding the data of the small ORC files. -- This message was sent by Atlassian JIRA (v6.2#6252)
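The command the fast merge path hooks into is the existing concatenate DDL; a small usage sketch (table and partition names are made up):
{code}
-- Merge the many small files of one partition in place; with HIVE-7509 this
-- happens at ORC stripe level, without decompressing/decoding row data.
ALTER TABLE orc_events PARTITION (ds = '2014-07-30') CONCATENATE;
{code}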
[jira] [Commented] (HIVE-6437) DefaultHiveAuthorizationProvider should not initialize a new HiveConf
[ https://issues.apache.org/jira/browse/HIVE-6437?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14079594#comment-14079594 ] Thejas M Nair commented on HIVE-6437: - Can you also please update the reviewboard with the new patch? DefaultHiveAuthorizationProvider should not initialize a new HiveConf - Key: HIVE-6437 URL: https://issues.apache.org/jira/browse/HIVE-6437 Project: Hive Issue Type: Bug Components: Configuration Affects Versions: 0.13.0 Reporter: Harsh J Assignee: Navis Priority: Trivial Attachments: HIVE-6437.1.patch.txt, HIVE-6437.2.patch.txt, HIVE-6437.3.patch.txt, HIVE-6437.4.patch.txt, HIVE-6437.5.patch.txt, HIVE-6437.6.patch.txt During a HS2 connection, every SessionState initializes a new DefaultHiveAuthorizationProvider object (on stock configs). In turn, DefaultHiveAuthorizationProvider carries a {{new HiveConf(…)}} that may prove too expensive and unnecessary, since SessionState itself passes in a fully applied HiveConf in the first place. -- This message was sent by Atlassian JIRA (v6.2#6252)
Re: Review Request 23953: HIVE-7519: Refactor QTestUtil to remove its duplication with QFileClient for qtest setup and teardown
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/23953/#review49132 --- Ship it! Looks good to me, pending test fixes. - Szehon Ho On July 29, 2014, 11:46 p.m., Ashish Singh wrote: --- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/23953/ --- (Updated July 29, 2014, 11:46 p.m.) Review request for hive. Bugs: HIVE-7519 https://issues.apache.org/jira/browse/HIVE-7519 Repository: hive-git Description --- HIVE-7519: Refactor QTestUtil to remove its duplication with QFileClient for qtest setup and teardown Diffs - ant/src/org/apache/hadoop/hive/ant/QTestGenTask.java 33f227fe6eb0ea6df936775f02e4339ed496f6ad data/conf/hive-site.xml fe8080addcadac4d52868866457dd038ea8d3d91 data/conf/tez/hive-site.xml 0c99bb6914bd26de26cef77cf29cf37f070098dc data/scripts/q_test_cleanup.sql 31bd7205d85916ea352f715f2fd1462efc788208 data/scripts/q_test_init.sql 12afdf391132e3fdd219aaa581e1f2e210d6dee2 hbase-handler/src/test/templates/TestHBaseCliDriver.vm 01d596aa6591ddccff016436c7f31324b3896d00 hbase-handler/src/test/templates/TestHBaseNegativeCliDriver.vm 45c73389cb26d0d461080cc146c5d74aee199c4e itests/hive-unit/src/test/java/org/apache/hadoop/hive/ql/TestLocationQueries.java 9edd7f30ff91bf7e01a2f52699192994fe0829f5 itests/qtest/pom.xml 249956fc170c0cef2b8f98454fa952c498b9e29e itests/util/src/main/java/org/apache/hadoop/hive/hbase/HBaseQTestUtil.java 96a0de2829c2ec065b7835b12c4932d1278f9a84 itests/util/src/main/java/org/apache/hadoop/hive/ql/QTestUtil.java 2fefa067791bd74412c0b4efb697dc0d8bb03cd7 ql/src/test/templates/TestCliDriver.vm 4776c75c16329c7d3f6f1a032eef192d553cc3cc ql/src/test/templates/TestCompareCliDriver.vm f6f43b847fdd4039328632ef70d841fce9006d6d ql/src/test/templates/TestNegativeCliDriver.vm 991d5ac1b2fde66dbe60b39c853916577449b1a4 ql/src/test/templates/TestParse.vm c476536940dc3a48000bf4e60e0b551ec7904d63 ql/src/test/templates/TestParseNegative.vm f62f17e4df5c1439d3787fc5c361804121bfcaf1 Diff: https://reviews.apache.org/r/23953/diff/ Testing --- qTests. Thanks, Ashish Singh
[jira] [Commented] (HIVE-7519) Refactor QTestUtil to remove its duplication with QFileClient for qtest setup and teardown
[ https://issues.apache.org/jira/browse/HIVE-7519?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14079606#comment-14079606 ] Szehon Ho commented on HIVE-7519: - +1, pending tests. This is good code cleanup. Refactor QTestUtil to remove its duplication with QFileClient for qtest setup and teardown --- Key: HIVE-7519 URL: https://issues.apache.org/jira/browse/HIVE-7519 Project: Hive Issue Type: Improvement Reporter: Ashish Kumar Singh Assignee: Ashish Kumar Singh Attachments: HIVE-7519.1.patch, HIVE-7519.patch QTestUtil hard codes creation and dropping of source tables for qtests. QFileClient does the same thing but in a better way, uses q_test_init.sql and q_test_cleanup.sql scripts. As QTestUtil is growing quite large it makes sense to refactor it to use QFileClient's approach. This will also remove duplication of code addressing same purpose. -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Commented] (HIVE-7550) Extend cached evaluation to multiple expressions
[ https://issues.apache.org/jira/browse/HIVE-7550?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14079706#comment-14079706 ] Hive QA commented on HIVE-7550: --- {color:red}Overall{color}: -1 at least one tests failed Here are the results of testing the latest attachment: https://issues.apache.org/jira/secure/attachment/12658580/HIVE-7550.1.patch.txt {color:red}ERROR:{color} -1 due to 4 failed/errored test(s), 5838 tests executed *Failed tests:* {noformat} org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_ba_table_udfs org.apache.hadoop.hive.cli.TestCompareCliDriver.testCompareCliDriver_vectorized_math_funcs org.apache.hadoop.hive.cli.TestMinimrCliDriver.testCliDriver_ql_rewrite_gbtoidx org.apache.hive.jdbc.miniHS2.TestHiveServer2.testConnection {noformat} Test results: http://ec2-174-129-184-35.compute-1.amazonaws.com/jenkins/job/PreCommit-HIVE-TRUNK-Build/106/testReport Console output: http://ec2-174-129-184-35.compute-1.amazonaws.com/jenkins/job/PreCommit-HIVE-TRUNK-Build/106/console Test logs: http://ec2-174-129-184-35.compute-1.amazonaws.com/logs/PreCommit-HIVE-TRUNK-Build-106/ Messages: {noformat} Executing org.apache.hive.ptest.execution.PrepPhase Executing org.apache.hive.ptest.execution.ExecutionPhase Executing org.apache.hive.ptest.execution.ReportingPhase Tests exited with: TestsFailedException: 4 tests failed {noformat} This message is automatically generated. ATTACHMENT ID: 12658580 Extend cached evaluation to multiple expressions Key: HIVE-7550 URL: https://issues.apache.org/jira/browse/HIVE-7550 Project: Hive Issue Type: Improvement Components: Query Processor Reporter: Navis Assignee: Navis Priority: Trivial Attachments: HIVE-7550.1.patch.txt Currently, hive.cache.expr.evaluation caches per expression. But cache context might be shared for multiple expressions. -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Created] (HIVE-7555) inner join is being resolved as cartesian product
J. Tipan Verella created HIVE-7555: -- Summary: inner join is being resolved as cartesian product Key: HIVE-7555 URL: https://issues.apache.org/jira/browse/HIVE-7555 Project: Hive Issue Type: Bug Environment: CentOS Reporter: J. Tipan Verella I believe this is a bug, because I do not seem to be able to find a way around the following stackoverflow question, http://stackoverflow.com/questions/25020190/hive-query-returns-cartesian-product-instead-of-inner-join The issue is as follows (repeated from SO for convenience). This is the type of query I am sending to Hive: SELECT BigTable.nicefield, LargeTable.* FROM LargeTable INNER JOIN BigTable ON ( LargeTable.joinfield1of4 = BigTable.joinfield1of4 AND LargeTable.joinfield2of4 = BigTable.joinfield2of4 ) WHERE LargeTable.joinfield3of4=20140726 AND LargeTable.joinfield4of4=15 AND BigTable.joinfield3of4=20140726 AND BigTable.joinfield4of4=15 AND LargeTable.filterfiled1of2=123456 AND LargeTable.filterfiled2of2=98765 AND LargeTable.joinfield2of4=12 AND LargeTable.joinfield1of4='iwanttolikehive' It returns `2418025` rows. The issue is that SELECT * FROM LargeTable WHERE joinfield3of4=20140726 AND joinfield4of4=15 AND filterfiled1of2=123456 AND filterfiled2of2=98765 AND joinfield2of4=12 AND joinfield1of4='iwanttolikehive' returns `1555` rows, and so does: SELECT * FROM BigTable WHERE joinfield3of4=20140726 AND joinfield4of4=15 AND joinfield2of4=12 AND joinfield1of4='iwanttolikehive' Note that **1555^2 = 2418025**. Feel free to discard this issue if it is not a bug, but please provide a solution on SO. Thank you. -- This message was sent by Atlassian JIRA (v6.2#6252)
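Worth noting: the WHERE clause above pins both join columns (joinfield1of4 and joinfield2of4) to single constant values, so every one of the 1555 rows on each side matches every row on the other side, and 1555^2 rows is exactly what an inner join should return. A quick diagnostic sketch along these lines (column names taken from the report above):
{code}
-- If this returns cnt > 1 for a key, an inner join on that key will
-- legitimately multiply rows rather than match them one-to-one.
SELECT joinfield1of4, joinfield2of4, COUNT(*) AS cnt
FROM LargeTable
WHERE joinfield3of4 = 20140726 AND joinfield4of4 = 15
  AND filterfiled1of2 = 123456 AND filterfiled2of2 = 98765
GROUP BY joinfield1of4, joinfield2of4
HAVING COUNT(*) > 1;
{code}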
Re: hive udf cannot recognize generic method
Sounds like you are using the older-style UDF class. In that case, yes, you would have to override evaluate() for each type of input. You could also try extending the GenericUDF class - that would allow you to do it in a single method, though it may be a bit more complicated (you can look at the Hive code for some examples). On Jul 30, 2014, at 7:43 AM, Dan Fan d...@appnexus.com wrote: Hi there, I am writing a Hive UDF. The input could be string, int, double, etc., and the return type is based on the input type. I was trying to use a generic method, but Hive does not seem to recognize it. Here is the piece of code I have as an example: public <T> T evaluate(final T s, final String column_name, final int bitmap) throws Exception { if (s instanceof Double) return (T) new Double(-1.0); else if (s instanceof Integer) return (T) new Integer(-1); ….. } Does anyone know if Hive supports generic methods? Or do I have to override the evaluate method for each type of input? Thanks Dan
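A minimal sketch of the GenericUDF approach suggested above, assuming the UDF should return -1 in the same primitive type as its first argument (the class name and the -1 logic are illustrative; initialize/evaluate/getDisplayString are the real GenericUDF methods):
{code}
import org.apache.hadoop.hive.ql.exec.UDFArgumentException;
import org.apache.hadoop.hive.ql.metadata.HiveException;
import org.apache.hadoop.hive.ql.udf.generic.GenericUDF;
import org.apache.hadoop.hive.serde2.objectinspector.ObjectInspector;
import org.apache.hadoop.hive.serde2.objectinspector.PrimitiveObjectInspector;
import org.apache.hadoop.hive.serde2.objectinspector.primitive.PrimitiveObjectInspectorFactory;

public class GenericUDFMinusOne extends GenericUDF {
  private PrimitiveObjectInspector inputOI;

  @Override
  public ObjectInspector initialize(ObjectInspector[] args) throws UDFArgumentException {
    if (args.length < 1 || !(args[0] instanceof PrimitiveObjectInspector)) {
      throw new UDFArgumentException("first argument must be a primitive type");
    }
    inputOI = (PrimitiveObjectInspector) args[0];
    // Declare the output as the same primitive type, in Java representation.
    return PrimitiveObjectInspectorFactory.getPrimitiveJavaObjectInspector(
        inputOI.getPrimitiveCategory());
  }

  @Override
  public Object evaluate(DeferredObject[] args) throws HiveException {
    if (args[0].get() == null) {
      return null;
    }
    // One method handles all input types by branching on the inspector.
    switch (inputOI.getPrimitiveCategory()) {
      case DOUBLE: return Double.valueOf(-1.0);
      case INT:    return Integer.valueOf(-1);
      default:     throw new HiveException("unsupported type: "
                       + inputOI.getPrimitiveCategory());
    }
  }

  @Override
  public String getDisplayString(String[] children) {
    return "minus_one(" + children[0] + ")";
  }
}
{code}
Hive resolves the types through the object inspectors set up in initialize(), so a single evaluate() covers all input types and no per-type overloads are needed.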
[jira] [Updated] (HIVE-7549) Code cleanup of Task.java and HiveInputFormat.java
[ https://issues.apache.org/jira/browse/HIVE-7549?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Brock Noland updated HIVE-7549: --- Resolution: Fixed Fix Version/s: 0.14.0 Status: Resolved (was: Patch Available) Thank you for the review Ashutosh! I have committed this to trunk. Code cleanup of Task.java and HiveInputFormat.java -- Key: HIVE-7549 URL: https://issues.apache.org/jira/browse/HIVE-7549 Project: Hive Issue Type: Improvement Reporter: Brock Noland Assignee: Brock Noland Priority: Minor Fix For: 0.14.0 Attachments: HIVE-7549.patch While working on Hive + Spark I noticed some ugly code which I've seen before but neglected. -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Commented] (HIVE-7390) Make quote character optional and configurable in BeeLine CSV/TSV output
[ https://issues.apache.org/jira/browse/HIVE-7390?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14079776#comment-14079776 ] Szehon Ho commented on HIVE-7390: - Thanks for the details, I was just reading the earlier comments and wrongly assumed that the two valid CSV options are double quotes and no quotes at all. You're right that normal quote mode still means quotes sometimes, so my proposed naming didn't make sense, sorry about that Ferdinand. So we should: # Fix the current CSV to conform by using super-csv (like the patch I originally looked at in HIVE-7434). No debate on that. # See what CSV options (if any) we are going to expose I'd still try to keep it simple if possible. Can we expose quote mode only (always, normal)? I'm not sure if delimiter and quote character would add that much value, but I'm not a heavy CSV user. Thoughts? Make quote character optional and configurable in BeeLine CSV/TSV output Key: HIVE-7390 URL: https://issues.apache.org/jira/browse/HIVE-7390 Project: Hive Issue Type: New Feature Components: Clients Affects Versions: 0.13.1 Reporter: Jim Halfpenny Assignee: Ferdinand Xu Attachments: HIVE-7390.1.patch, HIVE-7390.2.patch, HIVE-7390.3.patch, HIVE-7390.4.patch, HIVE-7390.patch Currently when either the CSV or TSV output formats are used in beeline each column is wrapped in single quotes. Quote wrapping of columns should be optional and the user should be able to choose the character used to wrap the columns. -- This message was sent by Atlassian JIRA (v6.2#6252)
Re: Review Request 24084: HIVE-7547 - Add ipAddress and userName to ExecHook
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/24084/#review49141 --- itests/hive-minikdc/src/test/java/org/apache/hive/minikdc/TestHs2HooksWithMiniKdc.java https://reviews.apache.org/r/24084/#comment85986 if the hook does not run these two NPE. Let's have an assertion first for not null ql/src/java/org/apache/hadoop/hive/ql/Driver.java https://reviews.apache.org/r/24084/#comment85987 Let's put this in javadoc format service/src/java/org/apache/hive/service/cli/CLIService.java https://reviews.apache.org/r/24084/#comment85988 If this should not happen, should we throw these? - Brock Noland On July 30, 2014, 2:13 a.m., Szehon Ho wrote: --- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/24084/ --- (Updated July 30, 2014, 2:13 a.m.) Review request for hive. Bugs: HIVE-7547 https://issues.apache.org/jira/browse/HIVE-7547 Repository: hive-git Description --- Passing the ipAddress and userName (already calculated in ThriftCLIService for other purposes) through several layers down to the hooks. Diffs - itests/hive-minikdc/src/test/java/org/apache/hive/minikdc/TestHs2HooksWithMiniKdc.java PRE-CREATION itests/hive-unit/src/test/java/org/apache/hadoop/hive/hooks/TestHs2Hooks.java PRE-CREATION ql/src/java/org/apache/hadoop/hive/ql/Driver.java e512199 ql/src/java/org/apache/hadoop/hive/ql/hooks/HookContext.java b11cb86 service/src/java/org/apache/hive/service/cli/CLIService.java add37a1 service/src/java/org/apache/hive/service/cli/operation/SQLOperation.java de54ca1 service/src/java/org/apache/hive/service/cli/session/HiveSession.java 9785e95 service/src/java/org/apache/hive/service/cli/thrift/ThriftCLIService.java 5c87bcb Diff: https://reviews.apache.org/r/24084/diff/ Testing --- Added tests in both kerberos and non-kerberos mode. Thanks, Szehon Ho
Re: Why does SMB join generate hash table locally, even if input tables are large?
+hive-users On Tue, Jul 29, 2014 at 1:56 PM, Pala M Muthaia mchett...@rocketfuelinc.com wrote: Hi, I am testing SMB join for 2 large tables. The tables are bucketed and sorted on the join column. I notice that even though the table is large, Hive attempts to generate a hash table for the 'small' table locally, similar to a map join. Since the table is large in my case, the client runs out of memory and the query fails. I am using Hive 0.12 with the following settings: set hive.optimize.bucketmapjoin=true; set hive.optimize.bucketmapjoin.sortedmerge=true; set hive.input.format = org.apache.hadoop.hive.ql.io.BucketizedHiveInputFormat; My test query does a simple join and a select, no subqueries/nested queries etc. I understand why a (bucket) map join requires hash table generation, but why is that included for an SMB join? Shouldn't an SMB join just spin up one mapper for each bucket and perform a sort-merge join directly in the mapper? Thanks, pala
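One hedged possibility, assuming map-join auto conversion is what triggers the local hash-table stage: steer the planner toward the automatic sort-merge conversion instead. The names below are real Hive settings, but the right combination is version-dependent, so treat this as a sketch:
{code}
-- Prefer automatic sort-merge join conversion (added in Hive 0.11):
set hive.auto.convert.sortmerge.join=true;
set hive.optimize.bucketmapjoin=true;
set hive.optimize.bucketmapjoin.sortedmerge=true;
set hive.input.format=org.apache.hadoop.hive.ql.io.BucketizedHiveInputFormat;
-- If map-join conversion keeps building local hash tables, disabling it
-- isolates the SMB path:
set hive.auto.convert.join=false;
{code}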
[jira] [Created] (HIVE-7556) Fix code style, license header, tabs, etc. [Spark Branch]
Xuefu Zhang created HIVE-7556: - Summary: Fix code style, license header, tabs, etc. [Spark Branch] Key: HIVE-7556 URL: https://issues.apache.org/jira/browse/HIVE-7556 Project: Hive Issue Type: Bug Components: Spark Reporter: Xuefu Zhang Assignee: Xuefu Zhang Attachments: HIVE-7556.patch -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Updated] (HIVE-7556) Fix code style, license header, tabs, etc. [Spark Branch]
[ https://issues.apache.org/jira/browse/HIVE-7556?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xuefu Zhang updated HIVE-7556: -- Attachment: HIVE-7556.patch Fix code style, license header, tabs, etc. [Spark Branch] - Key: HIVE-7556 URL: https://issues.apache.org/jira/browse/HIVE-7556 Project: Hive Issue Type: Bug Components: Spark Reporter: Xuefu Zhang Assignee: Xuefu Zhang Attachments: HIVE-7556.patch -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Updated] (HIVE-7556) Fix code style, license header, tabs, etc. [Spark Branch]
[ https://issues.apache.org/jira/browse/HIVE-7556?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xuefu Zhang updated HIVE-7556: -- Status: Patch Available (was: Open) Fix code style, license header, tabs, etc. [Spark Branch] - Key: HIVE-7556 URL: https://issues.apache.org/jira/browse/HIVE-7556 Project: Hive Issue Type: Bug Components: Spark Reporter: Xuefu Zhang Assignee: Xuefu Zhang Attachments: HIVE-7556.patch NO PRECOMMIT TESTS. This is for spark branch only. -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Updated] (HIVE-7556) Fix code style, license header, tabs, etc. [Spark Branch]
[ https://issues.apache.org/jira/browse/HIVE-7556?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xuefu Zhang updated HIVE-7556: -- Description: NO PRECOMMIT TESTS. This is for spark branch only. Fix code style, license header, tabs, etc. [Spark Branch] - Key: HIVE-7556 URL: https://issues.apache.org/jira/browse/HIVE-7556 Project: Hive Issue Type: Bug Components: Spark Reporter: Xuefu Zhang Assignee: Xuefu Zhang Attachments: HIVE-7556.patch NO PRECOMMIT TESTS. This is for spark branch only. -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Updated] (HIVE-7556) Fix code style, license header, tabs, etc. [Spark Branch]
[ https://issues.apache.org/jira/browse/HIVE-7556?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xuefu Zhang updated HIVE-7556: -- Resolution: Fixed Fix Version/s: spark-branch Status: Resolved (was: Patch Available) Patch committed to spark branch. Fix code style, license header, tabs, etc. [Spark Branch] - Key: HIVE-7556 URL: https://issues.apache.org/jira/browse/HIVE-7556 Project: Hive Issue Type: Bug Components: Spark Reporter: Xuefu Zhang Assignee: Xuefu Zhang Fix For: spark-branch Attachments: HIVE-7556.patch NO PRECOMMIT TESTS. This is for spark branch only. -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Commented] (HIVE-7029) Vectorize ReduceWork
[ https://issues.apache.org/jira/browse/HIVE-7029?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14079821#comment-14079821 ] Matt McCline commented on HIVE-7029: Temporarily turned off the (Tez) dynpart_sort_opt_vectorization.q test. Created https://issues.apache.org/jira/browse/HIVE-7557 to cover that issue. Vectorize ReduceWork Key: HIVE-7029 URL: https://issues.apache.org/jira/browse/HIVE-7029 Project: Hive Issue Type: Sub-task Reporter: Matt McCline Assignee: Matt McCline Attachments: HIVE-7029.1.patch, HIVE-7029.2.patch, HIVE-7029.3.patch, HIVE-7029.4.patch, HIVE-7029.5.patch, HIVE-7029.6.patch, HIVE-7029.7.patch This will enable the vectorization team to independently work on vectorization on the reduce side even before vectorized shuffle is ready. NOTE: Tez only (i.e. TezTask only) -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Created] (HIVE-7557) When reduce is vectorized, dynpart_sort_opt_vectorization.q under Tez fails
Matt McCline created HIVE-7557: -- Summary: When reduce is vectorized, dynpart_sort_opt_vectorization.q under Tez fails Key: HIVE-7557 URL: https://issues.apache.org/jira/browse/HIVE-7557 Project: Hive Issue Type: Bug Reporter: Matt McCline Assignee: Rajesh Balamohan Turned off dynpart_sort_opt_vectorization.q (Tez) since it fails when reduce is vectorized to get HIVE-7029 checked in. Stack trace: {code} Container released by application, AttemptID:attempt_1406747677386_0003_2_00_00_2 Info:Error: java.lang.RuntimeException: java.lang.RuntimeException: org.apache.hadoop.hive.ql.metadata.HiveException: Hive Runtime Error while processing vector batch (tag=0) [Error getting row data with exception java.lang.ClassCastException: org.apache.hadoop.hive.ql.exec.vector.DoubleColumnVector cannot be cast to org.apache.hadoop.hive.ql.exec.vector.LongColumnVector at org.apache.hadoop.hive.ql.exec.vector.expressions.VectorExpressionWriterFactory$VectorExpressionWriterLong.writeValue(VectorExpressionWriterFactory.java:168) at org.apache.hadoop.hive.ql.exec.vector.VectorizedRowBatch.toString(VectorizedRowBatch.java:159) at org.apache.hadoop.hive.ql.exec.tez.ReduceRecordProcessor.processVectors(ReduceRecordProcessor.java:481) at org.apache.hadoop.hive.ql.exec.tez.ReduceRecordProcessor.processRows(ReduceRecordProcessor.java:371) at org.apache.hadoop.hive.ql.exec.tez.ReduceRecordProcessor.run(ReduceRecordProcessor.java:291) at org.apache.hadoop.hive.ql.exec.tez.TezProcessor.run(TezProcessor.java:165) at org.apache.tez.runtime.LogicalIOProcessorRuntimeTask.run(LogicalIOProcessorRuntimeTask.java:307) at org.apache.hadoop.mapred.YarnTezDagChild$5.run(YarnTezDagChild.java:562) at java.security.AccessController.doPrivileged(Native Method) at javax.security.auth.Subject.doAs(Subject.java:394) at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1548) at org.apache.hadoop.mapred.YarnTezDagChild.main(YarnTezDagChild.java:551) ] at org.apache.hadoop.hive.ql.exec.tez.TezProcessor.run(TezProcessor.java:188) at org.apache.tez.runtime.LogicalIOProcessorRuntimeTask.run(LogicalIOProcessorRuntimeTask.java:307) at org.apache.hadoop.mapred.YarnTezDagChild$5.run(YarnTezDagChild.java:562) at java.security.AccessController.doPrivileged(Native Method) at javax.security.auth.Subject.doAs(Subject.java:394) at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1548) at org.apache.hadoop.mapred.YarnTezDagChild.main(YarnTezDagChild.java:551) Caused by: java.lang.RuntimeException: org.apache.hadoop.hive.ql.metadata.HiveException: Hive Runtime Error while processing vector batch (tag=0) [Error getting row data with exception java.lang.ClassCastException: org.apache.hadoop.hive.ql.exec.vector.DoubleColumnVector cannot be cast to org.apache.hadoop.hive.ql.exec.vector.LongColumnVector at org.apache.hadoop.hive.ql.exec.vector.expressions.VectorExpressionWriterFactory$VectorExpressionWriterLong.writeValue(VectorExpressionWriterFactory.java:168) at org.apache.hadoop.hive.ql.exec.vector.VectorizedRowBatch.toString(VectorizedRowBatch.java:159) at org.apache.hadoop.hive.ql.exec.tez.ReduceRecordProcessor.processVectors(ReduceRecordProcessor.java:481) at org.apache.hadoop.hive.ql.exec.tez.ReduceRecordProcessor.processRows(ReduceRecordProcessor.java:371) at org.apache.hadoop.hive.ql.exec.tez.ReduceRecordProcessor.run(ReduceRecordProcessor.java:291) at org.apache.hadoop.hive.ql.exec.tez.TezProcessor.run(TezProcessor.java:165) at 
org.apache.tez.runtime.LogicalIOProcessorRuntimeTask.run(LogicalIOProcessorRuntimeTask.java:307) at org.apache.hadoop.mapred.YarnTezDagChild$5.run(YarnTezDagChild.java:562) at java.security.AccessController.doPrivileged(Native Method) at javax.security.auth.Subject.doAs(Subject.java:394) at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1548) at org.apache.hadoop.mapred.YarnTezDagChild.main(YarnTezDagChild.java:551) ] at org.apache.hadoop.hive.ql.exec.tez.ReduceRecordProcessor.processRows(ReduceRecordProcessor.java:382) at org.apache.hadoop.hive.ql.exec.tez.ReduceRecordProcessor.run(ReduceRecordProcessor.java:291) at org.apache.hadoop.hive.ql.exec.tez.TezProcessor.run(TezProcessor.java:165) ... 6 more Caused by: org.apache.hadoop.hive.ql.metadata.HiveException: Hive Runtime Error while processing vector batch (tag=0) [Error getting row data with exception java.lang.ClassCastException: org.apache.hadoop.hive.ql.exec.vector.DoubleColumnVector cannot be cast to
[jira] [Updated] (HIVE-7029) Vectorize ReduceWork
[ https://issues.apache.org/jira/browse/HIVE-7029?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matt McCline updated HIVE-7029: --- Status: In Progress (was: Patch Available) Vectorize ReduceWork Key: HIVE-7029 URL: https://issues.apache.org/jira/browse/HIVE-7029 Project: Hive Issue Type: Sub-task Reporter: Matt McCline Assignee: Matt McCline Attachments: HIVE-7029.1.patch, HIVE-7029.2.patch, HIVE-7029.3.patch, HIVE-7029.4.patch, HIVE-7029.5.patch, HIVE-7029.6.patch, HIVE-7029.7.patch, HIVE-7029.8.patch This will enable the vectorization team to independently work on vectorization on the reduce side even before vectorized shuffle is ready. NOTE: Tez only (i.e. TezTask only) -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Updated] (HIVE-7029) Vectorize ReduceWork
[ https://issues.apache.org/jira/browse/HIVE-7029?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matt McCline updated HIVE-7029: --- Attachment: HIVE-7029.8.patch Vectorize ReduceWork Key: HIVE-7029 URL: https://issues.apache.org/jira/browse/HIVE-7029 Project: Hive Issue Type: Sub-task Reporter: Matt McCline Assignee: Matt McCline Attachments: HIVE-7029.1.patch, HIVE-7029.2.patch, HIVE-7029.3.patch, HIVE-7029.4.patch, HIVE-7029.5.patch, HIVE-7029.6.patch, HIVE-7029.7.patch, HIVE-7029.8.patch This will enable the vectorization team to independently work on vectorization on the reduce side even before vectorized shuffle is ready. NOTE: Tez only (i.e. TezTask only) -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Updated] (HIVE-7029) Vectorize ReduceWork
[ https://issues.apache.org/jira/browse/HIVE-7029?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matt McCline updated HIVE-7029: --- Status: Patch Available (was: In Progress) Vectorize ReduceWork Key: HIVE-7029 URL: https://issues.apache.org/jira/browse/HIVE-7029 Project: Hive Issue Type: Sub-task Reporter: Matt McCline Assignee: Matt McCline Attachments: HIVE-7029.1.patch, HIVE-7029.2.patch, HIVE-7029.3.patch, HIVE-7029.4.patch, HIVE-7029.5.patch, HIVE-7029.6.patch, HIVE-7029.7.patch, HIVE-7029.8.patch This will enable the vectorization team to independently work on vectorization on the reduce side even before vectorized shuffle is ready. NOTE: Tez only (i.e. TezTask only) -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Updated] (HIVE-7526) Research to use groupby transformation to replace Hive existing partitionByKey and SparkCollector combination
[ https://issues.apache.org/jira/browse/HIVE-7526?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chao updated HIVE-7526: --- Attachment: HIVE-7526.3.patch An attempt to fix the last patch by moving the groupBy op to ShuffleTran. Also, since SparkTran::transform may now have input/output value types other than BytesWritable, we need to make it generic as well. Also added a CompTran class, which is basically a composition of transformations. It offers better type compatibility than ChainedTran. This is NOT the perfect solution, and may be subject to further change. Research to use groupby transformation to replace Hive existing partitionByKey and SparkCollector combination - Key: HIVE-7526 URL: https://issues.apache.org/jira/browse/HIVE-7526 Project: Hive Issue Type: Task Components: Spark Reporter: Xuefu Zhang Assignee: Chao Attachments: HIVE-7526.2.patch, HIVE-7526.3.patch, HIVE-7526.patch Currently SparkClient shuffles data by calling partitionByKey(). This transformation outputs <key, value> tuples. However, Hive's ExecMapper expects <key, iterator<value>> tuples, and Spark's groupByKey() seems to output this directly. Thus, using groupByKey, we may be able to avoid Hive's own key clustering mechanism (in HiveReduceFunction). This research is to try that out. -- This message was sent by Atlassian JIRA (v6.2#6252)
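To make the generic-transform point concrete, a hypothetical shape for the interface (SparkTran and CompTran are names from the patch, but the signatures below are guesses for illustration only):
{code}
import org.apache.spark.api.java.JavaPairRDD;

// Hypothetical sketch: key/value types become free parameters instead of
// being fixed to BytesWritable.
abstract class SparkTran<KI, VI, KO, VO> {
  abstract JavaPairRDD<KO, VO> transform(JavaPairRDD<KI, VI> input);

  // Composition lines the types up end to end, which is what a
  // CompTran-style class can check that an untyped chain cannot.
  <KO2, VO2> SparkTran<KI, VI, KO2, VO2> then(final SparkTran<KO, VO, KO2, VO2> next) {
    final SparkTran<KI, VI, KO, VO> self = this;
    return new SparkTran<KI, VI, KO2, VO2>() {
      @Override
      JavaPairRDD<KO2, VO2> transform(JavaPairRDD<KI, VI> input) {
        return next.transform(self.transform(input));
      }
    };
  }
}
{code}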
[jira] [Commented] (HIVE-7348) Beeline could not parse ; separated queries provided with -e option
[ https://issues.apache.org/jira/browse/HIVE-7348?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14079827#comment-14079827 ] Hive QA commented on HIVE-7348: --- {color:red}Overall{color}: -1 at least one tests failed Here are the results of testing the latest attachment: https://issues.apache.org/jira/secure/attachment/12658582/HIVE-7348.patch {color:red}ERROR:{color} -1 due to 13 failed/errored test(s), 5838 tests executed *Failed tests:* {noformat} org.apache.hadoop.hive.cli.TestMiniTezCliDriver.testCliDriver_dynpart_sort_optimization org.apache.hadoop.hive.cli.TestMinimrCliDriver.testCliDriver_ql_rewrite_gbtoidx org.apache.hive.beeline.TestBeeLineWithArgs.testBeelineHiveConfVariable org.apache.hive.beeline.TestBeeLineWithArgs.testBeelineHiveVariable org.apache.hive.beeline.TestBeeLineWithArgs.testBeelineMultiHiveVariable org.apache.hive.beeline.TestBeeLineWithArgs.testNullDefault org.apache.hive.beeline.TestBeeLineWithArgs.testNullEmpty org.apache.hive.beeline.TestBeeLineWithArgs.testNullEmptyCmdArg org.apache.hive.beeline.TestBeeLineWithArgs.testNullNonEmpty org.apache.hive.beeline.TestSchemaTool.testSchemaInit org.apache.hive.beeline.TestSchemaTool.testSchemaUpgrade org.apache.hive.beeline.TestSchemaTool.testSchemaUpgradeDryRun org.apache.hive.jdbc.miniHS2.TestHiveServer2.testConnection {noformat} Test results: http://ec2-174-129-184-35.compute-1.amazonaws.com/jenkins/job/PreCommit-HIVE-TRUNK-Build/107/testReport Console output: http://ec2-174-129-184-35.compute-1.amazonaws.com/jenkins/job/PreCommit-HIVE-TRUNK-Build/107/console Test logs: http://ec2-174-129-184-35.compute-1.amazonaws.com/logs/PreCommit-HIVE-TRUNK-Build-107/ Messages: {noformat} Executing org.apache.hive.ptest.execution.PrepPhase Executing org.apache.hive.ptest.execution.ExecutionPhase Executing org.apache.hive.ptest.execution.ReportingPhase Tests exited with: TestsFailedException: 13 tests failed {noformat} This message is automatically generated. ATTACHMENT ID: 12658582 Beeline could not parse ; separated queries provided with -e option --- Key: HIVE-7348 URL: https://issues.apache.org/jira/browse/HIVE-7348 Project: Hive Issue Type: Bug Reporter: Ashish Kumar Singh Assignee: Ashish Kumar Singh Attachments: HIVE-7348.patch Beeline could not parse ; separated queries provided with -e option. This works fine on hive cli. -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Created] (HIVE-7558) HCatLoader reuses credentials across jobs
Thiruvel Thirumoolan created HIVE-7558: -- Summary: HCatLoader reuses credentials across jobs Key: HIVE-7558 URL: https://issues.apache.org/jira/browse/HIVE-7558 Project: Hive Issue Type: Bug Components: HCatalog Affects Versions: 0.13.1 Reporter: Thiruvel Thirumoolan Fix For: 0.14.0 HCatLoader reuses credentials of stage1 in stage2 for some of the pig queries. This causes stage-2 to fail, if stage-2 runs for more than 10 mins. Pig queries which loads data using HCatLoader, filters only by partition columns and does an order by will run into this problem. Exceptions will be very similar to the following: 2014-07-22 17:28:49,337 [main] ERROR org.apache.pig.tools.grunt.GruntParser - ERROR 2997: Unable to recreate exception from backed error: AttemptID:attemptid Info:RemoteTrace: org.apache.hadoop.security.token.SecretManager$InvalidToken: token (HDFS_DELEGATION_TOKEN token tokenid for user) can't be found in cache at org.apache.hadoop.ipc.Client.call(Client.java:1095) at org.apache.hadoop.ipc.WritableRpcEngine$Invoker.invoke(WritableRpcEngine.java:195) at $Proxy7.getFileInfo(Unknown Source) at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57) at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) at java.lang.reflect.Method.invoke(Method.java:601) at org.apache.hadoop.io.retry.RetryInvocationHandler.invokeMethod(RetryInvocationHandler.java:102) at org.apache.hadoop.io.retry.RetryInvocationHandler.invoke(RetryInvocationHandler.java:67) at $Proxy7.getFileInfo(Unknown Source) at org.apache.hadoop.hdfs.DFSClient.getFileInfo(DFSClient.java:1305) at org.apache.hadoop.hdfs.DistributedFileSystem.getFileStatus(DistributedFileSystem.java:734) at org.apache.hadoop.yarn.util.FSDownload.copy(FSDownload.java:176) at org.apache.hadoop.yarn.util.FSDownload.access$000(FSDownload.java:51) at org.apache.hadoop.yarn.util.FSDownload$1.run(FSDownload.java:284) at org.apache.hadoop.yarn.util.FSDownload$1.run(FSDownload.java:282) at java.security.AccessController.doPrivileged(Native Method) at javax.security.auth.Subject.doAs(Subject.java:415) at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1300) at org.apache.hadoop.yarn.util.FSDownload.call(FSDownload.java:281) at org.apache.hadoop.yarn.util.FSDownload.call(FSDownload.java:51) at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:334) at java.util.concurrent.FutureTask.run(FutureTask.java:166) at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471) at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:334) at java.util.concurrent.FutureTask.run(FutureTask.java:166) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1110) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:603) at java.lang.Thread.run(Thread.java:722) at LocalTrace: org.apache.hadoop.yarn.exceptions.impl.pb.YarnRemoteExceptionPBImpl: token (HDFS_DELEGATION_TOKEN token tokenid for user) can't be found in cache at org.apache.hadoop.yarn.server.nodemanager.api.protocolrecords.impl.pb.LocalResourceStatusPBImpl.convertFromProtoFormat(LocalResourceStatusPBImpl.java:217) at org.apache.hadoop.yarn.server.nodemanager.api.protocolrecords.impl.pb.LocalResourceStatusPBImpl.getException(LocalResourceStatusPBImpl.java:147) at 
org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService$LocalizerRunner.update(ResourceLocalizationService.java:823) at org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService$LocalizerTracker.processHeartbeat(ResourceLocalizationService.java:497) at org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService.heartbeat(ResourceLocalizationService.java:224) at org.apache.hadoop.yarn.server.nodemanager.api.impl.pb.service.LocalizationProtocolPBServiceImpl.heartbeat(LocalizationProtocolPBServiceImpl.java:46) at org.apache.hadoop.yarn.proto.LocalizationProtocol$LocalizationProtocolService$2.callBlockingMethod(LocalizationProtocol.java:57) at org.apache.hadoop.yarn.ipc.ProtoOverHadoopRpcEngine$Server.call(ProtoOverHadoopRpcEngine.java:353) at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:1476) at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:1472) at java.security.AccessController.doPrivileged(Native Method) at
[jira] [Assigned] (HIVE-7558) HCatLoader reuses credentials across jobs
[ https://issues.apache.org/jira/browse/HIVE-7558?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thiruvel Thirumoolan reassigned HIVE-7558: -- Assignee: Thiruvel Thirumoolan HCatLoader reuses credentials across jobs - Key: HIVE-7558 URL: https://issues.apache.org/jira/browse/HIVE-7558 Project: Hive Issue Type: Bug Components: HCatalog Affects Versions: 0.13.1 Reporter: Thiruvel Thirumoolan Assignee: Thiruvel Thirumoolan Fix For: 0.14.0 HCatLoader reuses credentials of stage1 in stage2 for some of the pig queries. This causes stage-2 to fail, if stage-2 runs for more than 10 mins. Pig queries which loads data using HCatLoader, filters only by partition columns and does an order by will run into this problem. Exceptions will be very similar to the following: 2014-07-22 17:28:49,337 [main] ERROR org.apache.pig.tools.grunt.GruntParser - ERROR 2997: Unable to recreate exception from backed error: AttemptID:attemptid Info:RemoteTrace: org.apache.hadoop.security.token.SecretManager$InvalidToken: token (HDFS_DELEGATION_TOKEN token tokenid for user) can't be found in cache at org.apache.hadoop.ipc.Client.call(Client.java:1095) at org.apache.hadoop.ipc.WritableRpcEngine$Invoker.invoke(WritableRpcEngine.java:195) at $Proxy7.getFileInfo(Unknown Source) at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57) at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) at java.lang.reflect.Method.invoke(Method.java:601) at org.apache.hadoop.io.retry.RetryInvocationHandler.invokeMethod(RetryInvocationHandler.java:102) at org.apache.hadoop.io.retry.RetryInvocationHandler.invoke(RetryInvocationHandler.java:67) at $Proxy7.getFileInfo(Unknown Source) at org.apache.hadoop.hdfs.DFSClient.getFileInfo(DFSClient.java:1305) at org.apache.hadoop.hdfs.DistributedFileSystem.getFileStatus(DistributedFileSystem.java:734) at org.apache.hadoop.yarn.util.FSDownload.copy(FSDownload.java:176) at org.apache.hadoop.yarn.util.FSDownload.access$000(FSDownload.java:51) at org.apache.hadoop.yarn.util.FSDownload$1.run(FSDownload.java:284) at org.apache.hadoop.yarn.util.FSDownload$1.run(FSDownload.java:282) at java.security.AccessController.doPrivileged(Native Method) at javax.security.auth.Subject.doAs(Subject.java:415) at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1300) at org.apache.hadoop.yarn.util.FSDownload.call(FSDownload.java:281) at org.apache.hadoop.yarn.util.FSDownload.call(FSDownload.java:51) at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:334) at java.util.concurrent.FutureTask.run(FutureTask.java:166) at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471) at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:334) at java.util.concurrent.FutureTask.run(FutureTask.java:166) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1110) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:603) at java.lang.Thread.run(Thread.java:722) at LocalTrace: org.apache.hadoop.yarn.exceptions.impl.pb.YarnRemoteExceptionPBImpl: token (HDFS_DELEGATION_TOKEN token tokenid for user) can't be found in cache at org.apache.hadoop.yarn.server.nodemanager.api.protocolrecords.impl.pb.LocalResourceStatusPBImpl.convertFromProtoFormat(LocalResourceStatusPBImpl.java:217) at 
org.apache.hadoop.yarn.server.nodemanager.api.protocolrecords.impl.pb.LocalResourceStatusPBImpl.getException(LocalResourceStatusPBImpl.java:147) at org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService$LocalizerRunner.update(ResourceLocalizationService.java:823) at org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService$LocalizerTracker.processHeartbeat(ResourceLocalizationService.java:497) at org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService.heartbeat(ResourceLocalizationService.java:224) at org.apache.hadoop.yarn.server.nodemanager.api.impl.pb.service.LocalizationProtocolPBServiceImpl.heartbeat(LocalizationProtocolPBServiceImpl.java:46) at org.apache.hadoop.yarn.proto.LocalizationProtocol$LocalizationProtocolService$2.callBlockingMethod(LocalizationProtocol.java:57) at org.apache.hadoop.yarn.ipc.ProtoOverHadoopRpcEngine$Server.call(ProtoOverHadoopRpcEngine.java:353) at
[jira] [Updated] (HIVE-7558) HCatLoader reuses credentials across jobs
[ https://issues.apache.org/jira/browse/HIVE-7558?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thiruvel Thirumoolan updated HIVE-7558: --- Attachment: HIVE-7558.patch Attaching patch. Do not copy job's credentials in HCatLoader's objects. HCatLoader reuses credentials across jobs - Key: HIVE-7558 URL: https://issues.apache.org/jira/browse/HIVE-7558 Project: Hive Issue Type: Bug Components: HCatalog Affects Versions: 0.13.1 Reporter: Thiruvel Thirumoolan Assignee: Thiruvel Thirumoolan Fix For: 0.14.0 Attachments: HIVE-7558.patch HCatLoader reuses credentials of stage1 in stage2 for some of the pig queries. This causes stage-2 to fail, if stage-2 runs for more than 10 mins. Pig queries which loads data using HCatLoader, filters only by partition columns and does an order by will run into this problem. Exceptions will be very similar to the following: 2014-07-22 17:28:49,337 [main] ERROR org.apache.pig.tools.grunt.GruntParser - ERROR 2997: Unable to recreate exception from backed error: AttemptID:attemptid Info:RemoteTrace: org.apache.hadoop.security.token.SecretManager$InvalidToken: token (HDFS_DELEGATION_TOKEN token tokenid for user) can't be found in cache at org.apache.hadoop.ipc.Client.call(Client.java:1095) at org.apache.hadoop.ipc.WritableRpcEngine$Invoker.invoke(WritableRpcEngine.java:195) at $Proxy7.getFileInfo(Unknown Source) at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57) at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) at java.lang.reflect.Method.invoke(Method.java:601) at org.apache.hadoop.io.retry.RetryInvocationHandler.invokeMethod(RetryInvocationHandler.java:102) at org.apache.hadoop.io.retry.RetryInvocationHandler.invoke(RetryInvocationHandler.java:67) at $Proxy7.getFileInfo(Unknown Source) at org.apache.hadoop.hdfs.DFSClient.getFileInfo(DFSClient.java:1305) at org.apache.hadoop.hdfs.DistributedFileSystem.getFileStatus(DistributedFileSystem.java:734) at org.apache.hadoop.yarn.util.FSDownload.copy(FSDownload.java:176) at org.apache.hadoop.yarn.util.FSDownload.access$000(FSDownload.java:51) at org.apache.hadoop.yarn.util.FSDownload$1.run(FSDownload.java:284) at org.apache.hadoop.yarn.util.FSDownload$1.run(FSDownload.java:282) at java.security.AccessController.doPrivileged(Native Method) at javax.security.auth.Subject.doAs(Subject.java:415) at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1300) at org.apache.hadoop.yarn.util.FSDownload.call(FSDownload.java:281) at org.apache.hadoop.yarn.util.FSDownload.call(FSDownload.java:51) at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:334) at java.util.concurrent.FutureTask.run(FutureTask.java:166) at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471) at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:334) at java.util.concurrent.FutureTask.run(FutureTask.java:166) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1110) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:603) at java.lang.Thread.run(Thread.java:722) at LocalTrace: org.apache.hadoop.yarn.exceptions.impl.pb.YarnRemoteExceptionPBImpl: token (HDFS_DELEGATION_TOKEN token tokenid for user) can't be found in cache at org.apache.hadoop.yarn.server.nodemanager.api.protocolrecords.impl.pb.LocalResourceStatusPBImpl.convertFromProtoFormat(LocalResourceStatusPBImpl.java:217) at 
org.apache.hadoop.yarn.server.nodemanager.api.protocolrecords.impl.pb.LocalResourceStatusPBImpl.getException(LocalResourceStatusPBImpl.java:147) at org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService$LocalizerRunner.update(ResourceLocalizationService.java:823) at org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService$LocalizerTracker.processHeartbeat(ResourceLocalizationService.java:497) at org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService.heartbeat(ResourceLocalizationService.java:224) at org.apache.hadoop.yarn.server.nodemanager.api.impl.pb.service.LocalizationProtocolPBServiceImpl.heartbeat(LocalizationProtocolPBServiceImpl.java:46) at org.apache.hadoop.yarn.proto.LocalizationProtocol$LocalizationProtocolService$2.callBlockingMethod(LocalizationProtocol.java:57) at
[jira] [Commented] (HIVE-7547) Add ipAddress and userName to ExecHook
[ https://issues.apache.org/jira/browse/HIVE-7547?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14079912#comment-14079912 ] Thejas M Nair commented on HIVE-7547: - [~szehon] SessionState already provides username and IP address. (IP address part was added recently as part of HIVE-7416). I think SessionState is a good place to store and retrieve this session information. Add ipAddress and userName to ExecHook -- Key: HIVE-7547 URL: https://issues.apache.org/jira/browse/HIVE-7547 Project: Hive Issue Type: New Feature Components: Diagnosability Reporter: Szehon Ho Assignee: Szehon Ho Attachments: HIVE-7547.2.patch, HIVE-7547.patch Auditing tools should be able to know about the ipAddress and userName of the user executing operations. These could be made available through the Hive execution-hooks. -- This message was sent by Atlassian JIRA (v6.2#6252)
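A minimal sketch of what an auditing hook consuming this information might look like. The getUserName()/getIpAddress() accessors on HookContext are what this issue proposes to add, and SessionState.get() is the session-scoped alternative Thejas suggests; treat both as assumptions until the patch lands.

{code:java}
import org.apache.hadoop.hive.ql.hooks.ExecuteWithHookContext;
import org.apache.hadoop.hive.ql.hooks.HookContext;
import org.apache.hadoop.hive.ql.session.SessionState;

// Sketch of an execution hook that records who ran what, and from where.
public class AuditHook implements ExecuteWithHookContext {
  @Override
  public void run(HookContext hookContext) throws Exception {
    String user = hookContext.getUserName();  // proposed by HIVE-7547
    String ip = hookContext.getIpAddress();   // proposed by HIVE-7547
    // Alternative: pull the same session information from SessionState,
    // as suggested in the comment above.
    SessionState ss = SessionState.get();
    System.out.println("audit: user=" + user + " ip=" + ip
        + " query=" + hookContext.getQueryPlan().getQueryString());
  }
}
{code}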
[jira] [Commented] (HIVE-7443) Fix HiveConnection to communicate with Kerberized Hive JDBC server and alternative JDKs
[ https://issues.apache.org/jira/browse/HIVE-7443?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14079933#comment-14079933 ] Hive QA commented on HIVE-7443: --- {color:red}Overall{color}: -1 at least one tests failed Here are the results of testing the latest attachment: https://issues.apache.org/jira/secure/attachment/12658595/HIVE-7443.patch {color:red}ERROR:{color} -1 due to 2 failed/errored test(s), 5838 tests executed *Failed tests:* {noformat} org.apache.hadoop.hive.cli.TestMinimrCliDriver.testCliDriver_ql_rewrite_gbtoidx org.apache.hive.hcatalog.pig.TestOrcHCatLoader.testReadDataPrimitiveTypes {noformat} Test results: http://ec2-174-129-184-35.compute-1.amazonaws.com/jenkins/job/PreCommit-HIVE-TRUNK-Build/108/testReport Console output: http://ec2-174-129-184-35.compute-1.amazonaws.com/jenkins/job/PreCommit-HIVE-TRUNK-Build/108/console Test logs: http://ec2-174-129-184-35.compute-1.amazonaws.com/logs/PreCommit-HIVE-TRUNK-Build-108/ Messages: {noformat} Executing org.apache.hive.ptest.execution.PrepPhase Executing org.apache.hive.ptest.execution.ExecutionPhase Executing org.apache.hive.ptest.execution.ReportingPhase Tests exited with: TestsFailedException: 2 tests failed {noformat} This message is automatically generated. ATTACHMENT ID: 12658595 Fix HiveConnection to communicate with Kerberized Hive JDBC server and alternative JDKs --- Key: HIVE-7443 URL: https://issues.apache.org/jira/browse/HIVE-7443 Project: Hive Issue Type: Bug Components: JDBC, Security Affects Versions: 0.12.0, 0.13.1 Environment: Kerberos Run Hive server2 and client with IBM JDK7.1 Reporter: Yu Gao Assignee: Yu Gao Attachments: HIVE-7443.patch Hive Kerberos authentication has been enabled in my cluster. I ran kinit to initialize the current login user's ticket cache successfully, and then tried to use beeline to connect to Hive Server2, but failed. 
After I manually added some logging to catch the failure exception, this is what I got that caused the failure: beeline !connect jdbc:hive2://hiveserver.host:1/default;principal=hive/hiveserver.host@REALM.COM org.apache.hive.jdbc.HiveDriver scan complete in 2ms Connecting to jdbc:hive2://hiveserver.host:1/default;principal=hive/hiveserver.host@REALM.COM Enter password for jdbc:hive2://hiveserver.host:1/default;principal=hive/hiveserver.host@REALM.COM: 14/07/17 15:12:45 ERROR jdbc.HiveConnection: Failed to open client transport javax.security.sasl.SaslException: Failed to open client transport [Caused by java.io.IOException: Could not instantiate SASL transport] at org.apache.hive.service.auth.KerberosSaslHelper.getKerberosTransport(KerberosSaslHelper.java:78) at org.apache.hive.jdbc.HiveConnection.createBinaryTransport(HiveConnection.java:342) at org.apache.hive.jdbc.HiveConnection.openTransport(HiveConnection.java:200) at org.apache.hive.jdbc.HiveConnection.init(HiveConnection.java:178) at org.apache.hive.jdbc.HiveDriver.connect(HiveDriver.java:105) at java.sql.DriverManager.getConnection(DriverManager.java:582) at java.sql.DriverManager.getConnection(DriverManager.java:198) at org.apache.hive.beeline.DatabaseConnection.connect(DatabaseConnection.java:145) at org.apache.hive.beeline.DatabaseConnection.getConnection(DatabaseConnection.java:186) at org.apache.hive.beeline.Commands.connect(Commands.java:959) at org.apache.hive.beeline.Commands.connect(Commands.java:880) at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:94) at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:55) at java.lang.reflect.Method.invoke(Method.java:619) at org.apache.hive.beeline.ReflectiveCommandHandler.execute(ReflectiveCommandHandler.java:44) at org.apache.hive.beeline.BeeLine.dispatch(BeeLine.java:801) at org.apache.hive.beeline.BeeLine.begin(BeeLine.java:659) at org.apache.hive.beeline.BeeLine.mainWithInputRedirection(BeeLine.java:368) at org.apache.hive.beeline.BeeLine.main(BeeLine.java:351) at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:94) at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:55) at java.lang.reflect.Method.invoke(Method.java:619) at org.apache.hadoop.util.RunJar.main(RunJar.java:212) Caused by: java.io.IOException: Could not instantiate SASL transport at
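For reference, the failing beeline session above amounts to a plain JDBC connect with the server's Kerberos principal carried in the URL; a minimal sketch, assuming the client already holds a TGT from kinit (host, realm, and port 10000 are placeholders):

{code:java}
import java.sql.Connection;
import java.sql.DriverManager;

public class KerberosJdbcExample {
  public static void main(String[] args) throws Exception {
    // The principal=... part tells the driver which server principal to
    // authenticate against; no password is sent when Kerberos is used.
    String url = "jdbc:hive2://hiveserver.host:10000/default;"
        + "principal=hive/hiveserver.host@REALM.COM";
    Class.forName("org.apache.hive.jdbc.HiveDriver");
    try (Connection conn = DriverManager.getConnection(url)) {
      System.out.println("connected: " + !conn.isClosed());
    }
  }
}
{code}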
[jira] [Commented] (HIVE-6988) Hive changes for tez-0.5.x compatibility
[ https://issues.apache.org/jira/browse/HIVE-6988?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14079937#comment-14079937 ] Hive QA commented on HIVE-6988: --- {color:red}Overall{color}: -1 no tests executed Here are the results of testing the latest attachment: https://issues.apache.org/jira/secure/attachment/12658598/HIVE-6988.6.patch Test results: http://ec2-174-129-184-35.compute-1.amazonaws.com/jenkins/job/PreCommit-HIVE-TRUNK-Build/109/testReport Console output: http://ec2-174-129-184-35.compute-1.amazonaws.com/jenkins/job/PreCommit-HIVE-TRUNK-Build/109/console Test logs: http://ec2-174-129-184-35.compute-1.amazonaws.com/logs/PreCommit-HIVE-TRUNK-Build-109/ Messages: {noformat} This message was trimmed, see log for full details As a result, alternative(s) 2 were disabled for that input warning(200): IdentifiersParser.g:68:4: Decision can match input such as LPAREN KW_CASE KW_ARRAY using multiple alternatives: 1, 2 As a result, alternative(s) 2 were disabled for that input warning(200): IdentifiersParser.g:68:4: Decision can match input such as LPAREN KW_NOT SmallintLiteral using multiple alternatives: 1, 2 As a result, alternative(s) 2 were disabled for that input warning(200): IdentifiersParser.g:68:4: Decision can match input such as LPAREN KW_NOT TinyintLiteral using multiple alternatives: 1, 2 As a result, alternative(s) 2 were disabled for that input warning(200): IdentifiersParser.g:68:4: Decision can match input such as LPAREN LPAREN Number using multiple alternatives: 1, 2 As a result, alternative(s) 2 were disabled for that input warning(200): IdentifiersParser.g:115:5: Decision can match input such as KW_CLUSTER KW_BY LPAREN using multiple alternatives: 1, 2 As a result, alternative(s) 2 were disabled for that input warning(200): IdentifiersParser.g:127:5: Decision can match input such as KW_PARTITION KW_BY LPAREN using multiple alternatives: 1, 2 As a result, alternative(s) 2 were disabled for that input warning(200): IdentifiersParser.g:138:5: Decision can match input such as KW_DISTRIBUTE KW_BY LPAREN using multiple alternatives: 1, 2 As a result, alternative(s) 2 were disabled for that input warning(200): IdentifiersParser.g:149:5: Decision can match input such as KW_SORT KW_BY LPAREN using multiple alternatives: 1, 2 As a result, alternative(s) 2 were disabled for that input warning(200): IdentifiersParser.g:166:7: Decision can match input such as STAR using multiple alternatives: 1, 2 As a result, alternative(s) 2 were disabled for that input warning(200): IdentifiersParser.g:179:5: Decision can match input such as KW_UNIONTYPE using multiple alternatives: 5, 6 As a result, alternative(s) 6 were disabled for that input warning(200): IdentifiersParser.g:179:5: Decision can match input such as KW_STRUCT using multiple alternatives: 4, 6 As a result, alternative(s) 6 were disabled for that input warning(200): IdentifiersParser.g:179:5: Decision can match input such as KW_ARRAY using multiple alternatives: 2, 6 As a result, alternative(s) 6 were disabled for that input warning(200): IdentifiersParser.g:261:5: Decision can match input such as KW_DATE StringLiteral using multiple alternatives: 2, 3 As a result, alternative(s) 3 were disabled for that input warning(200): IdentifiersParser.g:261:5: Decision can match input such as KW_NULL using multiple alternatives: 1, 8 As a result, alternative(s) 8 were disabled for that input warning(200): IdentifiersParser.g:261:5: Decision can match input such as KW_FALSE using multiple 
alternatives: 3, 8 As a result, alternative(s) 8 were disabled for that input warning(200): IdentifiersParser.g:261:5: Decision can match input such as KW_TRUE using multiple alternatives: 3, 8 As a result, alternative(s) 8 were disabled for that input warning(200): IdentifiersParser.g:393:5: Decision can match input such as {KW_LIKE, KW_REGEXP, KW_RLIKE} KW_INSERT KW_INTO using multiple alternatives: 2, 9 As a result, alternative(s) 9 were disabled for that input warning(200): IdentifiersParser.g:393:5: Decision can match input such as {KW_LIKE, KW_REGEXP, KW_RLIKE} KW_LATERAL KW_VIEW using multiple alternatives: 2, 9 As a result, alternative(s) 9 were disabled for that input warning(200): IdentifiersParser.g:393:5: Decision can match input such as {KW_LIKE, KW_REGEXP, KW_RLIKE} KW_GROUP KW_BY using multiple alternatives: 2, 9 As a result, alternative(s) 9 were disabled for that input warning(200): IdentifiersParser.g:393:5: Decision can match input such as {KW_LIKE, KW_REGEXP, KW_RLIKE} KW_CLUSTER KW_BY using multiple alternatives: 2, 9 As a result, alternative(s) 9 were disabled for that input warning(200): IdentifiersParser.g:393:5: Decision can match input such as KW_BETWEEN KW_MAP LPAREN using multiple alternatives: 8, 9 As a result, alternative(s) 9 were disabled for that input warning(200):
[jira] [Commented] (HIVE-7526) Research to use groupby transformation to replace Hive existing partitionByKey and SparkCollector combination
[ https://issues.apache.org/jira/browse/HIVE-7526?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14079952#comment-14079952 ] Brock Noland commented on HIVE-7526: Thank you [~csun]! May I ask you to upload the patch to https://reviews.apache.org and post the link here? Research to use groupby transformation to replace Hive existing partitionByKey and SparkCollector combination - Key: HIVE-7526 URL: https://issues.apache.org/jira/browse/HIVE-7526 Project: Hive Issue Type: Task Components: Spark Reporter: Xuefu Zhang Assignee: Chao Attachments: HIVE-7526.2.patch, HIVE-7526.3.patch, HIVE-7526.patch Currently SparkClient shuffles data by calling partitionByKey(). This transformation outputs <key, value> tuples. However, Hive's ExecMapper expects <key, iterator<value>> tuples, and Spark's groupByKey() seems to output this directly. Thus, by using groupByKey, we may be able to avoid Hive's own key clustering mechanism (in HiveReduceFunction). This research task is to try it out. -- This message was sent by Atlassian JIRA (v6.2#6252)
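A small standalone sketch of the output shape in question, assuming the Spark 1.x Java API used on the spark branch: groupByKey() on a pair RDD yields one (key, Iterable<value>) tuple per key, which matches the <key, iterator<value>> shape described above. Names and data are illustrative only.

{code:java}
import java.util.Arrays;

import org.apache.spark.SparkConf;
import org.apache.spark.api.java.JavaPairRDD;
import org.apache.spark.api.java.JavaSparkContext;

import scala.Tuple2;

public class GroupByKeyShape {
  public static void main(String[] args) {
    JavaSparkContext sc = new JavaSparkContext(
        new SparkConf().setMaster("local").setAppName("groupByKey shape"));

    // A pair RDD standing in for Hive's map-side output.
    JavaPairRDD<String, Integer> mapOutput = sc.parallelizePairs(Arrays.asList(
        new Tuple2<String, Integer>("k1", 1),
        new Tuple2<String, Integer>("k1", 2),
        new Tuple2<String, Integer>("k2", 3)));

    // groupByKey() clusters values per key during the shuffle itself,
    // producing the (key, iterable-of-values) shape the reduce side expects.
    JavaPairRDD<String, Iterable<Integer>> grouped = mapOutput.groupByKey();

    for (Tuple2<String, Iterable<Integer>> t : grouped.collect()) {
      System.out.println(t._1() + " -> " + t._2());
    }
    sc.stop();
  }
}
{code}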
[jira] [Commented] (HIVE-7545) Tableau connecting with MapR ODBC driver cannot get more than 43 columns
[ https://issues.apache.org/jira/browse/HIVE-7545?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14079955#comment-14079955 ] George Chow commented on HIVE-7545: --- Is it possible to include the query so that it can be reproduced with a similar table? The error message looks to originate from Hive (SQLOperations::prepare). Tableau connecting with MapR ODBC driver cannot get more than 43 columns Key: HIVE-7545 URL: https://issues.apache.org/jira/browse/HIVE-7545 Project: Hive Issue Type: Bug Environment: Tableau connecting using MapR ODBC driver - Windows Reporter: Venkata krishnan Sowrirajan Fix For: 0.13.1 A Hive table has 170 columns and 1 million rows. When querying all 170 columns of the table from Tableau using the MapR ODBC driver, no more than 43 columns can be queried. Beyond that it gives an error saying [MapR][HiveODBC] (35) Error from Hive: error code: '10007' error message: 'Error while compiling statement: FAILED: SemanticException [Error 10007]: Ambiguous column reference c_43'. -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Created] (HIVE-7559) Move configuration from SparkClient to HiveConf
Brock Noland created HIVE-7559: -- Summary: Move configuration from SparkClient to HiveConf Key: HIVE-7559 URL: https://issues.apache.org/jira/browse/HIVE-7559 Project: Hive Issue Type: Sub-task Components: Spark Affects Versions: spark-branch Reporter: Brock Noland Priority: Minor The SparkClient class has some configuration keys and defaults. These should be moved to HiveConf. -- This message was sent by Atlassian JIRA (v6.2#6252)
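A sketch of the intended refactoring, under stated assumptions: the "before" constants mirror the POC pattern described above, and the SPARK_MASTER ConfVars entry is hypothetical, standing in for whatever entry the eventual patch adds to HiveConf.

{code:java}
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.hive.conf.HiveConf;

public class SparkMasterLookup {
  // Before (the POC pattern): key and default buried in SparkClient.
  private static final String SPARK_MASTER = "spark.master";
  private static final String SPARK_MASTER_DEFAULT = "local";

  static String fromClientConstants(Configuration conf) {
    return conf.get(SPARK_MASTER, SPARK_MASTER_DEFAULT);
  }

  // After: read through a HiveConf ConfVars entry, so the key, default, and
  // doc string live in one place. The SPARK_MASTER member used below is
  // hypothetical -- the actual enum entry would be added by the patch,
  // roughly as: SPARK_MASTER("spark.master", "local", "Spark master URL.")
  static String fromHiveConf(HiveConf conf) {
    return conf.getVar(HiveConf.ConfVars.SPARK_MASTER);
  }
}
{code}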
[jira] [Created] (HIVE-7560) Fix exception handling in POC code
Brock Noland created HIVE-7560: -- Summary: Fix exception handling in POC code Key: HIVE-7560 URL: https://issues.apache.org/jira/browse/HIVE-7560 Project: Hive Issue Type: Sub-task Components: Spark Affects Versions: spark-branch Reporter: Brock Noland The POC code just printed exceptions to stderr. We should either: 1) LOG at INFO/WARN/ERROR, or 2) rethrow (perhaps wrapped in a runtime exception) anything that is a fatal error -- This message was sent by Atlassian JIRA (v6.2#6252)
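A minimal sketch of the two options, using the commons-logging Log that Hive code already carries; the method names and messages are illustrative only.

{code:java}
import org.apache.commons.logging.Log;
import org.apache.commons.logging.LogFactory;

public class ErrorHandlingSketch {
  private static final Log LOG = LogFactory.getLog(ErrorHandlingSketch.class);

  // Option 1: the error is survivable -- log it at an appropriate level
  // instead of printing to stderr.
  static void recoverable(Exception e) {
    LOG.warn("Non-fatal problem while running Spark task", e);
  }

  // Option 2: the error is fatal -- rethrow, wrapped in an unchecked
  // exception, so callers cannot silently swallow it.
  static void fatal(Exception e) {
    throw new RuntimeException("Fatal error while running Spark task", e);
  }
}
{code}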
[jira] [Created] (HIVE-7561) Move from assert to Guava Preconditions.* in Hive on Spark
Brock Noland created HIVE-7561: -- Summary: Move from assert to Guava Preconditions.* in Hive on Spark Key: HIVE-7561 URL: https://issues.apache.org/jira/browse/HIVE-7561 Project: Hive Issue Type: Sub-task Components: Spark Affects Versions: spark-branch Reporter: Brock Noland Hive uses the assert keyword all over the place. The problem is that assertions rarely take effect, since they must be explicitly enabled on the JVM. In the Spark code, e.g. GenSparkUtils, let's use Preconditions.*. -- This message was sent by Atlassian JIRA (v6.2#6252)
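For illustration, a before/after sketch of the proposed change; the method and argument names are hypothetical, but the Guava calls are the standard Preconditions API.

{code:java}
import com.google.common.base.Preconditions;

public class PreconditionsSketch {
  // With the assert keyword, this check silently disappears unless the JVM
  // runs with -ea:
  //   assert work != null : "work cannot be null";
  //
  // Preconditions.* always runs, regardless of JVM flags:
  static void process(Object work, int numTasks) {
    Preconditions.checkNotNull(work, "work cannot be null");
    Preconditions.checkArgument(numTasks > 0,
        "numTasks must be positive: %s", numTasks);
  }
}
{code}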
[jira] [Updated] (HIVE-7561) Move from assert to Guava Preconditions.* in Hive on Spark
[ https://issues.apache.org/jira/browse/HIVE-7561?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Brock Noland updated HIVE-7561: --- Labels: StarterProject (was: newbie) Move from assert to Guava Preconditions.* in Hive on Spark -- Key: HIVE-7561 URL: https://issues.apache.org/jira/browse/HIVE-7561 Project: Hive Issue Type: Sub-task Components: Spark Affects Versions: spark-branch Reporter: Brock Noland Labels: StarterProject Hive uses the assert keyword all over the place. The problem is that assertions rarely take effect, since they must be explicitly enabled on the JVM. In the Spark code, e.g. GenSparkUtils, let's use Preconditions.*. -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Updated] (HIVE-7560) Fix exception handling in POC code
[ https://issues.apache.org/jira/browse/HIVE-7560?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Brock Noland updated HIVE-7560: --- Labels: StarterProject (was: newbie) Fix exception handling in POC code -- Key: HIVE-7560 URL: https://issues.apache.org/jira/browse/HIVE-7560 Project: Hive Issue Type: Sub-task Components: Spark Affects Versions: spark-branch Reporter: Brock Noland Labels: StarterProject The POC code just printed exceptions to stderr. We should either: 1) LOG at INFO/WARN/ERROR, or 2) rethrow (perhaps wrapped in a runtime exception) anything that is a fatal error -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Updated] (HIVE-7559) Move configuration from SparkClient to HiveConf
[ https://issues.apache.org/jira/browse/HIVE-7559?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Brock Noland updated HIVE-7559: --- Labels: StarterProject (was: newbie) Move configuration from SparkClient to HiveConf --- Key: HIVE-7559 URL: https://issues.apache.org/jira/browse/HIVE-7559 Project: Hive Issue Type: Sub-task Components: Spark Affects Versions: spark-branch Reporter: Brock Noland Priority: Minor Labels: StarterProject The SparkClient class has some configuration keys and defaults. These should be moved to HiveConf. -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Updated] (HIVE-7560) StarterProject: Fix exception handling in POC code
[ https://issues.apache.org/jira/browse/HIVE-7560?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Brock Noland updated HIVE-7560: --- Summary: StarterProject: Fix exception handling in POC code (was: Fix exception handling in POC code) StarterProject: Fix exception handling in POC code -- Key: HIVE-7560 URL: https://issues.apache.org/jira/browse/HIVE-7560 Project: Hive Issue Type: Sub-task Components: Spark Affects Versions: spark-branch Reporter: Brock Noland Labels: StarterProject The POC code just printed exceptions to stderr. We should either: 1) LOG at INFO/WARN/ERROR, or 2) rethrow (perhaps wrapped in a runtime exception) anything that is a fatal error -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Updated] (HIVE-7561) StarterProject: Move from assert to Guava Preconditions.* in Hive on Spark
[ https://issues.apache.org/jira/browse/HIVE-7561?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Brock Noland updated HIVE-7561: --- Summary: StarterProject: Move from assert to Guava Preconditions.* in Hive on Spark (was: Move from assert to Guava Preconditions.* in Hive on Spark) StarterProject: Move from assert to Guava Preconditions.* in Hive on Spark -- Key: HIVE-7561 URL: https://issues.apache.org/jira/browse/HIVE-7561 Project: Hive Issue Type: Sub-task Components: Spark Affects Versions: spark-branch Reporter: Brock Noland Labels: StarterProject Hive uses the assert keyword all over the place. The problem is that assertions rarely take effect, since they must be explicitly enabled on the JVM. In the Spark code, e.g. GenSparkUtils, let's use Preconditions.*. -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Updated] (HIVE-7559) StarterProject: Move configuration from SparkClient to HiveConf
[ https://issues.apache.org/jira/browse/HIVE-7559?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Brock Noland updated HIVE-7559: --- Summary: StarterProject: Move configuration from SparkClient to HiveConf (was: Move configuration from SparkClient to HiveConf) StarterProject: Move configuration from SparkClient to HiveConf --- Key: HIVE-7559 URL: https://issues.apache.org/jira/browse/HIVE-7559 Project: Hive Issue Type: Sub-task Components: Spark Affects Versions: spark-branch Reporter: Brock Noland Priority: Minor Labels: StarterProject The SparkClient class has some configuration keys and defaults. These should be moved to HiveConf. -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Assigned] (HIVE-7503) Support Hive's multi-table insert query with Spark
[ https://issues.apache.org/jira/browse/HIVE-7503?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xuefu Zhang reassigned HIVE-7503: - Assignee: Xuefu Zhang Support Hive's multi-table insert query with Spark -- Key: HIVE-7503 URL: https://issues.apache.org/jira/browse/HIVE-7503 Project: Hive Issue Type: Sub-task Components: Spark Reporter: Xuefu Zhang Assignee: Xuefu Zhang For Hive's multi-insert query (https://cwiki.apache.org/confluence/display/Hive/LanguageManual+DML), there may be an MR job for each insert. When we achieve this with Spark, it would be nice if all the inserts could happen concurrently. It seems that this functionality isn't available in Spark. To make things worse, the source of the insert may be re-computed unless it's staged. Even then, the inserts will happen sequentially, hurting performance. This task is to find out what it takes in Spark to enable this without requiring staging of the source or sequential insertion. If this has to be solved in Hive, find an optimal way to do it. -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Commented] (HIVE-7503) Support Hive's multi-table insert query with Spark
[ https://issues.apache.org/jira/browse/HIVE-7503?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14080076#comment-14080076 ] Xuefu Zhang commented on HIVE-7503: --- Assigned to myself for initial research. Support Hive's multi-table insert query with Spark -- Key: HIVE-7503 URL: https://issues.apache.org/jira/browse/HIVE-7503 Project: Hive Issue Type: Sub-task Components: Spark Reporter: Xuefu Zhang Assignee: Xuefu Zhang For Hive's multi-insert query (https://cwiki.apache.org/confluence/display/Hive/LanguageManual+DML), there may be an MR job for each insert. When we achieve this with Spark, it would be nice if all the inserts could happen concurrently. It seems that this functionality isn't available in Spark. To make things worse, the source of the insert may be re-computed unless it's staged. Even then, the inserts will happen sequentially, hurting performance. This task is to find out what it takes in Spark to enable this without requiring staging of the source or sequential insertion. If this has to be solved in Hive, find an optimal way to do it. -- This message was sent by Atlassian JIRA (v6.2#6252)
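A toy sketch of the Spark behavior described above, assuming the Spark 1.x Java API (paths and filters are placeholders): cache() keeps the shared source from being recomputed for each insert, but each save is still a separate Spark job and the jobs run sequentially when issued from a single thread.

{code:java}
import org.apache.spark.api.java.JavaRDD;
import org.apache.spark.api.java.JavaSparkContext;
import org.apache.spark.api.java.function.Function;

public class MultiInsertSketch {
  public static void main(String[] args) {
    JavaSparkContext sc = new JavaSparkContext("local", "multi-insert sketch");
    JavaRDD<String> source = sc.textFile("/tmp/source");  // placeholder path

    // Caching avoids recomputing the shared source once multiple actions
    // consume it -- the "staging" alternative mentioned in the issue.
    source.cache();

    // Each saveAsTextFile() is its own job; issued like this they run one
    // after the other, which is exactly the limitation being researched.
    JavaRDD<String> first = source.filter(new Function<String, Boolean>() {
      public Boolean call(String line) { return line.startsWith("a"); }
    });
    first.saveAsTextFile("/tmp/out1");                    // placeholder path

    JavaRDD<String> rest = source.filter(new Function<String, Boolean>() {
      public Boolean call(String line) { return !line.startsWith("a"); }
    });
    rest.saveAsTextFile("/tmp/out2");                     // placeholder path
    sc.stop();
  }
}
{code}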
[jira] [Reopened] (HIVE-7506) MetadataUpdater: provide a mechanism to edit the statistics of a column in a table (or a partition of a table)
[ https://issues.apache.org/jira/browse/HIVE-7506?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gunther Hagleitner reopened HIVE-7506: -- MetadataUpdater: provide a mechanism to edit the statistics of a column in a table (or a partition of a table) -- Key: HIVE-7506 URL: https://issues.apache.org/jira/browse/HIVE-7506 Project: Hive Issue Type: New Feature Components: Database/Schema Reporter: pengcheng xiong Assignee: pengcheng xiong Priority: Critical Original Estimate: 252h Remaining Estimate: 252h Two motivations: (1) CBO depends heavily on the statistics of a column in a table (or a partition of a table). If we would like to test whether CBO chooses the best plan under different statistics, it would be time-consuming to load the whole table and create the statistics from the ground up. (2) As the database runs, the statistics of a column in a table (or a partition of a table) may change, so we need a mechanism to keep them in sync. We propose the following command to achieve that: ALTER TABLE table_name PARTITION partition_spec [COLUMN col_name] UPDATE STATISTICS col_statistics [COMMENT col_comment] -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Commented] (HIVE-7488) pass column names being used for inputs to authorization api
[ https://issues.apache.org/jira/browse/HIVE-7488?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14080118#comment-14080118 ] Jason Dere commented on HIVE-7488: -- +1. Test failures not related? pass column names being used for inputs to authorization api Key: HIVE-7488 URL: https://issues.apache.org/jira/browse/HIVE-7488 Project: Hive Issue Type: Bug Components: Authorization Reporter: Thejas M Nair Assignee: Thejas M Nair Attachments: HIVE-7488.1.patch, HIVE-7488.2.patch, HIVE-7488.3.patch.txt, HIVE-7488.4.patch, HIVE-7488.5.patch, HIVE-7488.6.patch HivePrivilegeObject in the authorization api has support for columns, but the columns being used are not being populated for non grant-revoke queries. This is for enabling any implementation of the api to use this column information for its authorization decisions. -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Commented] (HIVE-7506) MetadataUpdater: provide a mechanism to edit the statistics of a column in a table (or a partition of a table)
[ https://issues.apache.org/jira/browse/HIVE-7506?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14080125#comment-14080125 ] Gunther Hagleitner commented on HIVE-7506: -- [~damien.carol] I think the use for this is different from analyze. The ability to update certain stats without scanning any data or without hacking the backend db is useful in a number of cases. It helps (especially for CBO work) to set up unit tests quickly and verify both CBO and the stats subsystem. It also helps when experimenting with the system if you're just trying out hive/hadoop on a small cluster. Finally, it gives you a quick and clean way to fix things when something went wrong with respect to stats in your environment. MetadataUpdater: provide a mechanism to edit the statistics of a column in a table (or a partition of a table) -- Key: HIVE-7506 URL: https://issues.apache.org/jira/browse/HIVE-7506 Project: Hive Issue Type: New Feature Components: Database/Schema Reporter: pengcheng xiong Assignee: pengcheng xiong Priority: Minor Original Estimate: 252h Remaining Estimate: 252h Two motivations: (1) CBO depends heavily on the statistics of a column in a table (or a partition of a table). If we would like to test whether CBO chooses the best plan under different statistics, it would be time-consuming to load the whole table and create the statistics from the ground up. (2) As the database runs, the statistics of a column in a table (or a partition of a table) may change, so we need a mechanism to keep them in sync. We propose the following command to achieve that: ALTER TABLE table_name PARTITION partition_spec [COLUMN col_name] UPDATE STATISTICS col_statistics [COMMENT col_comment] -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Commented] (HIVE-7390) Make quote character optional and configurable in BeeLine CSV/TSV output
[ https://issues.apache.org/jira/browse/HIVE-7390?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14080128#comment-14080128 ] Lars Francke commented on HIVE-7390: You summed it up nicely, thanks. The original intention of this issue was to make the quote character optional and configurable, so Jim must have had a use case for that. I can't think of a good one at the moment. I can, however, think of a good reason for a configurable delimiter. Comma, semicolon, or tab occur relatively frequently in data, but some other character (\001 or |) might not occur in the data, and being able to pick it as the delimiter makes parsing much simpler (just split on the delimiter instead of looking for quoted strings etc.). This is especially interesting when you then want to mount another table on that data in Hive or post-process it in any other simple way where you don't have access to a full-fledged CSV parsing library. So: picking the delimiter is often very helpful in avoiding a whole class of parsing issues and lets you just split on the delimiter. I think we can easily catch the most common issues with two changes: 1. Fix the current CSV and TSV formats. As you say: no debate on that. 2. Allow the delimiter to be specified and keep the normal quoting mode. That allows everyone who really understands their data to avoid quoting, and everyone else can get properly formatted CSVs for a full CSV parser. In the same vein I think that {{surroundingSpacesNeedQuotes}} should stay disabled. But as I said: this is kinda hijacking Jim's original issue... Make quote character optional and configurable in BeeLine CSV/TSV output Key: HIVE-7390 URL: https://issues.apache.org/jira/browse/HIVE-7390 Project: Hive Issue Type: New Feature Components: Clients Affects Versions: 0.13.1 Reporter: Jim Halfpenny Assignee: Ferdinand Xu Attachments: HIVE-7390.1.patch, HIVE-7390.2.patch, HIVE-7390.3.patch, HIVE-7390.4.patch, HIVE-7390.patch Currently, when either the CSV or TSV output format is used in beeline, each column is wrapped in single quotes. Quote wrapping of columns should be optional and the user should be able to choose the character used to wrap the columns. -- This message was sent by Atlassian JIRA (v6.2#6252)
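A tiny sketch of the consumer-side payoff Lars describes: if the delimiter (here \001) never occurs in the data, each output row can be split directly, with no quote handling or CSV parser. Pattern.quote() guards delimiters that happen to be regex metacharacters, such as '|'. The sample row is made up.

{code:java}
import java.util.regex.Pattern;

public class DelimiterSplitSketch {
  public static void main(String[] args) {
    // A delimiter chosen because it does not appear in the data.
    String delim = "\u0001";
    String row = "1\u0001Alice\u0001New York, NY"; // commas need no escaping
    // The -1 limit keeps trailing empty fields instead of dropping them.
    String[] fields = row.split(Pattern.quote(delim), -1);
    for (String f : fields) {
      System.out.println(f);
    }
  }
}
{code}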
[jira] [Updated] (HIVE-7096) Support grouped splits in Tez partitioned broadcast join
[ https://issues.apache.org/jira/browse/HIVE-7096?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vikram Dixit K updated HIVE-7096: - Affects Version/s: tez-branch Support grouped splits in Tez partitioned broadcast join Key: HIVE-7096 URL: https://issues.apache.org/jira/browse/HIVE-7096 Project: Hive Issue Type: Bug Components: Tez Affects Versions: tez-branch Reporter: Gunther Hagleitner Assignee: Vikram Dixit K Attachments: HIVE-7096.1.patch, HIVE-7096.2.patch, HIVE-7096.3.patch, HIVE-7096.4.patch, HIVE-7096.tez.branch.patch Same checks for schema + deser + file format done in HiveSplitGenerator need to be done in the CustomPartitionVertex. -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Updated] (HIVE-7096) Support grouped splits in Tez partitioned broadcast join
[ https://issues.apache.org/jira/browse/HIVE-7096?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vikram Dixit K updated HIVE-7096: - Attachment: HIVE-7096.4.patch This patch works with tez-0.5 only. Since only the tez branch has been upgraded to that version, it is applicable only to that Hive branch. Support grouped splits in Tez partitioned broadcast join Key: HIVE-7096 URL: https://issues.apache.org/jira/browse/HIVE-7096 Project: Hive Issue Type: Bug Components: Tez Affects Versions: tez-branch Reporter: Gunther Hagleitner Assignee: Vikram Dixit K Attachments: HIVE-7096.1.patch, HIVE-7096.2.patch, HIVE-7096.3.patch, HIVE-7096.4.patch, HIVE-7096.tez.branch.patch Same checks for schema + deser + file format done in HiveSplitGenerator need to be done in the CustomPartitionVertex. -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Updated] (HIVE-7096) Support grouped splits in Tez partitioned broadcast join
[ https://issues.apache.org/jira/browse/HIVE-7096?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vikram Dixit K updated HIVE-7096: - Component/s: Tez Support grouped splits in Tez partitioned broadcast join Key: HIVE-7096 URL: https://issues.apache.org/jira/browse/HIVE-7096 Project: Hive Issue Type: Bug Components: Tez Affects Versions: tez-branch Reporter: Gunther Hagleitner Assignee: Vikram Dixit K Attachments: HIVE-7096.1.patch, HIVE-7096.2.patch, HIVE-7096.3.patch, HIVE-7096.4.patch, HIVE-7096.tez.branch.patch Same checks for schema + deser + file format done in HiveSplitGenerator need to be done in the CustomPartitionVertex. -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Updated] (HIVE-7096) Support grouped splits in Tez partitioned broadcast join
[ https://issues.apache.org/jira/browse/HIVE-7096?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vikram Dixit K updated HIVE-7096: - Attachment: HIVE-7096.4.patch Support grouped splits in Tez partitioned broadcast join Key: HIVE-7096 URL: https://issues.apache.org/jira/browse/HIVE-7096 Project: Hive Issue Type: Bug Components: Tez Affects Versions: tez-branch Reporter: Gunther Hagleitner Assignee: Vikram Dixit K Attachments: HIVE-7096.1.patch, HIVE-7096.2.patch, HIVE-7096.3.patch, HIVE-7096.tez.branch.patch Same checks for schema + deser + file format done in HiveSplitGenerator need to be done in the CustomPartitionVertex. -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Updated] (HIVE-7096) Support grouped splits in Tez partitioned broadcast join
[ https://issues.apache.org/jira/browse/HIVE-7096?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vikram Dixit K updated HIVE-7096: - Attachment: (was: HIVE-7096.4.patch) Support grouped splits in Tez partitioned broadcast join Key: HIVE-7096 URL: https://issues.apache.org/jira/browse/HIVE-7096 Project: Hive Issue Type: Bug Components: Tez Affects Versions: tez-branch Reporter: Gunther Hagleitner Assignee: Vikram Dixit K Attachments: HIVE-7096.1.patch, HIVE-7096.2.patch, HIVE-7096.3.patch, HIVE-7096.tez.branch.patch Same checks for schema + deser + file format done in HiveSplitGenerator need to be done in the CustomPartitionVertex. -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Updated] (HIVE-7096) Support grouped splits in Tez partitioned broadcast join
[ https://issues.apache.org/jira/browse/HIVE-7096?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vikram Dixit K updated HIVE-7096: - Attachment: (was: HIVE-7096.4.patch) Support grouped splits in Tez partitioned broadcast join Key: HIVE-7096 URL: https://issues.apache.org/jira/browse/HIVE-7096 Project: Hive Issue Type: Bug Components: Tez Affects Versions: tez-branch Reporter: Gunther Hagleitner Assignee: Vikram Dixit K Attachments: HIVE-7096.1.patch, HIVE-7096.2.patch, HIVE-7096.3.patch, HIVE-7096.tez.branch.patch Same checks for schema + deser + file format done in HiveSplitGenerator need to be done in the CustomPartitionVertex. -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Updated] (HIVE-7096) Support grouped splits in Tez partitioned broadcast join
[ https://issues.apache.org/jira/browse/HIVE-7096?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vikram Dixit K updated HIVE-7096: - Attachment: HIVE-7096.4.patch Support grouped splits in Tez partitioned broadcast join Key: HIVE-7096 URL: https://issues.apache.org/jira/browse/HIVE-7096 Project: Hive Issue Type: Bug Components: Tez Affects Versions: tez-branch Reporter: Gunther Hagleitner Assignee: Vikram Dixit K Attachments: HIVE-7096.1.patch, HIVE-7096.2.patch, HIVE-7096.3.patch, HIVE-7096.4.patch, HIVE-7096.tez.branch.patch Same checks for schema + deser + file format done in HiveSplitGenerator need to be done in the CustomPartitionVertex. -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Commented] (HIVE-7509) Fast stripe level merging for ORC
[ https://issues.apache.org/jira/browse/HIVE-7509?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14080178#comment-14080178 ] Hive QA commented on HIVE-7509: --- {color:red}Overall{color}: -1 at least one tests failed Here are the results of testing the latest attachment: https://issues.apache.org/jira/secure/attachment/12658680/HIVE-7509.5.patch {color:red}ERROR:{color} -1 due to 1 failed/errored test(s), 5842 tests executed *Failed tests:* {noformat} org.apache.hadoop.hive.cli.TestMinimrCliDriver.testCliDriver_ql_rewrite_gbtoidx {noformat} Test results: http://ec2-174-129-184-35.compute-1.amazonaws.com/jenkins/job/PreCommit-HIVE-TRUNK-Build/110/testReport Console output: http://ec2-174-129-184-35.compute-1.amazonaws.com/jenkins/job/PreCommit-HIVE-TRUNK-Build/110/console Test logs: http://ec2-174-129-184-35.compute-1.amazonaws.com/logs/PreCommit-HIVE-TRUNK-Build-110/ Messages: {noformat} Executing org.apache.hive.ptest.execution.PrepPhase Executing org.apache.hive.ptest.execution.ExecutionPhase Executing org.apache.hive.ptest.execution.ReportingPhase Tests exited with: TestsFailedException: 1 tests failed {noformat} This message is automatically generated. ATTACHMENT ID: 12658680 Fast stripe level merging for ORC - Key: HIVE-7509 URL: https://issues.apache.org/jira/browse/HIVE-7509 Project: Hive Issue Type: Bug Affects Versions: 0.14.0 Reporter: Prasanth J Assignee: Prasanth J Labels: orcfile Attachments: HIVE-7509.1.patch, HIVE-7509.2.patch, HIVE-7509.3.patch, HIVE-7509.4.patch, HIVE-7509.5.patch Similar to HIVE-1950, add support for fast stripe-level merging of ORC files through the CONCATENATE command and a conditional merge task. This fast merging is ideal for merging many small ORC files into a larger file without decompressing and decoding the data of the small ORC files. -- This message was sent by Atlassian JIRA (v6.2#6252)
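For context, the CONCATENATE command mentioned above is issued like any other DDL statement, e.g. over JDBC; a minimal sketch with a placeholder table name and connection URL:

{code:java}
import java.sql.Connection;
import java.sql.DriverManager;
import java.sql.Statement;

public class OrcConcatenateExample {
  public static void main(String[] args) throws Exception {
    Class.forName("org.apache.hive.jdbc.HiveDriver");
    try (Connection conn =
             DriverManager.getConnection("jdbc:hive2://localhost:10000/default");
         Statement stmt = conn.createStatement()) {
      // Merge the many small files of an ORC table into fewer, larger ones.
      // With this patch the merge happens at stripe level, without
      // decompressing and re-encoding the data.
      stmt.execute("ALTER TABLE orc_events CONCATENATE");
    }
  }
}
{code}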
[jira] [Updated] (HIVE-7096) Support grouped splits in Tez partitioned broadcast join
[ https://issues.apache.org/jira/browse/HIVE-7096?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vikram Dixit K updated HIVE-7096: - Attachment: (was: HIVE-7096.4.patch) Support grouped splits in Tez partitioned broadcast join Key: HIVE-7096 URL: https://issues.apache.org/jira/browse/HIVE-7096 Project: Hive Issue Type: Bug Components: Tez Affects Versions: tez-branch Reporter: Gunther Hagleitner Assignee: Vikram Dixit K Attachments: HIVE-7096.1.patch, HIVE-7096.2.patch, HIVE-7096.3.patch, HIVE-7096.4.patch, HIVE-7096.tez.branch.patch Same checks for schema + deser + file format done in HiveSplitGenerator need to be done in the CustomPartitionVertex. -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Updated] (HIVE-7096) Support grouped splits in Tez partitioned broadcast join
[ https://issues.apache.org/jira/browse/HIVE-7096?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vikram Dixit K updated HIVE-7096: - Attachment: HIVE-7096.4.patch Support grouped splits in Tez partitioned broadcast join Key: HIVE-7096 URL: https://issues.apache.org/jira/browse/HIVE-7096 Project: Hive Issue Type: Bug Components: Tez Affects Versions: tez-branch Reporter: Gunther Hagleitner Assignee: Vikram Dixit K Attachments: HIVE-7096.1.patch, HIVE-7096.2.patch, HIVE-7096.3.patch, HIVE-7096.4.patch, HIVE-7096.tez.branch.patch Same checks for schema + deser + file format done in HiveSplitGenerator need to be done in the CustomPartitionVertex. -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Commented] (HIVE-7509) Fast stripe level merging for ORC
[ https://issues.apache.org/jira/browse/HIVE-7509?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14080204#comment-14080204 ] Lefty Leverenz commented on HIVE-7509: -- Good doc fixes, thanks [~prasanth_j]. +1 for docs only. Fast stripe level merging for ORC - Key: HIVE-7509 URL: https://issues.apache.org/jira/browse/HIVE-7509 Project: Hive Issue Type: Bug Affects Versions: 0.14.0 Reporter: Prasanth J Assignee: Prasanth J Labels: orcfile Attachments: HIVE-7509.1.patch, HIVE-7509.2.patch, HIVE-7509.3.patch, HIVE-7509.4.patch, HIVE-7509.5.patch Similar to HIVE-1950, add support for fast stripe-level merging of ORC files through the CONCATENATE command and a conditional merge task. This fast merging is ideal for merging many small ORC files into a larger file without decompressing and decoding the data of the small ORC files. -- This message was sent by Atlassian JIRA (v6.2#6252)
Re: Review Request 24084: HIVE-7547 - Add ipAddress and userName to ExecHook
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/24084/ --- (Updated July 30, 2014, 11:40 p.m.) Review request for hive. Changes --- Incorporating Brock's and Thejas's review comments. As Thejas pointed out, it turns out ipAddress is already stored in SessionState, so using that makes the code a lot cleaner. However, the ipAddress calculated in TSetIpAddressProcessor doesn't work in Kerberos mode, so I am fixing it so it's set in all modes. Bugs: HIVE-7547 https://issues.apache.org/jira/browse/HIVE-7547 Repository: hive-git Description --- Passing the ipAddress and userName (already calculated in ThriftCLIService for other purposes) through several layers down to the hooks. Diffs (updated) - itests/hive-minikdc/src/test/java/org/apache/hive/minikdc/TestHs2HooksWithMiniKdc.java PRE-CREATION itests/hive-unit/src/test/java/org/apache/hadoop/hive/hooks/TestHs2Hooks.java PRE-CREATION ql/src/java/org/apache/hadoop/hive/ql/Driver.java e512199 ql/src/java/org/apache/hadoop/hive/ql/hooks/HookContext.java b11cb86 service/src/java/org/apache/hive/service/cli/CLIService.java add37a1 service/src/java/org/apache/hive/service/cli/session/HiveSession.java 9785e95 service/src/java/org/apache/hive/service/cli/session/SessionManager.java 816bea4 service/src/java/org/apache/hive/service/cli/thrift/ThriftCLIService.java 5c87bcb Diff: https://reviews.apache.org/r/24084/diff/ Testing --- Added tests in both Kerberos and non-Kerberos modes. Thanks, Szehon Ho
[jira] [Updated] (HIVE-7547) Add ipAddress and userName to ExecHook
[ https://issues.apache.org/jira/browse/HIVE-7547?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Szehon Ho updated HIVE-7547: Attachment: HIVE-7547.3.patch Thanks Thejas for pointing that out. I refactored the code to use SessionState. The SessionState's ipAddress didn't seem to be set for Kerberos mode, so I'm also changing how it's being set so it works for all modes. Let me know if it's not right. Add ipAddress and userName to ExecHook -- Key: HIVE-7547 URL: https://issues.apache.org/jira/browse/HIVE-7547 Project: Hive Issue Type: New Feature Components: Diagnosability Reporter: Szehon Ho Assignee: Szehon Ho Attachments: HIVE-7547.2.patch, HIVE-7547.3.patch, HIVE-7547.patch Auditing tools should be able to know about the ipAddress and userName of the user executing operations. These could be made available through the Hive execution-hooks. -- This message was sent by Atlassian JIRA (v6.2#6252)
Re: Review Request 24084: HIVE-7547 - Add ipAddress and userName to ExecHook
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/24084/ --- (Updated July 30, 2014, 11:46 p.m.) Review request for hive. Bugs: HIVE-7547 https://issues.apache.org/jira/browse/HIVE-7547 Repository: hive-git Description --- Passing the ipAddress and userName (already calculated in ThriftCLIService for other purposes) through several layers down to the hooks. Diffs (updated) - itests/hive-minikdc/src/test/java/org/apache/hive/minikdc/TestHs2HooksWithMiniKdc.java PRE-CREATION itests/hive-unit/src/test/java/org/apache/hadoop/hive/hooks/TestHs2Hooks.java PRE-CREATION ql/src/java/org/apache/hadoop/hive/ql/Driver.java e512199 ql/src/java/org/apache/hadoop/hive/ql/hooks/HookContext.java b11cb86 service/src/java/org/apache/hive/service/cli/CLIService.java add37a1 service/src/java/org/apache/hive/service/cli/session/HiveSession.java 9785e95 service/src/java/org/apache/hive/service/cli/session/SessionManager.java 816bea4 service/src/java/org/apache/hive/service/cli/thrift/ThriftCLIService.java 5c87bcb Diff: https://reviews.apache.org/r/24084/diff/ Testing --- Added tests in both kerberos and non-kerberos mode. Thanks, Szehon Ho
[jira] [Created] (HIVE-7562) Cleanup ExecReducer
Brock Noland created HIVE-7562: -- Summary: Cleanup ExecReducer Key: HIVE-7562 URL: https://issues.apache.org/jira/browse/HIVE-7562 Project: Hive Issue Type: Improvement Reporter: Brock Noland Attachments: HIVE-7562.patch ExecReducer declares member variables in arbitrary order and with inconsistent visibility. -- This message was sent by Atlassian JIRA (v6.2#6252)
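A small sketch of the convention such a cleanup typically aims for (not the actual ExecReducer; all names below are illustrative): constants first, then instance state, each with an explicit, deliberate access modifier.

{code:java}
// Illustrative layout only: constants grouped at the top, then instance
// fields, kept private unless wider access is genuinely needed.
public class TidyReducer {
  private static final long MAX_ROWS = 100;  // constants first

  private boolean abort;                     // then instance state, private
  private long rowCount;

  public boolean isAborted() {
    return abort;
  }

  public void incrRows() {
    rowCount++;
    if (rowCount > MAX_ROWS) {
      abort = true;
    }
  }
}
{code}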
[jira] [Updated] (HIVE-7547) Add ipAddress and userName to ExecHook
[ https://issues.apache.org/jira/browse/HIVE-7547?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Szehon Ho updated HIVE-7547: Attachment: HIVE-7547.4.patch Add ipAddress and userName to ExecHook -- Key: HIVE-7547 URL: https://issues.apache.org/jira/browse/HIVE-7547 Project: Hive Issue Type: New Feature Components: Diagnosability Reporter: Szehon Ho Assignee: Szehon Ho Attachments: HIVE-7547.2.patch, HIVE-7547.3.patch, HIVE-7547.4.patch, HIVE-7547.patch Auditing tools should be able to know about the ipAddress and userName of the user executing operations. These could be made available through the Hive execution-hooks. -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Updated] (HIVE-7562) Cleanup ExecReducer
[ https://issues.apache.org/jira/browse/HIVE-7562?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Brock Noland updated HIVE-7562: --- Attachment: HIVE-7562.patch Cleanup ExecReducer --- Key: HIVE-7562 URL: https://issues.apache.org/jira/browse/HIVE-7562 Project: Hive Issue Type: Improvement Reporter: Brock Noland Attachments: HIVE-7562.patch ExecReducer declares member variables in arbitrary order and with inconsistent visibility. -- This message was sent by Atlassian JIRA (v6.2#6252)