[GitHub] hive pull request #251: HIVE-14836: Test the predicate pushing down support ...

2017-09-14 Thread winningsix
GitHub user winningsix opened a pull request:

https://github.com/apache/hive/pull/251

HIVE-14836: Test the predicate pushing down support for Parquet vecto…

…rization read path

Add more unit test for Predicate pushing down for Parquet Vectorization.

You can merge this pull request into a Git repository by running:

$ git pull https://github.com/winningsix/hive HIVE-14836

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/hive/pull/251.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #251


commit d6a0fc65bea35db9f557bca923838a8bfd7266dc
Author: Ferdinand Xu <cheng.a...@intel.com>
Date:   2017-09-14T13:50:57Z

HIVE-14836: Test the predicate pushing down support for Parquet 
vectorization read path




---


[GitHub] hive pull request #113: HIVE-15112 Implement Parquet vectorization reader fo...

2016-11-20 Thread winningsix
Github user winningsix closed the pull request at:

https://github.com/apache/hive/pull/113


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] hive pull request #116: HIVE-15112: Implement Parquet vectorization reader f...

2016-11-20 Thread winningsix
GitHub user winningsix opened a pull request:

https://github.com/apache/hive/pull/116

HIVE-15112: Implement Parquet vectorization reader for Complex types

Refactor UT

You can merge this pull request into a Git repository by running:

$ git pull https://github.com/winningsix/hive HIVE-15112-new

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/hive/pull/116.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #116


commit 825175086bdb0d1b26ca084ba4e741e4dca637c9
Author: Ferdinand Xu <cheng.a...@intel.com>
Date:   2016-11-07T06:10:16Z

HIVE-15112: Implement Parquet vectorization reader for Complex types

Refactor UT




---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] hive pull request #113: HIVE-15112 Implement Parquet vectorization reader fo...

2016-11-09 Thread winningsix
GitHub user winningsix opened a pull request:

https://github.com/apache/hive/pull/113

HIVE-15112 Implement Parquet vectorization reader for Struct type

Patch includes:
1. support for struct type
2. UT refine

To be done:
QTest for struct type

You can merge this pull request into a Git repository by running:

$ git pull https://github.com/winningsix/hive complex_types

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/hive/pull/113.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #113


commit 37f50c7629b5ef2a8fb6e9f63caaec6223abf308
Author: Ferdinand Xu <cheng.a...@intel.com>
Date:   2016-09-01T22:15:31Z

HIVE-14815: Support vectorization for Parquet

clean code and add qtest

Refine code

Clean code

Clean up code

Clean up

clean up code

Update qfile output files

Clean up code

Address comments

Avoid creating new HiveDecimalWritable object

Address more comments

Remove unused imports

Address further comments

Fix NPE

Fix for failed cases

commit 891b219838e4978f2eb4d41c0016214d44cc1bb7
Author: Ferdinand Xu <cheng.a...@intel.com>
Date:   2016-11-07T06:10:16Z

HIVE-15112: Implement Parquet vectorization reader for Complex types

commit 26e513a2ac67dcfb05875e6ad7ba07f158be9073
Author: Ferdinand Xu <cheng.a...@intel.com>
Date:   2016-11-09T19:49:46Z

Refactor UT




---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] hive pull request #105: HIVE-13873 Column pruning for nested fields

2016-09-28 Thread winningsix
GitHub user winningsix opened a pull request:

https://github.com/apache/hive/pull/105

HIVE-13873 Column pruning for nested fields



You can merge this pull request into a Git repository by running:

$ git pull https://github.com/winningsix/hive HIVE-13873

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/hive/pull/105.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #105


commit ea462c256f773410c7023dcbfbe365c7cc8200b6
Author: Ferdinand Xu <cheng.a...@intel.com>
Date:   2016-09-28T01:15:51Z

HIVE-13873 Column pruning for nested fields




---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] hive pull request #103: HIVE-14029: Update Spark version to 2.0.0

2016-09-27 Thread winningsix
Github user winningsix closed the pull request at:

https://github.com/apache/hive/pull/103


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] hive pull request #104: HIVE-14815: Support vectorization for Parquet

2016-09-22 Thread winningsix
GitHub user winningsix opened a pull request:

https://github.com/apache/hive/pull/104

HIVE-14815: Support vectorization for Parquet

This patch includes the following changes:
1. Implement a vectorized Page reader which support dictionary and RLE 
encoding.
2. Enable vectorization for Parquet input format.
3. Support several data types
This is a WIP jira.

You can merge this pull request into a Git repository by running:

$ git pull https://github.com/winningsix/hive vectorization_parquet

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/hive/pull/104.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #104


commit a38c766e09bc1c3728fa413767b9fbaa19a4b005
Author: Ferdinand Xu <cheng.a...@intel.com>
Date:   2016-09-01T22:15:31Z

HIVE-14815: Support vectorization for Parquet




---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] hive pull request #103: HIVE-14029: Update Spark version to 2.0.0

2016-09-22 Thread winningsix
GitHub user winningsix opened a pull request:

https://github.com/apache/hive/pull/103

HIVE-14029: Update Spark version to 2.0.0

Changes include:
* Spark API updates:

1. SparkShuffler#call return Iterator instead of Iterable
2. SparkListener -> JavaSparkListener
3. InputMetrics constructor doesn’t accept readMethod
4. Method remoteBlocksFetched and localBlocksFetched in ShuffleReadMetrics 
return long type instead of integer

* Dependency upgrade:

1. Jackson: 2.4.2 -> 2.6.5
2. Netty version: 4.0.23.Final -> 4.0.29.Final
3. Scala binary version: 2.10 -> 2.11
4. Scala version: 2.10.4 -> 2.11.8

Test done by smoke tests in a cluster and integration test in Jenkins

You can merge this pull request into a Git repository by running:

$ git pull https://github.com/winningsix/hive HIVE-14029

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/hive/pull/103.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #103


commit 965e57295a83b06db61b22f3fda0bb19e47c248a
Author: Ferdinand Xu <cheng.a...@intel.com>
Date:   2016-09-17T19:10:04Z

HIVE-14029: Update Spark version to 2.0.0




---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---