[GitHub] hive pull request #251: HIVE-14836: Test the predicate pushing down support ...
GitHub user winningsix opened a pull request: https://github.com/apache/hive/pull/251 HIVE-14836: Test the predicate pushing down support for Parquet vecto⦠â¦rization read path Add more unit test for Predicate pushing down for Parquet Vectorization. You can merge this pull request into a Git repository by running: $ git pull https://github.com/winningsix/hive HIVE-14836 Alternatively you can review and apply these changes as the patch at: https://github.com/apache/hive/pull/251.patch To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message: This closes #251 commit d6a0fc65bea35db9f557bca923838a8bfd7266dc Author: Ferdinand Xu <cheng.a...@intel.com> Date: 2017-09-14T13:50:57Z HIVE-14836: Test the predicate pushing down support for Parquet vectorization read path ---
[GitHub] hive pull request #113: HIVE-15112 Implement Parquet vectorization reader fo...
Github user winningsix closed the pull request at: https://github.com/apache/hive/pull/113 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[GitHub] hive pull request #116: HIVE-15112: Implement Parquet vectorization reader f...
GitHub user winningsix opened a pull request: https://github.com/apache/hive/pull/116 HIVE-15112: Implement Parquet vectorization reader for Complex types Refactor UT You can merge this pull request into a Git repository by running: $ git pull https://github.com/winningsix/hive HIVE-15112-new Alternatively you can review and apply these changes as the patch at: https://github.com/apache/hive/pull/116.patch To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message: This closes #116 commit 825175086bdb0d1b26ca084ba4e741e4dca637c9 Author: Ferdinand Xu <cheng.a...@intel.com> Date: 2016-11-07T06:10:16Z HIVE-15112: Implement Parquet vectorization reader for Complex types Refactor UT --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[GitHub] hive pull request #113: HIVE-15112 Implement Parquet vectorization reader fo...
GitHub user winningsix opened a pull request: https://github.com/apache/hive/pull/113 HIVE-15112 Implement Parquet vectorization reader for Struct type Patch includes: 1. support for struct type 2. UT refine To be done: QTest for struct type You can merge this pull request into a Git repository by running: $ git pull https://github.com/winningsix/hive complex_types Alternatively you can review and apply these changes as the patch at: https://github.com/apache/hive/pull/113.patch To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message: This closes #113 commit 37f50c7629b5ef2a8fb6e9f63caaec6223abf308 Author: Ferdinand Xu <cheng.a...@intel.com> Date: 2016-09-01T22:15:31Z HIVE-14815: Support vectorization for Parquet clean code and add qtest Refine code Clean code Clean up code Clean up clean up code Update qfile output files Clean up code Address comments Avoid creating new HiveDecimalWritable object Address more comments Remove unused imports Address further comments Fix NPE Fix for failed cases commit 891b219838e4978f2eb4d41c0016214d44cc1bb7 Author: Ferdinand Xu <cheng.a...@intel.com> Date: 2016-11-07T06:10:16Z HIVE-15112: Implement Parquet vectorization reader for Complex types commit 26e513a2ac67dcfb05875e6ad7ba07f158be9073 Author: Ferdinand Xu <cheng.a...@intel.com> Date: 2016-11-09T19:49:46Z Refactor UT --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[GitHub] hive pull request #105: HIVE-13873 Column pruning for nested fields
GitHub user winningsix opened a pull request: https://github.com/apache/hive/pull/105 HIVE-13873 Column pruning for nested fields You can merge this pull request into a Git repository by running: $ git pull https://github.com/winningsix/hive HIVE-13873 Alternatively you can review and apply these changes as the patch at: https://github.com/apache/hive/pull/105.patch To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message: This closes #105 commit ea462c256f773410c7023dcbfbe365c7cc8200b6 Author: Ferdinand Xu <cheng.a...@intel.com> Date: 2016-09-28T01:15:51Z HIVE-13873 Column pruning for nested fields --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[GitHub] hive pull request #103: HIVE-14029: Update Spark version to 2.0.0
Github user winningsix closed the pull request at: https://github.com/apache/hive/pull/103 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[GitHub] hive pull request #104: HIVE-14815: Support vectorization for Parquet
GitHub user winningsix opened a pull request: https://github.com/apache/hive/pull/104 HIVE-14815: Support vectorization for Parquet This patch includes the following changes: 1. Implement a vectorized Page reader which support dictionary and RLE encoding. 2. Enable vectorization for Parquet input format. 3. Support several data types This is a WIP jira. You can merge this pull request into a Git repository by running: $ git pull https://github.com/winningsix/hive vectorization_parquet Alternatively you can review and apply these changes as the patch at: https://github.com/apache/hive/pull/104.patch To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message: This closes #104 commit a38c766e09bc1c3728fa413767b9fbaa19a4b005 Author: Ferdinand Xu <cheng.a...@intel.com> Date: 2016-09-01T22:15:31Z HIVE-14815: Support vectorization for Parquet --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[GitHub] hive pull request #103: HIVE-14029: Update Spark version to 2.0.0
GitHub user winningsix opened a pull request: https://github.com/apache/hive/pull/103 HIVE-14029: Update Spark version to 2.0.0 Changes include: * Spark API updates: 1. SparkShuffler#call return Iterator instead of Iterable 2. SparkListener -> JavaSparkListener 3. InputMetrics constructor doesnât accept readMethod 4. Method remoteBlocksFetched and localBlocksFetched in ShuffleReadMetrics return long type instead of integer * Dependency upgrade: 1. Jackson: 2.4.2 -> 2.6.5 2. Netty version: 4.0.23.Final -> 4.0.29.Final 3. Scala binary version: 2.10 -> 2.11 4. Scala version: 2.10.4 -> 2.11.8 Test done by smoke tests in a cluster and integration test in Jenkins You can merge this pull request into a Git repository by running: $ git pull https://github.com/winningsix/hive HIVE-14029 Alternatively you can review and apply these changes as the patch at: https://github.com/apache/hive/pull/103.patch To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message: This closes #103 commit 965e57295a83b06db61b22f3fda0bb19e47c248a Author: Ferdinand Xu <cheng.a...@intel.com> Date: 2016-09-17T19:10:04Z HIVE-14029: Update Spark version to 2.0.0 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---