GitHub user sameeragarwal opened a pull request:

    https://github.com/apache/spark/pull/14304

    [SPARK-16668][TEST] Test parquet reader for row groups containing both 
dictionary and plain encoded pages

    ## What changes were proposed in this pull request?
    
    This patch adds an explicit test for [SPARK-14217] by setting the parquet 
dictionary and page size the generated parquet file spans across 3 pages 
(within a single row group) where the first page is dictionary encoded and the 
remaining two are plain encoded.
    
    ## How was this patch tested?
    
    1. ParquetEncodingSuite
    2. Also manually tested that this test fails without 
https://github.com/apache/spark/pull/12279

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/sameeragarwal/spark hybrid-encoding-test

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/14304.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #14304
    
----
commit adffc4407a783bdf86d5ee5a26d289ee496d1247
Author: Sameer Agarwal <samee...@cs.berkeley.edu>
Date:   2016-07-21T06:08:17Z

    experiments

commit 5e7556cf96d991b2f38fda82d28256687f056474
Author: Sameer Agarwal <samee...@cs.berkeley.edu>
Date:   2016-07-21T07:59:34Z

    works

commit 6b688e97310f903066b4085cb0374e76a9baef0a
Author: Sameer Agarwal <samee...@cs.berkeley.edu>
Date:   2016-07-21T18:29:53Z

    cleanup

commit f3029080c449d40c1dde8e97b97f0354866788c4
Author: Sameer Agarwal <samee...@cs.berkeley.edu>
Date:   2016-07-21T18:30:47Z

    cleanup

----


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

Reply via email to