GitHub user ueshin opened a pull request: https://github.com/apache/spark/pull/1373
    [SPARK-2446][SQL] Add BinaryType support to Parquet I/O.

    To support `BinaryType`, the following changes are needed:

    - Make `StringType` use `OriginalType.UTF8`
    - Add `BinaryType` using `PrimitiveTypeName.BINARY` without `OriginalType`

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/ueshin/apache-spark issues/SPARK-2446

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/1373.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #1373

----
commit 616e04a989edce9f5109a3b2ad11b78e1e0f1a77
Author: Takuya UESHIN <ues...@happy-camper.st>
Date:   2014-07-09T15:25:08Z

    Make StringType use OriginalType.UTF8.

commit ecacb925c4e1f2dbf863b604ad2600cc8c9663d8
Author: Takuya UESHIN <ues...@happy-camper.st>
Date:   2014-07-11T09:13:46Z

    Add BinaryType support to Parquet I/O.

----
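For readers who have not opened the diff, the two changes named in the pull request description can be pictured roughly as follows. This is only a sketch under stated assumptions, not the code from the patch: `SqlType`, `ParquetField`, `toParquet`, and `fromParquet` are hypothetical names, and the `PrimitiveTypeName` and `OriginalType` enums are local stand-ins that borrow the names used in the description rather than the real Parquet classes. The read-side rule in `fromParquet` is an inference from the description (a `BINARY` column with a `UTF8` annotation is a string, one without is raw bytes), not something the description states.

```java
import java.util.Optional;

// Illustrative sketch only: all types here are local stand-ins, not the
// actual Spark SQL or Parquet classes.
class ParquetTypeMappingSketch {

    // Hypothetical stand-ins for the two Spark SQL types discussed.
    enum SqlType { STRING_TYPE, BINARY_TYPE }

    // Stand-ins that reuse the names from the PR description.
    enum PrimitiveTypeName { BINARY }
    enum OriginalType { UTF8 }

    // A Parquet column description: a primitive type plus an optional annotation.
    static final class ParquetField {
        final PrimitiveTypeName primitive;
        final Optional<OriginalType> original;

        ParquetField(PrimitiveTypeName primitive, Optional<OriginalType> original) {
            this.primitive = primitive;
            this.original = original;
        }
    }

    // Write side: both types are stored as BINARY; only strings carry UTF8.
    static ParquetField toParquet(SqlType type) {
        switch (type) {
            case STRING_TYPE:
                return new ParquetField(PrimitiveTypeName.BINARY, Optional.of(OriginalType.UTF8));
            case BINARY_TYPE:
                return new ParquetField(PrimitiveTypeName.BINARY, Optional.empty());
            default:
                throw new IllegalArgumentException("unsupported type: " + type);
        }
    }

    // Read side (assumed): the presence of the UTF8 annotation tells the two apart.
    static SqlType fromParquet(ParquetField field) {
        return field.original.isPresent() ? SqlType.STRING_TYPE : SqlType.BINARY_TYPE;
    }

    public static void main(String[] args) {
        for (SqlType t : SqlType.values()) {
            ParquetField f = toParquet(t);
            System.out.println(t + " -> " + f.primitive + " / " + f.original
                    + " -> " + fromParquet(f));
        }
    }
}
```

The point of the sketch is that once strings are annotated, an unannotated `BINARY` column is free to mean raw bytes, which would explain why the `StringType` change comes first in the commit list.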