Github user velvia commented on the pull request: https://github.com/apache/incubator-spark/pull/576#issuecomment-34715532 My concern with this is that Parquet is typically used for high performance OLAP queries, and changing it to JSON makes it much slower. Out of curiosity, I have used Parquet with Thrift, and I know it supports Avro, is JSON a separate path? I'm probably going to publish a blog post on Parquet, Thrift, and Spark soon.
- [GitHub] incubator-spark pull request: Added parquetFileAsJS... laserson
- [GitHub] incubator-spark pull request: Added parquetFil... AmplabJenkins
- [GitHub] incubator-spark pull request: Added parquetFil... velvia
- [GitHub] incubator-spark pull request: Added parquetFil... laserson
- [GitHub] incubator-spark pull request: Added parquetFil... pwendell
- [GitHub] incubator-spark pull request: Added parquetFil... laserson
- [GitHub] incubator-spark pull request: Added parquetFil... laserson
- [GitHub] incubator-spark pull request: Added parquetFil... velvia
- [GitHub] incubator-spark pull request: Added parquetFil... laserson