[
https://issues.apache.org/jira/browse/HAWQ-178?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15126502#comment-15126502
]
ASF GitHub Bot commented on HAWQ-178:
-------------------------------------
Github user tzolov commented on the pull request:
https://github.com/apache/incubator-hawq/pull/302#issuecomment-178057412
@adamjshook, it's nice to hear from you!
You are right the `JsonRecordReader`,`JsonStreamReader` reads beyond the
`Split` boundries. I was missleaded by the `JsonInputFormat` which is never
been used. The `HdfsSplittableDataAccessor` (the JsonAccessor parent) doesn't
use the InptuFormat and getSplits\getRecordReader methods are not called.
The `HdfsSplittableDataAccessor` generates exactly **one** split!
> Add JSON plugin support in code base
> ------------------------------------
>
> Key: HAWQ-178
> URL: https://issues.apache.org/jira/browse/HAWQ-178
> Project: Apache HAWQ
> Issue Type: New Feature
> Components: PXF
> Reporter: Goden Yao
> Assignee: Goden Yao
> Fix For: backlog
>
> Attachments: PXFJSONPluginforHAWQ2.0andPXF3.0.0.pdf
>
>
> JSON has been a popular format used in HDFS as well as in the community,
> there has been a few JSON PXF plugins developed by the community and we'd
> like to see it being incorporated into the code base as an optional package.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)