[
https://issues.apache.org/jira/browse/TEZ-2974?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Rajesh Balamohan updated TEZ-2974:
----------------------------------
Attachment: TEZ-2974.1.patch
Attaching .1 patch. Instead of reading entire TFile into buffer (getValue() was
doing that earlier with TFile scanner), it uses getValueStream() now. This
helps in reading one line at a time and converts to tuple for processing.
> Tez tools: TFileRecordReader in tez-tools should support reading >2 GB tfiles
> -----------------------------------------------------------------------------
>
> Key: TEZ-2974
> URL: https://issues.apache.org/jira/browse/TEZ-2974
> Project: Apache Tez
> Issue Type: Improvement
> Reporter: Rajesh Balamohan
> Assignee: Rajesh Balamohan
> Attachments: TEZ-2974.1.patch
>
>
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)