[
https://issues.apache.org/jira/browse/ARROW-16000?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17569157#comment-17569157
]
Antoine Pitrou edited comment on ARROW-16000 at 7/20/22 7:47 PM:
-----------------------------------------------------------------
Ideally it could, except that a FileSource is suppose to provide a
RandomAccessFile, not an InputStream. A transformation callback can only work
for those file formats (CSV for the moment, JSON later on) that read files in a
purely streaming manner.
was (Author: pitrou):
Ideally it could, except that a FileSource is suppose to provide a
RandomAccessFile, not an InputStream. A transformation callback can only work
for those file format that read files in a purely streaming manner.
> [C++][Dataset] Support Latin-1 encoding
> ---------------------------------------
>
> Key: ARROW-16000
> URL: https://issues.apache.org/jira/browse/ARROW-16000
> Project: Apache Arrow
> Issue Type: Improvement
> Components: C++
> Reporter: Nicola Crane
> Assignee: Joost Hoozemans
> Priority: Major
>
> In ARROW-15992 a user is reporting issues with trying to read in files with
> Latin-1 encoding. I had a look through the docs for the Dataset API and I
> don't think this is currently supported.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)