jvanalstine opened a new issue #10784:
URL: https://github.com/apache/arrow/issues/10784
Greetings,
I have been parsing my text files via
```
int64_t buf_size = 65536;
// Start from the default pool, then switch to the jemalloc-backed pool.
arrow::MemoryPool* pool = arrow::default_memory_pool();
ARROW_RETURN_NOT_OK(arrow::jemalloc_memory_pool(&pool));
// Open the file and buffer reads of the raw (still compressed) bytes.
ARROW_ASSIGN_OR_RAISE(auto text_readable,
                      arrow::io::ReadableFile::Open(text_file, pool));
ARROW_ASSIGN_OR_RAISE(auto buffered_text_stream,
                      arrow::io::BufferedInputStream::Create(buf_size, pool,
                                                             text_readable));
// Decompress the buffered stream with a gzip codec.
ARROW_ASSIGN_OR_RAISE(auto gzip_codec,
                      arrow::util::Codec::Create(arrow::Compression::GZIP));
ARROW_ASSIGN_OR_RAISE(auto compressed_text_stream,
                      arrow::io::CompressedInputStream::Make(gzip_codec.get(),
                                                             buffered_text_stream));
```
and then using `reinterpret_cast<const char*>(buf->data())` to parse the
buffer into lines by hand.
Is there a utility in the library that I'm overlooking that would give me an
iterator over a text file, line by line?
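
For context, the manual splitting described above can be sketched roughly as follows. This is only a minimal illustration using the C++17 standard library: `LineSplitter`, `Feed`, and `Finish` are hypothetical names, not part of Arrow. The main wrinkle is that a line may straddle two buffers, so the unterminated tail of each chunk has to be carried over to the next one.

```cpp
#include <cstddef>
#include <functional>
#include <string>
#include <string_view>
#include <utility>

// Hypothetical helper (not an Arrow API): reassembles lines from a sequence
// of arbitrary byte chunks, such as successive buffers read from a stream.
class LineSplitter {
 public:
  using Callback = std::function<void(std::string_view)>;

  explicit LineSplitter(Callback on_line) : on_line_(std::move(on_line)) {}

  // Feed one chunk; invokes the callback for every line this chunk completes.
  void Feed(const char* data, std::size_t size) {
    if (size == 0) return;
    std::string_view chunk(data, size);
    std::size_t start = 0;
    while (true) {
      const std::size_t nl = chunk.find('\n', start);
      if (nl == std::string_view::npos) break;
      pending_.append(chunk.data() + start, nl - start);
      Emit();
      start = nl + 1;
    }
    // Keep the unterminated tail until the next chunk arrives.
    pending_.append(chunk.data() + start, chunk.size() - start);
  }

  // Call once at end of stream to emit a final line with no trailing '\n'.
  void Finish() {
    if (!pending_.empty()) Emit();
  }

 private:
  void Emit() {
    // Drop a trailing '\r' so CRLF-terminated input yields clean lines.
    if (!pending_.empty() && pending_.back() == '\r') pending_.pop_back();
    on_line_(pending_);
    pending_.clear();
  }

  Callback on_line_;
  std::string pending_;
};
```

With the streams above, each buffer returned by `compressed_text_stream->Read(...)` would presumably be passed to `Feed` via the `reinterpret_cast<const char*>(buf->data())` and `buf->size()` pair, stopping at the first zero-length buffer and then calling `Finish()`.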