jvanalstine opened a new issue #10784:
URL: https://github.com/apache/arrow/issues/10784


   Greetings,
   
   I have been parsing my text files via
   ```
     int64_t buf_size = 65536;
   
     arrow::MemoryPool* pool = arrow::default_memory_pool();
     ARROW_RETURN_NOT_OK(arrow::jemalloc_memory_pool(&pool));
   
     ARROW_ASSIGN_OR_RAISE(auto text_readable,
                           arrow::io::ReadableFile::Open(text_file, pool));
     ARROW_ASSIGN_OR_RAISE(auto buffered_text_stream,
                           arrow::io::BufferedInputStream::Create(buf_size, 
pool, text_readable));
     ARROW_ASSIGN_OR_RAISE(auto gzip_codec,
                           
arrow::util::Codec::Create(arrow::Compression::GZIP));
     ARROW_ASSIGN_OR_RAISE(auto compressed_text_stream,
                           
arrow::io::CompressedInputStream::Make(gzip_codec.get(), buffered_text_stream));
   ```
   and then using `reinterpret_cast<const char*>(buf->data());` to parse the 
buffer into lines. 
   
   Is there a utility that i'm overlooking in the library that will give me an 
iterator parsing a text file line by line?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


Reply via email to