zhztheplayer commented on code in PR #13830:
URL: https://github.com/apache/arrow/pull/13830#discussion_r944056618

##########
cpp/src/arrow/dataset/file_base.cc:
##########
@@ -89,6 +89,28 @@ Result<std::shared_ptr<io::InputStream>> FileSource::OpenCompressed(
   return io::CompressedInputStream::Make(codec.get(), std::move(file));
 }
 
+Result<std::shared_ptr<io::InputStream>> FileSource::OpenRange(int64_t start,
+                                                               int64_t end) const {

Review Comment:
Does this mean the input file will be truncated to exactly the bytes from `start` to `end` and then delivered to the fragment? In some cases a fuzzy match might be required instead. For example, Spark picks the Parquet row groups whose byte midpoints are located in [start, end).
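
To make the midpoint rule concrete, here is a minimal sketch of that kind of fuzzy match. It is not Arrow or Spark code: `RowGroupExtent` and `SelectByMidpoint` are hypothetical names, and the sketch assumes each row group is described only by its byte offset and byte length within the file.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical description of one row group's byte extent in a file.
struct RowGroupExtent {
  int64_t offset;  // byte offset of the row group's first byte
  int64_t length;  // total size of the row group in bytes
};

// Returns the indices of the row groups whose byte midpoint lies in
// the half-open range [start, end). A row group that straddles a split
// boundary is therefore assigned to exactly one split.
std::vector<std::size_t> SelectByMidpoint(const std::vector<RowGroupExtent>& groups,
                                          int64_t start, int64_t end) {
  assert(start <= end);
  std::vector<std::size_t> selected;
  for (std::size_t i = 0; i < groups.size(); ++i) {
    const int64_t midpoint = groups[i].offset + groups[i].length / 2;
    if (midpoint >= start && midpoint < end) {
      selected.push_back(i);
    }
  }
  return selected;
}
```

With row groups at bytes [0, 100), [100, 200) and [200, 300), the splits [0, 120) and [120, 300) select the first group and the last two groups respectively. The middle group straddles byte 120 but is read in full by the second split only, whereas a strict truncation to [0, 120) would cut through it.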