pavibhai opened a new issue, #5777:
URL: https://github.com/apache/iceberg/issues/5777
### Feature Request / Improvement
Currently the ORCFileAppender opens a written ORC file just to derive the
Stripe offsets
```java
@Override
public List<Long> splitOffsets() {
Preconditions.checkState(isClosed, "File is not yet closed");
try (Reader reader = ORC.newFileReader(file.toInputFile(), conf)) {
List<StripeInformation> stripes = reader.getStripes();
return Collections.unmodifiableList(Lists.transform(stripes,
StripeInformation::getOffset));
} catch (IOException e) {
throw new RuntimeIOException(e, "Can't close ORC reader %s",
file.location());
}
}
```
Starting with ORC 1.7 we have added a public API to retrieve offset
information from the writer. This when called after close gives the complete
stripe information that is written out to the file. With this we can avoid
opening the written file.
### Query engine
_No response_
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]