[
https://issues.apache.org/jira/browse/ORC-1343?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17655922#comment-17655922
]
Dongjoon Hyun commented on ORC-1343:
------------------------------------
The ORC Java writer is not the only ORC writer. There are others (C++,
Presto, Scritchley, Trino), and a Rust ORC writer was also recently
introduced in the community.
-
https://github.com/apache/orc/blob/b3491a3db7d4168c65904308d0bd2b0dc07a6b5f/proto/orc_proto.proto#L364-L369
I'm not sure whether those writers have the same old limitation that the
Java writer had before ORC-1283. That's my question on this JIRA.
IMO, it doesn't sound like a new problem. It's an old Java reader issue: the
reader didn't support ORC files without an index. Did I miss something?
> But here we don't deal with the logic of filter creation, so it's a new
> problem
> Reading ORC files without index occurs error using latested spark
> -----------------------------------------------------------------
>
> Key: ORC-1343
> URL: https://issues.apache.org/jira/browse/ORC-1343
> Project: ORC
> Issue Type: Bug
> Affects Versions: 1.7.7, 1.8.1
> Reporter: Penglei Shi
> Priority: Major
>
> ORC-1283 (https://issues.apache.org/jira/browse/ORC-1283) fixed the problem
> that ENABLE_INDEXES did not take effect. But without an index, filter
> pushdown fails with an error, which seems to be caused by the code below in
> RecordReaderImpl.java:
> {code:java}
> if (indexes[columnIx] == null) {
>   throw new AssertionError("Index is not populated for " + columnIx);
> }
> {code}
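
To illustrate the failure mode in the snippet quoted above, here is a minimal,
self-contained Java sketch of a reader that treats a missing index as "nothing
can be pruned" and reads every row group rather than throwing. It is not the
RecordReaderImpl code and not the actual fix for this issue. MissingIndexSketch,
Stats, and pickRowGroups are hypothetical names, and the pushed-down filter is
simplified to an equality check against per-row-group min/max statistics.

```java
import java.util.Arrays;

final class MissingIndexSketch {
    /** Hypothetical stand-in for the min/max statistics of one row group. */
    static final class Stats {
        final long min;
        final long max;

        Stats(long min, long max) {
            this.min = min;
            this.max = max;
        }
    }

    /**
     * Decides which row groups to read for the predicate "column == value".
     * indexes[columnIx] holds one Stats entry per row group, or null when the
     * writer produced no index for that column.
     */
    static boolean[] pickRowGroups(Stats[][] indexes, int columnIx,
                                   int rowGroupCount, long value) {
        boolean[] include = new boolean[rowGroupCount];
        if (indexes[columnIx] == null) {
            // Assumption of this sketch: with no index, skip pruning and
            // read every row group instead of raising AssertionError.
            Arrays.fill(include, true);
            return include;
        }
        for (int rg = 0; rg < rowGroupCount; rg++) {
            Stats s = indexes[columnIx][rg];
            // Keep the row group only if the value can fall inside its range.
            include[rg] = s.min <= value && value <= s.max;
        }
        return include;
    }

    public static void main(String[] args) {
        Stats[][] indexes = {
            { new Stats(0, 9), new Stats(10, 19) }, // column 0: indexed
            null                                    // column 1: no index
        };
        // Indexed column: only the second row group can contain 12.
        System.out.println(Arrays.toString(pickRowGroups(indexes, 0, 2, 12L)));
        // Column without an index: every row group is read.
        System.out.println(Arrays.toString(pickRowGroups(indexes, 1, 2, 12L)));
    }
}
```

Reading everything when no index exists trades pruning for correctness: the
filter still has to be applied to the rows afterwards, but files from writers
that omit indexes stay readable.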