Github user dongjoon-hyun commented on a diff in the pull request:
https://github.com/apache/spark/pull/21372#discussion_r189472897
--- Diff:
sql/core/src/main/java/org/apache/spark/sql/execution/datasources/orc/OrcColumnVector.java
---
@@ -136,7 +136,7 @@ public int getInt(int rowId) {
public long getLong(int rowId) {
int index = getRowIndex(rowId);
if (isTimestamp) {
- return timestampData.time[index] * 1000 + timestampData.nanos[index]
/ 1000;
+ return timestampData.time[index] * 1000 + timestampData.nanos[index]
/ 1000 % 1000;
--- End diff --
In Apache ORC 1.4.4, ORC-306 fixes this according to the [original
definition](https://github.com/apache/hive/blob/master/storage-api/src/java/org/apache/hadoop/hive/ql/exec/vector/TimestampColumnVector.java#L45-L46).
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]