I have been comparing my custom orc converted files to hive formatted tables (using hive) My question is what is the Compression size meaning here and why the difference ?
HIVE table: File Version: 0.12 with HIVE_8732Rows: 194032Compression: ZLIBCompression size: 262144 My Orc File: File Version: 0.12 with ORC_101Rows: 229376Compression: ZLIBCompression size: 4096 I use this for writing my file: OrcFile.writerOptions(conf).stripeSize(100000).bufferSize(10000).compress(org.apache.orc.CompressionKind.ZLIB).version(OrcFile.Version.V_0_12).setSchema(orcSchema));
