increase 16k IOs with parquet in Drill?

Mark Himelstein Sun, 10 May 2015 17:28:40 -0700

Hi,

I'm new to the list so apologies up front if this is the wrong place topost this (glad to take input).

I converted a large set of CSV files to parquet files using Drill. Itried this with snappy and uncompressed.

Subsequent reads with a 'select count(*) from dfs.`mydir` where`somecolumn` > 47;' always does 16k reads. Using flightrecorder thisseems to come from the Page Header in the Parquet files.

Anyone know a way to increase the 16k reads? Thinking about writing myown parquet files but thought I'd ask if there was some config way to doit first. And also ask if writing my own parquet file with bigger sizesin the Page Header will help?


Thanks in advance,
Mark

increase 16k IOs with parquet in Drill?

Reply via email to