Hi All,
I am wondering whether I can use a very large block size in a production
HDFS cluster, such as 4 or 8 gigabytes or even larger?

Would HDFS have any problems if it holds a large number of such large
blocks?

Also, if the large blocks hold data in CarbonData or other columnar formats
such as ORC or Parquet, and we want to run queries on top of that data,
what problems might we run into?

Thanks!
Haoqiong
