Hi All, I am wandering if I can use a very large block size in production HDFS cluster? Such as 4 or 8 gigabytes or even larger.
Is there any problem with HDFS if there are a large number of large blocks in it? Then if the large blocks are stored as Carbondata or other columnar formats such as Orc or Parquet, and we want to execute queries on top of such data, what troubles we may have? Thanks! Haoqiong