Hi everybody, https://engineering.linkedin.com/blog/2021/the-exabyte-club--linkedin-s-journey-of-scaling-the-hadoop-distr is a nice blog post reading. There are some interesting follow ups in my opinion:
- Fair vs Non-Fair locking for the HDFS Namenode. IIUC this seems to be a code change rather than a jvm setting tunable, but I am wondering if others have experience with different locking mechanisms in production for HDFS. - Observer HDFS Namenode. IIUC this was introduced in Hadoop 2.10, it would be nice if we could offer it via puppet for the docker provisioner (if we don't already do it, I didn't find it). Having a separate Namenode to handle read requests could be interesting for busy clusters. Has anybody already deployed it? Thanks in advance, Luca
