Regarding #2 – I think there are two options here:
1. Modify Accumulo to take advantage of HDFS Heterogeneous Storage 2. Modify Accumulo WAL code to support volumes From: Jeff Kubina [mailto:jeff.kub...@gmail.com] Sent: Wednesday, November 02, 2016 9:02 PM To: user@accumulo.apache.org Subject: Re: New Accumulo Blog Post Thanks for the blog post, very interesting read. Some questions ... 1. Are the operations "Writes mutation to tablet servers’ WAL/Sync or flush tablet servers’ WAL" and "Adds mutations to sorted in memory map of each tablet." performed by threads in parallel? 2. Could the latency of hsync-ing the WALs be overcome by modifying Accumulo to write them to a separate SSD-only HDFS? To maintain data locality it would require two datanode processes (one for the HDDs and one for the SSD), running on the same node, which is not hard to do.