Hello all, I think large data processing capability is more important than fault tolerance at the moment.
Here's my few thoughts. 1. We have to fix partitioning issue (most related with graph). 2. According to my experiences, Avro RPC implementation is unstable and requires huge resources. I'd like to just remove if nobody maintain this implementation. Additionally, we should focus more on memory efficiency and fast parallel algorithms, not disk-based. -- Best Regards, Edward J. Yoon @eddieyoon
