itsvikramagr commented on issue #24922: [SPARK-28120][SS]  Rocksdb state 
storage implementation
URL: https://github.com/apache/spark/pull/24922#issuecomment-539838772
 
 
   > 1. We are using flatMapGroupsWithState, which causes it to fail at the 
beginning.
   
   Will update the PR with the fix.
   
   > 2. RocksDB checkpoint creation had quite a high time cost, sometimes > 20 
secs, ..  then I changed all of them to an ext4 partition, and the result is 
much better: it is now < 10ms in most cases, but can still sometimes be > 
100ms.
   
   For isolation and data consistency, we checkpoint the RocksDB state to local 
disk. As you suggested, a good file system and SSD-based instance storage 
should be used to get the best performance.
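   
   To illustrate why the file system matters here: a RocksDB checkpoint 
hard-links the immutable SST files when the checkpoint directory is on the same 
file system as the database, so its cost is mostly file-system metadata work. 
The following is only a minimal sketch of that idea in plain Python, not the 
code in this PR and not the RocksDB API; `hardlink_checkpoint` and the file 
names are hypothetical.

```python
import os
import tempfile
import time


def hardlink_checkpoint(db_dir: str, checkpoint_dir: str) -> float:
    """Snapshot every regular file in db_dir into checkpoint_dir via hard links.

    Hypothetical helper for illustration only. Returns elapsed seconds.
    """
    start = time.perf_counter()
    os.makedirs(checkpoint_dir)  # raises if the checkpoint already exists
    for name in os.listdir(db_dir):
        src = os.path.join(db_dir, name)
        if os.path.isfile(src):
            # A hard link adds a directory entry only; no file data is copied.
            os.link(src, os.path.join(checkpoint_dir, name))
    return time.perf_counter() - start


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as root:
        db_dir = os.path.join(root, "db")
        os.makedirs(db_dir)
        # Stand-ins for immutable SST files.
        for i in range(3):
            with open(os.path.join(db_dir, f"{i:06d}.sst"), "wb") as f:
                f.write(os.urandom(1024))
        elapsed = hardlink_checkpoint(db_dir, os.path.join(root, "checkpoint-1"))
        print(f"checkpoint took {elapsed * 1000:.3f} ms")
```

   Because only directory entries are created, the elapsed time depends on how 
quickly the underlying file system and disk handle metadata operations, which 
is consistent with the improvement reported after moving to ext4.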
   
   > 3. All Spark executors get stuck when one of the executors tries to load a 
snapshot file from the Spark checkpoint.
   
   Great catch. Let me look at it and make appropriate changes.
   
   
