Re: Clarifying questions...

Jonas Pfefferle Tue, 09 Jul 2019 01:05:01 -0700

Hi David,


Good to hear things work now.

1) Technically, you can use the RdmaStorageTier "directly" with an SSD, since it allocates its data in "datapath" (and then mmaps it). This path is typically a hugetlbfs mount, but it can be a standard mount point. However, there are a few drawbacks with this approach: all IO is buffered and you have no control over when it is written to the SSD, and since RDMA requires that all memory is pinned, you have to allocate as much memory as your SSD has. So overall that is not really feasible.
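To illustrate the buffering point in general terms, here is a minimal Python sketch of writing to a file through mmap on an ordinary mount point. It is not Crail code; the function name, the file name "block0", and the temporary directory standing in for "datapath" are all made up for the example. The store into the mapping only dirties pages in the page cache; without the explicit flush, the kernel decides when those pages reach the device.

```python
import mmap
import os
import tempfile


def write_via_mmap(path, payload, size=4096):
    """Map a file, store payload at offset 0, flush, and read it back."""
    # Pre-size the file first: an empty file cannot be mapped.
    with open(path, "wb") as f:
        f.truncate(size)
    with open(path, "r+b") as f:
        with mmap.mmap(f.fileno(), size) as region:
            # This store only modifies the mapped pages (page cache).
            region[:len(payload)] = payload
            # Explicitly ask the kernel to write the dirty pages back.
            region.flush()
    with open(path, "rb") as f:
        return f.read(len(payload))


if __name__ == "__main__":
    # Hypothetical stand-in for a "datapath" on a standard mount point.
    with tempfile.TemporaryDirectory() as datapath:
        print(write_via_mmap(os.path.join(datapath, "block0"), b"hello"))
```

The sketch only shows the buffered-write behaviour; the memory-pinning requirement of RDMA is a separate constraint and is not modelled here.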

My recommendation is to use the NVMf storage tier locally.

2) Correct, at the moment that is the only way you can do this: start multiple instances of SPDK, or use SPDK RAID0 if you just want to use multiple devices in the same storage class.

FYI, the shuffle plugin also supports configuring the storage class it should write to: "spark.crail.shuffle.storageclass" (put this into the Spark config).
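As a rough sketch of one way to pass that property, the Python helper below builds the arguments for spark-submit's --conf option. The helper name is hypothetical, and it assumes the property takes the same integer class number that is given with -c in the slaves file; check the shuffle plugin documentation for the accepted values.

```python
from typing import List


def crail_shuffle_conf_args(storage_class: int) -> List[str]:
    """Build spark-submit --conf arguments selecting the shuffle storage class."""
    # Assumption: storage classes are the non-negative integers used with -c.
    if storage_class < 0:
        raise ValueError("storage class must be non-negative")
    return ["--conf", f"spark.crail.shuffle.storageclass={storage_class}"]


if __name__ == "__main__":
    # Example: direct shuffle data to storage class 1.
    print(" ".join(crail_shuffle_conf_args(1)))
```

The same key=value pair could instead go into spark-defaults.conf as a whitespace-separated line.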


Regards,
Jonas

 On Tue, 9 Jul 2019 01:05:37 +0000
 David Crespi <[email protected]> wrote:

Hi,
I wanted to ask if there is a way of using a local SSD via the RdmaStorageTier, so a couple of questions.
From the blog example there were these three classes.

crail@clustermaster:~$ cat $CRAIL_HOME/conf/slaves

clusternode1 -t org.apache.crail.storage.rdma.RdmaStorageTier -c 0

clusternode1 -t org.apache.crail.storage.nvmf.NvmfStorageTier -c 1

disaggnode -t org.apache.crail.storage.nvmf.NvmfStorageTier -c 2
1. Is there a way of using the RdmaStorageTier directly with an SSD that is local to the server “clusternode1”? Or does the local SSD have to be included in an NVMf subsystem on that local server, so that the NvmfStorageTier is used on that same server to access the SSD locally via an NVMf subsystem?
2. I asked a question a few days ago about how to use the same Subsystem NQN, which I can’t with a single instance of SPDK. Is this how using the same NQN is possible: different instances of SPDK would be used, one on each server (i.e. clusternode1 & clusternode2), each with its own “version” of that same Subsystem?
BTW…
I have my environment all running now, and all in containers. Everything appears to be working as advertised. The Spark shuffle seems to be filling up the memory tier, then continuing on to the SSD tier. I haven’t done anything over 300G yet, but it’s coming. I’m clarifying the above to be sure I’m not missing out on one of the configs. I’m currently also using HDFS for the tmp results, as I currently only have one instance of SPDK, so NVMf classes 1 and 2 can’t both exist for me (assuming the answers above, that is 😊).
Regards,

          David
