Abhishek Shakya created SPARK-36817:
---------------------------------------
Summary: Does Apache Spark 3 support GPU usage for Spark RDDs?
Key: SPARK-36817
URL: https://issues.apache.org/jira/browse/SPARK-36817
Project: Spark
Issue Type: Question
Components: Spark Core
Affects Versions: 3.1.2
Reporter: Abhishek Shakya
I am currently trying to run genomic analyses pipelines using
[Hail|https://hail.is/](library for genomics analyses written in python and
Scala). Recently, Apache Spark 3 was released and it supported GPU usage.
I tried [spark-rapids|https://nvidia.github.io/spark-rapids/] library start an
on-premise slurm cluster with gpu nodes. I was able to initialise the cluster.
However, when I tried running hail tasks, the executors keep getting killed.
On querying in Hail forum, I got the response that
{quote}That’s a GPU code generator for Spark-SQL, and Hail doesn’t use any
Spark-SQL interfaces, only the RDD interfaces.
{quote}
So, does Spark3 not support GPU usage for RDD interfaces?
--
This message was sent by Atlassian Jira
(v8.3.4#803005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]