Hossein Falaki created SPARK-17919:

             Summary: Make timeout to RBackend configurable in SparkR
                 Key: SPARK-17919
                 URL: https://issues.apache.org/jira/browse/SPARK-17919
             Project: Spark
          Issue Type: Story
          Components: SparkR
    Affects Versions: 2.0.1
            Reporter: Hossein Falaki

I am working on a project where {{gapply()}} is used with a large dataset 
that happens to be extremely skewed. On the skewed partition, the user 
function takes more than 2 hours to return, which turns out to be longer than 
the timeout that we hardcode in SparkR for the backend connection:

connectBackend <- function(hostname, port, timeout = 6000) 

Ideally, the user should be able to reconfigure Spark and increase the 
timeout. It should be a small fix.
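
One possible shape for such a fix is sketched below, as an illustration rather than the actual SparkR change. It assumes the timeout is passed through to R's socketConnection(), whose timeout argument is in seconds, so the current default of 6000 is 100 minutes, which is shorter than the 2-hour user function. The environment variable name SPARKR_BACKEND_TIMEOUT and the helpers backendTimeout() and connectBackendSketch() are hypothetical names introduced only for this example.

```r
# Hypothetical helper: resolve the backend timeout (in seconds) from an
# environment variable, falling back to the currently hardcoded default.
backendTimeout <- function(default = 6000) {
  # SPARKR_BACKEND_TIMEOUT is an assumed name, not an existing setting.
  raw <- Sys.getenv("SPARKR_BACKEND_TIMEOUT", unset = NA)
  if (is.na(raw)) {
    return(default)
  }
  value <- suppressWarnings(as.integer(raw))
  # Ignore values that are not positive integers.
  if (is.na(value) || value <= 0) default else value
}

# Stand-in for connectBackend(): same signature as quoted above, but the
# default timeout is looked up instead of being hardcoded.
connectBackendSketch <- function(hostname, port, timeout = backendTimeout()) {
  socketConnection(host = hostname, port = port, server = FALSE,
                   blocking = TRUE, open = "wb", timeout = timeout)
}
```

With something like this in place, a user hitting the problem above could set the variable to, say, 10800 (3 hours) before starting the SparkR session, and an invalid or missing value would fall back to 6000.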

