Michal Monselise created SPARK-10246:
----------------------------------------

             Summary: Join in PySpark using a list of column names
                 Key: SPARK-10246
                 URL: https://issues.apache.org/jira/browse/SPARK-10246
             Project: Spark
          Issue Type: Bug
          Components: PySpark, SQL
            Reporter: Michal Monselise


Currently, join supports only two ways of specifying how to join: a single 
join condition or a single column name.

The documentation states that the join function can also accept a list of 
conditions or a list of column names, but neither is currently supported. This 
is also discussed in SPARK-7197.

Functionality should match the documentation, which currently contains the 
following example in /spark/python/pyspark/sql/dataframe.py (line 560):

>>> df.join(df4, ['name', 'age']).select(df.name, df.age).collect()
[Row(name=u'Bob', age=5)]
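
To make the expected behaviour concrete, here is a minimal pure-Python sketch 
of what joining on a list of column names should mean: an inner equi-join on 
every listed column, with each join column appearing once in the result. It 
does not use Spark at all. The helper join_on_names and the sample rows are 
hypothetical, made up only to mirror the shape of the docstring example above, 
and the sketch ignores SQL null semantics for simplicity.

```python
def join_on_names(left, right, names):
    """Inner equi-join of two lists of dicts on every column in `names`.

    Hypothetical helper for illustration only; this is not Spark code.
    """
    # Index the right side by its tuple of join-key values.
    index = {}
    for row in right:
        index.setdefault(tuple(row[n] for n in names), []).append(row)

    joined = []
    for lrow in left:
        key = tuple(lrow[n] for n in names)
        for rrow in index.get(key, []):
            merged = dict(lrow)
            # Join columns appear once; remaining right-side columns are added.
            merged.update({k: v for k, v in rrow.items() if k not in names})
            joined.append(merged)
    return joined


# Made-up sample rows (assumption: not taken from the Spark source).
df = [{"name": "Alice", "age": 2}, {"name": "Bob", "age": 5}]
df4 = [
    {"name": "Alice", "age": 10, "height": 80},
    {"name": "Bob", "age": 5, "height": None},
]

result = [(r["name"], r["age"]) for r in join_on_names(df, df4, ["name", "age"])]
print(result)
```

With these rows only Bob matches on both name and age, which corresponds to 
the single row shown in the docstring example.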



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
