Github user phegstrom commented on a diff in the pull request:
https://github.com/apache/spark/pull/22227#discussion_r214358111
--- Diff: python/pyspark/sql/functions.py ---
@@ -1669,20 +1669,36 @@ def repeat(col, n):
return Column(sc._jvm.functions.repeat(_to_java_column(col), n))
-@since(1.5)
+@since(2.4)
@ignore_unicode_prefix
-def split(str, pattern):
- """
- Splits str around pattern (pattern is a regular expression).
-
- .. note:: pattern is a string represent the regular expression.
-
- >>> df = spark.createDataFrame([('ab12cd',)], ['s',])
- >>> df.select(split(df.s, '[0-9]+').alias('s')).collect()
- [Row(s=[u'ab', u'cd'])]
- """
- sc = SparkContext._active_spark_context
- return Column(sc._jvm.functions.split(_to_java_column(str), pattern))
+def split(str, regex, limit=-1):
--- End diff ---
@HyukjinKwon do you want `regex` -> `pattern` just here in Python, or everywhere in this PR?
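
For reference, the `limit` parameter being added in this diff follows the usual regex-split convention: a positive `limit` caps the result at that many elements (the last element keeping the remainder of the string), while a non-positive `limit` splits as many times as possible. A minimal pure-Python sketch of that behavior, using `re.split` rather than Spark (the function name `split_like_spark` is hypothetical, for illustration only):

```python
import re

def split_like_spark(s, regex, limit=-1):
    # Hypothetical sketch of the semantics under discussion:
    # limit > 0  -> at most `limit` elements; the last element contains
    #               the unsplit remainder of the string.
    # limit <= 0 -> split on every match of the regex.
    # (Note: re.split's maxsplit counts splits, not elements, hence limit - 1.)
    if limit > 0:
        return re.split(regex, s, maxsplit=limit - 1)
    return re.split(regex, s)

print(split_like_spark('ab12cd', '[0-9]+'))     # ['ab', 'cd']
print(split_like_spark('a,b,c', ',', limit=2))  # ['a', 'b,c']
```

This mirrors the docstring example in the diff (`split(df.s, '[0-9]+')` yielding `['ab', 'cd']`); the exact trailing-empty-string behavior of Spark's JVM-side `split` may differ from `re.split`.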