[jira] [Created] (SPARK-45505) Refactor analyzeInPython function to make it reusable

2023-10-11 Thread Allison Wang (Jira)
Allison Wang created SPARK-45505: Summary: Refactor analyzeInPython function to make it reusable Key: SPARK-45505 URL: https://issues.apache.org/jira/browse/SPARK-45505 Project: Spark Issue T

[jira] [Updated] (SPARK-44076) SPIP: Python Data Source API

2023-10-10 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44076?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-44076: - Affects Version/s: 4.0.0 (was: 3.5.0) > SPIP: Python Data Source API

[jira] [Created] (SPARK-45442) Refine docstring of `DataFrame.show`

2023-10-06 Thread Allison Wang (Jira)
Allison Wang created SPARK-45442: Summary: Refine docstring of `DataFrame.show` Key: SPARK-45442 URL: https://issues.apache.org/jira/browse/SPARK-45442 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-45428) Add Matomo analytics to all released docs pages

2023-10-05 Thread Allison Wang (Jira)
Allison Wang created SPARK-45428: Summary: Add Matomo analytics to all released docs pages Key: SPARK-45428 URL: https://issues.apache.org/jira/browse/SPARK-45428 Project: Spark Issue Type: S

[jira] [Commented] (SPARK-45428) Add Matomo analytics to all released docs pages

2023-10-05 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-45428?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17772395#comment-17772395 ] Allison Wang commented on SPARK-45428: -- cc [~podongfeng]  > Add Matomo analytics t

[jira] [Updated] (SPARK-44729) Add canonical links to the PySpark docs page

2023-10-05 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44729?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-44729: - Description: We should add the canonical link to the PySpark docs page [https://spark.apache.or

[jira] [Commented] (SPARK-45264) Configurable error when generating Python docs

2023-09-21 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-45264?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17767743#comment-17767743 ] Allison Wang commented on SPARK-45264: -- [~podongfeng] do we have ways to bypass suc

[jira] [Created] (SPARK-45264) Configurable error when generating Python docs

2023-09-21 Thread Allison Wang (Jira)
Allison Wang created SPARK-45264: Summary: Configurable error when generating Python docs Key: SPARK-45264 URL: https://issues.apache.org/jira/browse/SPARK-45264 Project: Spark Issue Type: Su

[jira] [Created] (SPARK-45260) Refine docstring of count_distinct

2023-09-21 Thread Allison Wang (Jira)
Allison Wang created SPARK-45260: Summary: Refine docstring of count_distinct Key: SPARK-45260 URL: https://issues.apache.org/jira/browse/SPARK-45260 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-45259) Refine docstring of `count`

2023-09-21 Thread Allison Wang (Jira)
Allison Wang created SPARK-45259: Summary: Refine docstring of `count` Key: SPARK-45259 URL: https://issues.apache.org/jira/browse/SPARK-45259 Project: Spark Issue Type: Sub-task Co

[jira] [Created] (SPARK-45258) Refine docstring of `sum`

2023-09-21 Thread Allison Wang (Jira)
Allison Wang created SPARK-45258: Summary: Refine docstring of `sum` Key: SPARK-45258 URL: https://issues.apache.org/jira/browse/SPARK-45258 Project: Spark Issue Type: Sub-task Comp

[jira] [Updated] (SPARK-45220) Refine docstring of `DataFrame.join`

2023-09-19 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-45220?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-45220: - Description: Refine the docstring of `DataFrame.join`. The examples should also include: left j

[jira] [Created] (SPARK-45223) Refine docstring of `Column.when`

2023-09-19 Thread Allison Wang (Jira)
Allison Wang created SPARK-45223: Summary: Refine docstring of `Column.when` Key: SPARK-45223 URL: https://issues.apache.org/jira/browse/SPARK-45223 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-45222) Refine docstring of `DataFrameReader.json`

2023-09-19 Thread Allison Wang (Jira)
Allison Wang created SPARK-45222: Summary: Refine docstring of `DataFrameReader.json` Key: SPARK-45222 URL: https://issues.apache.org/jira/browse/SPARK-45222 Project: Spark Issue Type: Sub-ta

[jira] [Created] (SPARK-45221) Refine docstring of `DataFrameReader.parquet`

2023-09-19 Thread Allison Wang (Jira)
Allison Wang created SPARK-45221: Summary: Refine docstring of `DataFrameReader.parquet` Key: SPARK-45221 URL: https://issues.apache.org/jira/browse/SPARK-45221 Project: Spark Issue Type: Sub

[jira] [Created] (SPARK-45220) Refine docstring of `DataFrame.join`

2023-09-19 Thread Allison Wang (Jira)
Allison Wang created SPARK-45220: Summary: Refine docstring of `DataFrame.join` Key: SPARK-45220 URL: https://issues.apache.org/jira/browse/SPARK-45220 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-45219) Refine docstring of `DataFrame.withColumnRenamed`

2023-09-19 Thread Allison Wang (Jira)
Allison Wang created SPARK-45219: Summary: Refine docstring of `DataFrame.withColumnRenamed` Key: SPARK-45219 URL: https://issues.apache.org/jira/browse/SPARK-45219 Project: Spark Issue Type:

[jira] [Created] (SPARK-45218) Refine docstring of `Column.isin`

2023-09-19 Thread Allison Wang (Jira)
Allison Wang created SPARK-45218: Summary: Refine docstring of `Column.isin` Key: SPARK-45218 URL: https://issues.apache.org/jira/browse/SPARK-45218 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-45119) Refine docstring of `inline`

2023-09-11 Thread Allison Wang (Jira)
Allison Wang created SPARK-45119: Summary: Refine docstring of `inline` Key: SPARK-45119 URL: https://issues.apache.org/jira/browse/SPARK-45119 Project: Spark Issue Type: Sub-task C

[jira] [Created] (SPARK-45107) Refine docstring of `explode`

2023-09-08 Thread Allison Wang (Jira)
Allison Wang created SPARK-45107: Summary: Refine docstring of `explode` Key: SPARK-45107 URL: https://issues.apache.org/jira/browse/SPARK-45107 Project: Spark Issue Type: Sub-task

[jira] [Resolved] (SPARK-44819) Make Python the first language in all Spark code snippet

2023-09-08 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44819?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang resolved SPARK-44819. -- Resolution: Duplicate Fixed in https://issues.apache.org/jira/browse/SPARK-42642 > Make Pytho

[jira] [Created] (SPARK-45083) Refine docstring of `min`

2023-09-05 Thread Allison Wang (Jira)
Allison Wang created SPARK-45083: Summary: Refine docstring of `min` Key: SPARK-45083 URL: https://issues.apache.org/jira/browse/SPARK-45083 Project: Spark Issue Type: Sub-task Comp

[jira] [Created] (SPARK-45058) Refine the docstring of `DataFrame.distinct`

2023-09-01 Thread Allison Wang (Jira)
Allison Wang created SPARK-45058: Summary: Refine the docstring of `DataFrame.distinct` Key: SPARK-45058 URL: https://issues.apache.org/jira/browse/SPARK-45058 Project: Spark Issue Type: Sub-

[jira] [Updated] (SPARK-45058) Refine docstring of `DataFrame.distinct`

2023-09-01 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-45058?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-45058: - Summary: Refine docstring of `DataFrame.distinct` (was: Refine the docstring of `DataFrame.dist

[jira] [Created] (SPARK-45038) Refine docstring of `max`

2023-08-31 Thread Allison Wang (Jira)
Allison Wang created SPARK-45038: Summary: Refine docstring of `max` Key: SPARK-45038 URL: https://issues.apache.org/jira/browse/SPARK-45038 Project: Spark Issue Type: Sub-task Comp

[jira] [Updated] (SPARK-45023) SPIP: Python Stored Procedures

2023-08-30 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-45023?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-45023: - Shepherd: Hyukjin Kwon > SPIP: Python Stored Procedures > -- > >

[jira] [Created] (SPARK-45023) SPIP: Python Stored Procedures

2023-08-30 Thread Allison Wang (Jira)
Allison Wang created SPARK-45023: Summary: SPIP: Python Stored Procedures Key: SPARK-45023 URL: https://issues.apache.org/jira/browse/SPARK-45023 Project: Spark Issue Type: Improvement

[jira] [Created] (SPARK-45011) Refine docstring of `Column.between`

2023-08-29 Thread Allison Wang (Jira)
Allison Wang created SPARK-45011: Summary: Refine docstring of `Column.between` Key: SPARK-45011 URL: https://issues.apache.org/jira/browse/SPARK-45011 Project: Spark Issue Type: Sub-task

[jira] [Updated] (SPARK-44994) Refine docstring of `DataFrame.filter`

2023-08-28 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44994?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-44994: - Summary: Refine docstring of `DataFrame.filter` (was: Refine docstring for `DataFrame.filter`)

[jira] [Created] (SPARK-44994) Refine the docstring of `DataFrame.filter`

2023-08-28 Thread Allison Wang (Jira)
Allison Wang created SPARK-44994: Summary: Refine the docstring of `DataFrame.filter` Key: SPARK-44994 URL: https://issues.apache.org/jira/browse/SPARK-44994 Project: Spark Issue Type: Sub-ta

[jira] [Updated] (SPARK-44994) Refine docstring for `DataFrame.filter`

2023-08-28 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44994?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-44994: - Summary: Refine docstring for `DataFrame.filter` (was: Refine the docstring of `DataFrame.filte

[jira] [Updated] (SPARK-44899) Refine the docstring of `DataFrame.collect`

2023-08-21 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44899?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-44899: - Summary: Refine the docstring of `DataFrame.collect` (was: Refine the docstring of `DataFrame.c

[jira] [Created] (SPARK-44899) Refine the docstring of `DataFrame.collect()`

2023-08-21 Thread Allison Wang (Jira)
Allison Wang created SPARK-44899: Summary: Refine the docstring of `DataFrame.collect()` Key: SPARK-44899 URL: https://issues.apache.org/jira/browse/SPARK-44899 Project: Spark Issue Type: Sub

[jira] [Created] (SPARK-44879) Refine docstring of `createDataFrame`

2023-08-18 Thread Allison Wang (Jira)
Allison Wang created SPARK-44879: Summary: Refine docstring of `createDataFrame` Key: SPARK-44879 URL: https://issues.apache.org/jira/browse/SPARK-44879 Project: Spark Issue Type: Sub-task

[jira] [Updated] (SPARK-44728) Improve PySpark documentations

2023-08-18 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44728?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-44728: - Affects Version/s: 3.5.0 > Improve PySpark documentations > -- > >

[jira] [Created] (SPARK-44858) Refine docstring of `DataFrame.isEmpty`

2023-08-17 Thread Allison Wang (Jira)
Allison Wang created SPARK-44858: Summary: Refine docstring of `DataFrame.isEmpty` Key: SPARK-44858 URL: https://issues.apache.org/jira/browse/SPARK-44858 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-44856) Improve Python UDTF arrow serializer performance

2023-08-17 Thread Allison Wang (Jira)
Allison Wang created SPARK-44856: Summary: Improve Python UDTF arrow serializer performance Key: SPARK-44856 URL: https://issues.apache.org/jira/browse/SPARK-44856 Project: Spark Issue Type:

[jira] [Created] (SPARK-44853) Refine docstring of `DataFrame.columns` property

2023-08-17 Thread Allison Wang (Jira)
Allison Wang created SPARK-44853: Summary: Refine docstring of `DataFrame.columns` property Key: SPARK-44853 URL: https://issues.apache.org/jira/browse/SPARK-44853 Project: Spark Issue Type:

[jira] [Created] (SPARK-44834) Add SQL query test suites for Python UDTFs

2023-08-16 Thread Allison Wang (Jira)
Allison Wang created SPARK-44834: Summary: Add SQL query test suites for Python UDTFs Key: SPARK-44834 URL: https://issues.apache.org/jira/browse/SPARK-44834 Project: Spark Issue Type: Sub-ta

[jira] [Created] (SPARK-44822) Make Python UDTFs by default non-deterministic

2023-08-15 Thread Allison Wang (Jira)
Allison Wang created SPARK-44822: Summary: Make Python UDTFs by default non-deterministic Key: SPARK-44822 URL: https://issues.apache.org/jira/browse/SPARK-44822 Project: Spark Issue Type: Su

[jira] [Updated] (SPARK-44820) Switch languages consistently across docs for all code snippets

2023-08-15 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44820?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-44820: - Description: When a user chooses a different language for a code snippet, all code snippets on

[jira] [Updated] (SPARK-44819) Make Python the first language in all Spark code snippet

2023-08-15 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44819?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-44819: - Attachment: Screenshot 2023-08-15 at 11.59.11.png > Make Python the first language in all Spark

[jira] [Updated] (SPARK-44819) Make Python the first language in all Spark code snippet

2023-08-15 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44819?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-44819: - Description: Currently, the first and default language for all code snippets is Scala. For inst

[jira] [Updated] (SPARK-44819) Make Python the first language in all Spark code snippet

2023-08-15 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44819?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-44819: - Description: Currently, the first and default language for all code snippets is Scala. We shoul

[jira] [Created] (SPARK-44820) Switch languages consistently across docs for all code snippets

2023-08-15 Thread Allison Wang (Jira)
Allison Wang created SPARK-44820: Summary: Switch languages consistently across docs for all code snippets Key: SPARK-44820 URL: https://issues.apache.org/jira/browse/SPARK-44820 Project: Spark

[jira] [Created] (SPARK-44819) Make Python the first language in all Spark code snippet

2023-08-15 Thread Allison Wang (Jira)
Allison Wang created SPARK-44819: Summary: Make Python the first language in all Spark code snippet Key: SPARK-44819 URL: https://issues.apache.org/jira/browse/SPARK-44819 Project: Spark Issu

[jira] [Created] (SPARK-44766) Cache the pandas converter for Python UDTFs

2023-08-10 Thread Allison Wang (Jira)
Allison Wang created SPARK-44766: Summary: Cache the pandas converter for Python UDTFs Key: SPARK-44766 URL: https://issues.apache.org/jira/browse/SPARK-44766 Project: Spark Issue Type: Sub-t

[jira] [Created] (SPARK-44746) Improve the documentation for TABLE input arguments for UDTFs

2023-08-09 Thread Allison Wang (Jira)
Allison Wang created SPARK-44746: Summary: Improve the documentation for TABLE input arguments for UDTFs Key: SPARK-44746 URL: https://issues.apache.org/jira/browse/SPARK-44746 Project: Spark

[jira] [Updated] (SPARK-44508) Add user guide for Python UDTFs

2023-08-09 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44508?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-44508: - Component/s: Documentation > Add user guide for Python UDTFs > --- >

[jira] [Created] (SPARK-44742) Add Spark version drop down to the PySpark doc site

2023-08-09 Thread Allison Wang (Jira)
Allison Wang created SPARK-44742: Summary: Add Spark version drop down to the PySpark doc site Key: SPARK-44742 URL: https://issues.apache.org/jira/browse/SPARK-44742 Project: Spark Issue Typ

[jira] [Updated] (SPARK-44734) Add documentation for type casting rules in Python UDFs/UDTFs

2023-08-08 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44734?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-44734: - Description: In addition to type mappings between Spark data types and Python data types (SPARK

[jira] [Created] (SPARK-44734) Add documentation for type casting rules in Python UDFs/UDTFs

2023-08-08 Thread Allison Wang (Jira)
Allison Wang created SPARK-44734: Summary: Add documentation for type casting rules in Python UDFs/UDTFs Key: SPARK-44734 URL: https://issues.apache.org/jira/browse/SPARK-44734 Project: Spark

[jira] [Updated] (SPARK-44733) Add documentation for type mappings between Spark and Python data types

2023-08-08 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44733?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-44733: - Summary: Add documentation for type mappings between Spark and Python data types (was: Add type

[jira] [Updated] (SPARK-44733) Add type mappings between Spark data types and Python types

2023-08-08 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44733?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-44733: - Summary: Add type mappings between Spark data types and Python types (was: Add type mappings be

[jira] [Created] (SPARK-44733) Add type mappings between Spark data type and Python type

2023-08-08 Thread Allison Wang (Jira)
Allison Wang created SPARK-44733: Summary: Add type mappings between Spark data type and Python type Key: SPARK-44733 URL: https://issues.apache.org/jira/browse/SPARK-44733 Project: Spark Iss

[jira] [Created] (SPARK-44729) Add canonical links to the PySpark docs page

2023-08-08 Thread Allison Wang (Jira)
Allison Wang created SPARK-44729: Summary: Add canonical links to the PySpark docs page Key: SPARK-44729 URL: https://issues.apache.org/jira/browse/SPARK-44729 Project: Spark Issue Type: Sub-

[jira] [Created] (SPARK-44728) Improve PySpark documentations

2023-08-08 Thread Allison Wang (Jira)
Allison Wang created SPARK-44728: Summary: Improve PySpark documentations Key: SPARK-44728 URL: https://issues.apache.org/jira/browse/SPARK-44728 Project: Spark Issue Type: Umbrella

[jira] [Updated] (SPARK-44508) Add user guide for Python UDTFs

2023-08-07 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44508?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-44508: - Priority: Major (was: Blocker) > Add user guide for Python UDTFs >

[jira] [Commented] (SPARK-44508) Add user guide for Python UDTFs

2023-08-07 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44508?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17751807#comment-17751807 ] Allison Wang commented on SPARK-44508: -- [~holden] Yup you're right. This isn't a bl

[jira] [Updated] (SPARK-44005) Improve error messages for regular Python UDTFs that return non-tuple values

2023-08-04 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44005?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-44005: - Summary: Improve error messages for regular Python UDTFs that return non-tuple values (was: Imp

[jira] [Updated] (SPARK-44005) Improve error messages when regular Python UDTFs that return non-tuple values

2023-08-04 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44005?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-44005: - Summary: Improve error messages when regular Python UDTFs that return non-tuple values (was: Su

[jira] [Updated] (SPARK-44005) Support returning non-tuple values for regular Python UDTFs

2023-08-04 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44005?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-44005: - Description: Currently, if you have a UDTF like this: {code:java} class TestUDTF: def eval(s

[jira] [Updated] (SPARK-43797) Python User-defined Table Functions

2023-08-04 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-43797?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-43797: - Affects Version/s: 4.0.0 > Python User-defined Table Functions > ---

[jira] [Updated] (SPARK-44009) Support profiler for Python UDTFs

2023-08-03 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44009?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-44009: - Summary: Support profiler for Python UDTFs (was: Support memory_profiler for UDTFs ) > Suppor

[jira] [Updated] (SPARK-44663) Disable arrow optimization by default for Python UDTFs

2023-08-03 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44663?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-44663: - Summary: Disable arrow optimization by default for Python UDTFs (was: Disable arrow optimizatio

[jira] [Created] (SPARK-44663) Disable arrow optimization by default

2023-08-03 Thread Allison Wang (Jira)
Allison Wang created SPARK-44663: Summary: Disable arrow optimization by default Key: SPARK-44663 URL: https://issues.apache.org/jira/browse/SPARK-44663 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-44644) Improve error messages for creating Python UDTFs with pickling errors

2023-08-02 Thread Allison Wang (Jira)
Allison Wang created SPARK-44644: Summary: Improve error messages for creating Python UDTFs with pickling errors Key: SPARK-44644 URL: https://issues.apache.org/jira/browse/SPARK-44644 Project: Spark

[jira] [Updated] (SPARK-44640) Improve error messages for Python UDTF returning non iterable

2023-08-02 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44640?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-44640: - Summary: Improve error messages for Python UDTF returning non iterable (was: Improve error mess

[jira] [Created] (SPARK-44640) Improve error messages for invalid Python UDTF return type

2023-08-02 Thread Allison Wang (Jira)
Allison Wang created SPARK-44640: Summary: Improve error messages for invalid Python UDTF return type Key: SPARK-44640 URL: https://issues.apache.org/jira/browse/SPARK-44640 Project: Spark Is

[jira] (SPARK-44559) Improve error messages for Python UDTF arrow type casts

2023-08-02 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44559 ] Allison Wang deleted comment on SPARK-44559: -- was (Author: allisonwang-db): Resolved by [https://github.com/apache/spark/pull/42191] > Improve error messages for Python UDTF arrow type cast

[jira] [Reopened] (SPARK-44561) Fix AssertionError when converting UDTF output to a complex type

2023-08-02 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44561?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang reopened SPARK-44561: -- > Fix AssertionError when converting UDTF output to a complex type > -

[jira] [Resolved] (SPARK-44559) Improve error messages for Python UDTF arrow type casts

2023-08-02 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44559?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang resolved SPARK-44559. -- Fix Version/s: 3.5.0 Target Version/s: 3.5.0 Resolution: Fixed Resolved by [h

[jira] [Commented] (SPARK-44559) Improve error messages for Python UDTF arrow type casts

2023-08-02 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44559?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17750419#comment-17750419 ] Allison Wang commented on SPARK-44559: -- Resolved by [https://github.com/apache/spar

[jira] [Updated] (SPARK-44508) Add user guide for Python UDTFs

2023-07-31 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44508?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-44508: - Summary: Add user guide for Python UDTFs (was: Add user guide and documentation for Python UDTF

[jira] [Updated] (SPARK-44559) Improve error messages for Python UDTF arrow type casts

2023-07-27 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44559?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-44559: - Summary: Improve error messages for Python UDTF arrow type casts (was: Improve error messages f

[jira] [Created] (SPARK-44561) Fix AssertionError when converting UDTF output to a complex type

2023-07-26 Thread Allison Wang (Jira)
Allison Wang created SPARK-44561: Summary: Fix AssertionError when converting UDTF output to a complex type Key: SPARK-44561 URL: https://issues.apache.org/jira/browse/SPARK-44561 Project: Spark

[jira] [Created] (SPARK-44559) Improve error messages for invalid Python UDTF arrow type casts

2023-07-26 Thread Allison Wang (Jira)
Allison Wang created SPARK-44559: Summary: Improve error messages for invalid Python UDTF arrow type casts Key: SPARK-44559 URL: https://issues.apache.org/jira/browse/SPARK-44559 Project: Spark

[jira] [Updated] (SPARK-43968) Improve error messages for Python UDTFs with wrong number of outputs

2023-07-25 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-43968?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-43968: - Description: Improve the error messages for Python UDTFs when the number of outputs mismatches t

[jira] [Updated] (SPARK-43968) Improve error messages for Python UDTFs with wrong number of outputs

2023-07-25 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-43968?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-43968: - Summary: Improve error messages for Python UDTFs with wrong number of outputs (was: Add more co

[jira] [Updated] (SPARK-44005) Support returning non-tuple values for regular Python UDTFs

2023-07-25 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44005?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-44005: - Description: Currently, if you have a UDTF like this: {code:java} class TestUDTF: def eval(s

[jira] [Updated] (SPARK-44005) Support returning non-tuple values for regular Python UDTFs

2023-07-25 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44005?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-44005: - Description: Currently, if you have a UDTF like this: {code:java} class TestUDTF: def eval(s

[jira] [Updated] (SPARK-44005) Support returning non-tuple values for regular Python UDTFs

2023-07-25 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44005?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-44005: - Summary: Support returning non-tuple values for regular Python UDTFs (was: Support returning a

[jira] [Updated] (SPARK-44005) Support returning a non-tuple value for regular Python UDTFs

2023-07-25 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44005?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-44005: - Summary: Support returning a non-tuple value for regular Python UDTFs (was: Improve the error m

[jira] [Created] (SPARK-44508) Add user guide and documentation for Python UDTFs

2023-07-20 Thread Allison Wang (Jira)
Allison Wang created SPARK-44508: Summary: Add user guide and documentation for Python UDTFs Key: SPARK-44508 URL: https://issues.apache.org/jira/browse/SPARK-44508 Project: Spark Issue Type:

[jira] [Updated] (SPARK-43967) Support Python UDTFs with empty return values

2023-07-13 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-43967?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-43967: - Description: Support UDTFs with empty returns, for example: {code:java} @udtf(returnType="a: int

[jira] [Updated] (SPARK-44249) Refactor PythonUDTFRunner to send its return type separately

2023-07-03 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44249?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-44249: - Parent: SPARK-43797 Issue Type: Sub-task (was: Improvement) > Refactor PythonUDTFRunner

[jira] [Created] (SPARK-44076) SPIP: Python Data Source API

2023-06-15 Thread Allison Wang (Jira)
Allison Wang created SPARK-44076: Summary: SPIP: Python Data Source API Key: SPARK-44076 URL: https://issues.apache.org/jira/browse/SPARK-44076 Project: Spark Issue Type: New Feature

[jira] [Created] (SPARK-44009) Support memory_profiler for UDTFs

2023-06-08 Thread Allison Wang (Jira)
Allison Wang created SPARK-44009: Summary: Support memory_profiler for UDTFs Key: SPARK-44009 URL: https://issues.apache.org/jira/browse/SPARK-44009 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-44008) Include the name of the UDTF in the error messages generated during the function execution

2023-06-08 Thread Allison Wang (Jira)
Allison Wang created SPARK-44008: Summary: Include the name of the UDTF in the error messages generated during the function execution Key: SPARK-44008 URL: https://issues.apache.org/jira/browse/SPARK-44008

[jira] [Created] (SPARK-44005) Improve the error messages when a UDTF returns a non-tuple value

2023-06-07 Thread Allison Wang (Jira)
Allison Wang created SPARK-44005: Summary: Improve the error messages when a UDTF returns a non-tuple value Key: SPARK-44005 URL: https://issues.apache.org/jira/browse/SPARK-44005 Project: Spark

[jira] [Created] (SPARK-43968) Add more compile-time checks when creating Python UDTFs

2023-06-04 Thread Allison Wang (Jira)
Allison Wang created SPARK-43968: Summary: Add more compile-time checks when creating Python UDTFs Key: SPARK-43968 URL: https://issues.apache.org/jira/browse/SPARK-43968 Project: Spark Issue

[jira] [Updated] (SPARK-43967) Support Python UDTFs with empty return values

2023-06-04 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-43967?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-43967: - Description: Support UDTFs with empty returns, for example: {code:java} @udtf(returnType="a: int

[jira] [Created] (SPARK-43967) Support Python UDTFs with empty return values

2023-06-04 Thread Allison Wang (Jira)
Allison Wang created SPARK-43967: Summary: Support Python UDTFs with empty return values Key: SPARK-43967 URL: https://issues.apache.org/jira/browse/SPARK-43967 Project: Spark Issue Type: Sub

[jira] [Created] (SPARK-43966) Support non-deterministic Python UDTFs

2023-06-04 Thread Allison Wang (Jira)
Allison Wang created SPARK-43966: Summary: Support non-deterministic Python UDTFs Key: SPARK-43966 URL: https://issues.apache.org/jira/browse/SPARK-43966 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-43965) Support Python UDTFs in Spark Connect

2023-06-04 Thread Allison Wang (Jira)
Allison Wang created SPARK-43965: Summary: Support Python UDTFs in Spark Connect Key: SPARK-43965 URL: https://issues.apache.org/jira/browse/SPARK-43965 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-43964) Support arrow-optimized Python UDTFs

2023-06-04 Thread Allison Wang (Jira)
Allison Wang created SPARK-43964: Summary: Support arrow-optimized Python UDTFs Key: SPARK-43964 URL: https://issues.apache.org/jira/browse/SPARK-43964 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-43798) Initial support for Python UDTFs with batch eval

2023-05-25 Thread Allison Wang (Jira)
Allison Wang created SPARK-43798: Summary: Initial support for Python UDTFs with batch eval Key: SPARK-43798 URL: https://issues.apache.org/jira/browse/SPARK-43798 Project: Spark Issue Type:

[jira] [Updated] (SPARK-43798) Initial support for Python UDTFs

2023-05-25 Thread Allison Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-43798?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Allison Wang updated SPARK-43798: - Summary: Initial support for Python UDTFs (was: Initial support for Python UDTFs with batch eva

[jira] [Created] (SPARK-43797) Python User-defined Table Functions

2023-05-25 Thread Allison Wang (Jira)
Allison Wang created SPARK-43797: Summary: Python User-defined Table Functions Key: SPARK-43797 URL: https://issues.apache.org/jira/browse/SPARK-43797 Project: Spark Issue Type: Umbrella

[jira] [Created] (SPARK-43375) Improve the error messages for INVALID_CONNECT_URL

2023-05-03 Thread Allison Wang (Jira)
Allison Wang created SPARK-43375: Summary: Improve the error messages for INVALID_CONNECT_URL Key: SPARK-43375 URL: https://issues.apache.org/jira/browse/SPARK-43375 Project: Spark Issue Type
