[jira] [Updated] (SPARK-37971) Apply and evaluate expressions row-wise in a Spark DataFrame

2022-01-20 Thread Carlos Gameiro (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37971?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Carlos Gameiro updated SPARK-37971: --- Description: This functionality would serve very specific use cases. Consider a DataFrame

[jira] [Updated] (SPARK-37971) Apply and evaluate expressions row-wise in a Spark DataFrame

2022-01-20 Thread Carlos Gameiro (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37971?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Carlos Gameiro updated SPARK-37971: --- Summary: Apply and evaluate expressions row-wise in a Spark DataFrame (was: Apply and

[jira] [Updated] (SPARK-37971) Apply and evaluate expressiosn row-wise in a Spark DataFrame

2022-01-20 Thread Carlos Gameiro (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37971?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Carlos Gameiro updated SPARK-37971: --- Description: This functionality would serve very specific use cases. Consider a DataFrame

[jira] [Updated] (SPARK-37971) Apply and evaluate expressiosn row-wise in a Spark DataFrame

2022-01-20 Thread Carlos Gameiro (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37971?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Carlos Gameiro updated SPARK-37971: --- Summary: Apply and evaluate expressiosn row-wise in a Spark DataFrame (was: Apply and

[jira] [Updated] (SPARK-37971) Apply and evaluate expressiosn row-wise in a DataFrame

2022-01-20 Thread Carlos Gameiro (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37971?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Carlos Gameiro updated SPARK-37971: --- Description: This functionality would serve very specific use cases. Consider a DataFrame

[jira] [Created] (SPARK-37971) Apply and evaluate expressiosn row-wise in a DataFrame

2022-01-20 Thread Carlos Gameiro (Jira)
Carlos Gameiro created SPARK-37971: -- Summary: Apply and evaluate expressiosn row-wise in a DataFrame Key: SPARK-37971 URL: https://issues.apache.org/jira/browse/SPARK-37971 Project: Spark

[jira] [Comment Edited] (SPARK-37449) Side effects between PySpark Pandas UDF and Numpy indexing

2021-11-24 Thread Carlos Gameiro (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37449?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17448500#comment-17448500 ] Carlos Gameiro edited comment on SPARK-37449 at 11/24/21, 10:50 AM:

[jira] [Comment Edited] (SPARK-37449) Side effects between PySpark Pandas UDF and Numpy indexing

2021-11-24 Thread Carlos Gameiro (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37449?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17448539#comment-17448539 ] Carlos Gameiro edited comment on SPARK-37449 at 11/24/21, 10:49 AM:

[jira] [Commented] (SPARK-37449) Side effects between PySpark Pandas UDF and Numpy indexing

2021-11-24 Thread Carlos Gameiro (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37449?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17448539#comment-17448539 ] Carlos Gameiro commented on SPARK-37449: Sometimes there is no natural way to group a dataframe

[jira] [Commented] (SPARK-37449) Side effects between PySpark Pandas UDF and Numpy indexing

2021-11-24 Thread Carlos Gameiro (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37449?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17448500#comment-17448500 ] Carlos Gameiro commented on SPARK-37449: You are right. I'm selecting the first 4 indexes of

[jira] [Updated] (SPARK-37449) Side effects between PySpark Pandas UDF and Numpy indexing

2021-11-23 Thread Carlos Gameiro (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37449?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Carlos Gameiro updated SPARK-37449: --- Description: Let's create a simple Pandas Dataframe with a single column named 'id' with a

[jira] [Updated] (SPARK-37449) Side effects between PySpark Pandas UDF and Numpy indexing

2021-11-23 Thread Carlos Gameiro (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37449?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Carlos Gameiro updated SPARK-37449: --- Description: Let's create a simple Pandas Dataframe with a single column named 'id' that

[jira] [Updated] (SPARK-37449) Side effects between PySpark Pandas UDF and Numpy indexing

2021-11-23 Thread Carlos Gameiro (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37449?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Carlos Gameiro updated SPARK-37449: --- Description: Let's create a simple Pandas Dataframe with a single column named 'id' with a

[jira] [Updated] (SPARK-37449) Side effects between PySpark and Numpy

2021-11-23 Thread Carlos Gameiro (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37449?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Carlos Gameiro updated SPARK-37449: --- Description: I'm using pygeos 0.11.1. Let's create a simple Pandas Dataframe with a single

[jira] [Updated] (SPARK-37449) Side effects between PySpark Pandas UDF and Numpy indexing

2021-11-23 Thread Carlos Gameiro (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37449?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Carlos Gameiro updated SPARK-37449: --- Summary: Side effects between PySpark Pandas UDF and Numpy indexing (was: Side effects

[jira] [Updated] (SPARK-37449) Side effects between PySpark and Numpy

2021-11-23 Thread Carlos Gameiro (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37449?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Carlos Gameiro updated SPARK-37449: --- Description: I'm using pygeos 0.11.1. Let's create a simple Pandas Dataframe with a single

[jira] [Updated] (SPARK-37449) Side effects between PySpark and Numpy

2021-11-23 Thread Carlos Gameiro (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37449?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Carlos Gameiro updated SPARK-37449: --- Summary: Side effects between PySpark and Numpy (was: Side effects between PySpark, Numpy

[jira] [Updated] (SPARK-37449) Side effects between PySpark, Numpy and Pygeos

2021-11-23 Thread Carlos Gameiro (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37449?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Carlos Gameiro updated SPARK-37449: --- Priority: Critical (was: Major) > Side effects between PySpark, Numpy and Pygeos >

[jira] [Updated] (SPARK-37449) Side effects between PySpark, Numpy and Pygeos

2021-11-23 Thread Carlos Gameiro (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37449?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Carlos Gameiro updated SPARK-37449: --- Labels: applyInPandas (was: ) > Side effects between PySpark, Numpy and Pygeos >

[jira] [Updated] (SPARK-37449) Side effects between PySpark, Numpy and Pygeos

2021-11-23 Thread Carlos Gameiro (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37449?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Carlos Gameiro updated SPARK-37449: --- Labels: NumPy Pandas Pygeos UDF applyInPandas (was: applyInPandas) > Side effects between

[jira] [Updated] (SPARK-37449) Side effects between PySpark, Numpy and Pygeos

2021-11-23 Thread Carlos Gameiro (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37449?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Carlos Gameiro updated SPARK-37449: --- Description: I'm using pygeos 0.11.1. Let's create a simple Pandas Dataframe with a single

[jira] [Created] (SPARK-37449) Side effects between PySpark, Numpy and Pygeos

2021-11-23 Thread Carlos Gameiro (Jira)
Carlos Gameiro created SPARK-37449: -- Summary: Side effects between PySpark, Numpy and Pygeos Key: SPARK-37449 URL: https://issues.apache.org/jira/browse/SPARK-37449 Project: Spark Issue