[
https://issues.apache.org/jira/browse/SPARK-37971?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Carlos Gameiro updated SPARK-37971:
---
Description:
This functionality would serve very specific use cases.
Consider a DataFrame
[
https://issues.apache.org/jira/browse/SPARK-37971?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Carlos Gameiro updated SPARK-37971:
---
Description:
This functionality would serve very specific use cases.
Consider a DataFrame
[
https://issues.apache.org/jira/browse/SPARK-37971?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Carlos Gameiro updated SPARK-37971:
---
Summary: Apply and evaluate expressions row-wise in a Spark DataFrame
(was: Apply and
[
https://issues.apache.org/jira/browse/SPARK-37971?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Carlos Gameiro updated SPARK-37971:
---
Description:
This functionality would serve very specific use cases.
Consider a DataFrame
[
https://issues.apache.org/jira/browse/SPARK-37971?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Carlos Gameiro updated SPARK-37971:
---
Summary: Apply and evaluate expressiosn row-wise in a Spark DataFrame
(was: Apply and
[
https://issues.apache.org/jira/browse/SPARK-37971?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Carlos Gameiro updated SPARK-37971:
---
Description:
This functionality would serve very specific use cases.
Consider a DataFrame
[
https://issues.apache.org/jira/browse/SPARK-37971?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Carlos Gameiro updated SPARK-37971:
---
Description:
This functionality would serve very specific use cases.
Consider a DataFrame
[
https://issues.apache.org/jira/browse/SPARK-37971?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Carlos Gameiro updated SPARK-37971:
---
Description:
This functionality would serve very specific use cases.
Consider a DataFrame
Carlos Gameiro created SPARK-37971:
--
Summary: Apply and evaluate expressiosn row-wise in a DataFrame
Key: SPARK-37971
URL: https://issues.apache.org/jira/browse/SPARK-37971
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-37449?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17448500#comment-17448500
]
Carlos Gameiro edited comment on SPARK-37449 at 11/24/21, 10:50 AM:
[
https://issues.apache.org/jira/browse/SPARK-37449?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17448539#comment-17448539
]
Carlos Gameiro edited comment on SPARK-37449 at 11/24/21, 10:49 AM:
[
https://issues.apache.org/jira/browse/SPARK-37449?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17448539#comment-17448539
]
Carlos Gameiro edited comment on SPARK-37449 at 11/24/21, 10:49 AM:
[
https://issues.apache.org/jira/browse/SPARK-37449?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17448539#comment-17448539
]
Carlos Gameiro commented on SPARK-37449:
Sometimes there is no natural way to group a dataframe
[
https://issues.apache.org/jira/browse/SPARK-37449?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17448500#comment-17448500
]
Carlos Gameiro commented on SPARK-37449:
You are right. I'm selecting the first 4 indexes of
[
https://issues.apache.org/jira/browse/SPARK-37449?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Carlos Gameiro updated SPARK-37449:
---
Description:
Let's create a simple Pandas Dataframe with a single column named 'id' with a
[
https://issues.apache.org/jira/browse/SPARK-37449?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Carlos Gameiro updated SPARK-37449:
---
Description:
Let's create a simple Pandas Dataframe with a single column named 'id' that
[
https://issues.apache.org/jira/browse/SPARK-37449?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Carlos Gameiro updated SPARK-37449:
---
Description:
Let's create a simple Pandas Dataframe with a single column named 'id' with a
[
https://issues.apache.org/jira/browse/SPARK-37449?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Carlos Gameiro updated SPARK-37449:
---
Description:
I'm using pygeos 0.11.1.
Let's create a simple Pandas Dataframe with a single
[
https://issues.apache.org/jira/browse/SPARK-37449?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Carlos Gameiro updated SPARK-37449:
---
Summary: Side effects between PySpark Pandas UDF and Numpy indexing (was:
Side effects
[
https://issues.apache.org/jira/browse/SPARK-37449?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Carlos Gameiro updated SPARK-37449:
---
Description:
I'm using pygeos 0.11.1.
Let's create a simple Pandas Dataframe with a single
[
https://issues.apache.org/jira/browse/SPARK-37449?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Carlos Gameiro updated SPARK-37449:
---
Description:
I'm using pygeos 0.11.1.
Let's create a simple Pandas Dataframe with a single
[
https://issues.apache.org/jira/browse/SPARK-37449?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Carlos Gameiro updated SPARK-37449:
---
Summary: Side effects between PySpark and Numpy (was: Side effects between
PySpark, Numpy
[
https://issues.apache.org/jira/browse/SPARK-37449?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Carlos Gameiro updated SPARK-37449:
---
Priority: Critical (was: Major)
> Side effects between PySpark, Numpy and Pygeos
>
[
https://issues.apache.org/jira/browse/SPARK-37449?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Carlos Gameiro updated SPARK-37449:
---
Labels: applyInPandas (was: )
> Side effects between PySpark, Numpy and Pygeos
>
[
https://issues.apache.org/jira/browse/SPARK-37449?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Carlos Gameiro updated SPARK-37449:
---
Labels: NumPy Pandas Pygeos UDF applyInPandas (was: applyInPandas)
> Side effects between
[
https://issues.apache.org/jira/browse/SPARK-37449?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Carlos Gameiro updated SPARK-37449:
---
Description:
I'm using pygeos 0.11.1.
Let's create a simple Pandas Dataframe with a single
Carlos Gameiro created SPARK-37449:
--
Summary: Side effects between PySpark, Numpy and Pygeos
Key: SPARK-37449
URL: https://issues.apache.org/jira/browse/SPARK-37449
Project: Spark
Issue
27 matches