Vladimir Feinberg created SPARK-15809:
-
Summary: PySpark SQL UDF default returnType
Key: SPARK-15809
URL: https://issues.apache.org/jira/browse/SPARK-15809
Project: Spark
Issue Type:
Vladimir Feinberg created SPARK-15888:
-
Summary: UDF fails in Python
Key: SPARK-15888
URL: https://issues.apache.org/jira/browse/SPARK-15888
Project: Spark
Issue Type: Bug
Vladimir Feinberg created SPARK-15971:
-
Summary: GroupedData's member incorrectly named
Key: SPARK-15971
URL: https://issues.apache.org/jira/browse/SPARK-15971
Project: Spark
Issue Type:
Vladimir Feinberg created SPARK-15973:
-
Summary: GroupedData.pivot documentation off
Key: SPARK-15973
URL: https://issues.apache.org/jira/browse/SPARK-15973
Project: Spark
Issue Type:
[
https://issues.apache.org/jira/browse/SPARK-15973?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Vladimir Feinberg updated SPARK-15973:
--
Description:
(1)
{{GroupedData.pivot}} documentation uses {{//}} instead of {{#}} for
[
https://issues.apache.org/jira/browse/SPARK-15973?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Vladimir Feinberg updated SPARK-15973:
--
Summary: Fix GroupedData Documentation (was: GroupedData.pivot
documentation off)
>
[
https://issues.apache.org/jira/browse/SPARK-15971?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Vladimir Feinberg updated SPARK-15971:
--
Description:
The {{pyspark.sql.GroupedData}} object calls the Java object it wraps
[
https://issues.apache.org/jira/browse/SPARK-15972?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Vladimir Feinberg updated SPARK-15972:
--
Description:
Simple aggregation functions which take column names {{cols}} as varargs
[
https://issues.apache.org/jira/browse/SPARK-15973?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15332374#comment-15332374
]
Vladimir Feinberg commented on SPARK-15973:
---
Done
> Fix GroupedData Documentation
>
[
https://issues.apache.org/jira/browse/SPARK-15971?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Vladimir Feinberg resolved SPARK-15971.
---
Resolution: Duplicate
> GroupedData's member incorrectly named
>
[
https://issues.apache.org/jira/browse/SPARK-15972?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Vladimir Feinberg resolved SPARK-15972.
---
Resolution: Duplicate
> GroupedData varargs arguments misnamed
>
[
https://issues.apache.org/jira/browse/SPARK-15972?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Vladimir Feinberg closed SPARK-15972.
-
> GroupedData varargs arguments misnamed
> --
>
>
[
https://issues.apache.org/jira/browse/SPARK-15971?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Vladimir Feinberg closed SPARK-15971.
-
> GroupedData's member incorrectly named
> --
>
>
Vladimir Feinberg created SPARK-15972:
-
Summary: GroupedData varargs arguments misnamed
Key: SPARK-15972
URL: https://issues.apache.org/jira/browse/SPARK-15972
Project: Spark
Issue Type:
Vladimir Feinberg created SPARK-15993:
-
Summary: PySpark RuntimeConfig should be immutable
Key: SPARK-15993
URL: https://issues.apache.org/jira/browse/SPARK-15993
Project: Spark
Issue
Vladimir Feinberg created SPARK-15989:
-
Summary: PySpark SQL python-only UDTs don't support nested types
Key: SPARK-15989
URL: https://issues.apache.org/jira/browse/SPARK-15989
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-15989?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Vladimir Feinberg updated SPARK-15989:
--
Component/s: SQL
> PySpark SQL python-only UDTs don't support nested types
>
[
https://issues.apache.org/jira/browse/SPARK-15993?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15336381#comment-15336381
]
Vladimir Feinberg commented on SPARK-15993:
---
So the intent is that changing {{RuntimeConfig}}
[
https://issues.apache.org/jira/browse/SPARK-16175?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Vladimir Feinberg updated SPARK-16175:
--
Attachment: nullvector.dbc
databricks nb demonstrating the issue
> Handle None for
Vladimir Feinberg created SPARK-16179:
-
Summary: UDF explosion yielding empty dataframe fails
Key: SPARK-16179
URL: https://issues.apache.org/jira/browse/SPARK-16179
Project: Spark
Issue
Vladimir Feinberg created SPARK-16237:
-
Summary: PySpark gapply
Key: SPARK-16237
URL: https://issues.apache.org/jira/browse/SPARK-16237
Project: Spark
Issue Type: New Feature
[
https://issues.apache.org/jira/browse/SPARK-16718?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15392914#comment-15392914
]
Vladimir Feinberg commented on SPARK-16718:
---
L1 support for loss-based impurity will be delayed
Vladimir Feinberg created SPARK-16728:
-
Summary: migrate internal API for MLlib trees from spark.mllib to
spark.ml
Key: SPARK-16728
URL: https://issues.apache.org/jira/browse/SPARK-16728
Project:
Vladimir Feinberg created SPARK-16739:
-
Summary: GBTClassifier should be a Classifier, not Predictor
Key: SPARK-16739
URL: https://issues.apache.org/jira/browse/SPARK-16739
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-16718?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Vladimir Feinberg updated SPARK-16718:
--
Description:
As an initial minimal change, we should provide TreeBoost as implemented
[
https://issues.apache.org/jira/browse/SPARK-16504?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15373769#comment-15373769
]
Vladimir Feinberg commented on SPARK-16504:
---
fwiw {{merge}} has type {{(MAB, Row):Unit}}
Vladimir Feinberg created SPARK-16551:
-
Summary: Accumulator Examples should demonstrate different use
case from UDAFs
Key: SPARK-16551
URL: https://issues.apache.org/jira/browse/SPARK-16551
[
https://issues.apache.org/jira/browse/SPARK-4240?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15366502#comment-15366502
]
Vladimir Feinberg edited comment on SPARK-4240 at 7/22/16 4:47 PM:
---
[
https://issues.apache.org/jira/browse/SPARK-16718?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Vladimir Feinberg updated SPARK-16718:
--
Description:
As an initial minimal change, we should provide TreeBoost as implemented
Vladimir Feinberg created SPARK-16718:
-
Summary: gbm-style treeboost
Key: SPARK-16718
URL: https://issues.apache.org/jira/browse/SPARK-16718
Project: Spark
Issue Type: Sub-task
[
https://issues.apache.org/jira/browse/SPARK-16718?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Vladimir Feinberg updated SPARK-16718:
--
Description:
As an initial minimal change, we should provide TreeBoost as implemented
[
https://issues.apache.org/jira/browse/SPARK-16900?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15414455#comment-15414455
]
Vladimir Feinberg commented on SPARK-16900:
---
Alternatively, if we could have some way of
[
https://issues.apache.org/jira/browse/SPARK-16572?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Vladimir Feinberg closed SPARK-16572.
-
Resolution: Fixed
Layout is just not github-compatible.
> DStream Kinesis Connector Doc
Vladimir Feinberg created SPARK-16572:
-
Summary: DStream Kinesis Connector Doc formatting
Key: SPARK-16572
URL: https://issues.apache.org/jira/browse/SPARK-16572
Project: Spark
Issue
[
https://issues.apache.org/jira/browse/SPARK-10931?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15371420#comment-15371420
]
Vladimir Feinberg commented on SPARK-10931:
---
[~josephkb] The intention of this JIRA is a bit
Vladimir Feinberg created SPARK-16504:
-
Summary: UDAF should be typed
Key: SPARK-16504
URL: https://issues.apache.org/jira/browse/SPARK-16504
Project: Spark
Issue Type: Improvement
[
https://issues.apache.org/jira/browse/SPARK-16237?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15353495#comment-15353495
]
Vladimir Feinberg commented on SPARK-16237:
---
cc [~mengxr] [~thunterdb] [~josephkb] Comments re
[
https://issues.apache.org/jira/browse/SPARK-16237?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Vladimir Feinberg updated SPARK-16237:
--
Description:
To maintain feature parity, `gapply` functionality should be added to
[
https://issues.apache.org/jira/browse/SPARK-16237?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Vladimir Feinberg updated SPARK-16237:
--
Description:
To maintain feature parity, {{gapply}} functionality should be added to
[
https://issues.apache.org/jira/browse/SPARK-16263?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Vladimir Feinberg updated SPARK-16263:
--
Description:
The following use case demonstrates the issue.
cls.spark =
[
https://issues.apache.org/jira/browse/SPARK-16263?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Vladimir Feinberg updated SPARK-16263:
--
Description:
The following use case demonstrates the issue. Note that as a workaround
Vladimir Feinberg created SPARK-16263:
-
Summary: SparkSession caches configuration in an unintuitive global
way
Key: SPARK-16263
URL: https://issues.apache.org/jira/browse/SPARK-16263
Project:
Vladimir Feinberg created SPARK-16262:
-
Summary: Impossible to remake new SparkContext using SparkSession
API in Pyspark
Key: SPARK-16262
URL: https://issues.apache.org/jira/browse/SPARK-16262
[
https://issues.apache.org/jira/browse/SPARK-16263?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Vladimir Feinberg updated SPARK-16263:
--
Description: The following use case demonstrates the issue.
> SparkSession caches
[
https://issues.apache.org/jira/browse/SPARK-16262?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15353642#comment-15353642
]
Vladimir Feinberg commented on SPARK-16262:
---
Ah, are you suggesting that line should be inside
[
https://issues.apache.org/jira/browse/SPARK-16263?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15353658#comment-15353658
]
Vladimir Feinberg commented on SPARK-16263:
---
Right, I'm not arguing for the need for multiple
[
https://issues.apache.org/jira/browse/SPARK-16262?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15353630#comment-15353630
]
Vladimir Feinberg commented on SPARK-16262:
---
What do you mean by "clearing that variable"? Are
[
https://issues.apache.org/jira/browse/SPARK-16262?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15353659#comment-15353659
]
Vladimir Feinberg commented on SPARK-16262:
---
Sure, I think we're agreeing.
> Impossible to
[
https://issues.apache.org/jira/browse/SPARK-4240?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15366502#comment-15366502
]
Vladimir Feinberg commented on SPARK-4240:
--
Pending some dramatic response from [~sethah]
[
https://issues.apache.org/jira/browse/SPARK-4240?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15366502#comment-15366502
]
Vladimir Feinberg edited comment on SPARK-4240 at 7/7/16 6:03 PM:
--
[
https://issues.apache.org/jira/browse/SPARK-4240?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15362721#comment-15362721
]
Vladimir Feinberg commented on SPARK-4240:
--
Sorry for delay in response - I was on vacation for
Vladimir Feinberg created SPARK-16920:
-
Summary: Investigate and fix issues introduced in SPARK-15858
Key: SPARK-16920
URL: https://issues.apache.org/jira/browse/SPARK-16920
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-16899?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Vladimir Feinberg updated SPARK-16899:
--
Description:
The structured streaming checkpointing example at the bottom of the page
Vladimir Feinberg created SPARK-16899:
-
Summary: Structured Streaming Checkpointing Example invalid
Key: SPARK-16899
URL: https://issues.apache.org/jira/browse/SPARK-16899
Project: Spark
Vladimir Feinberg created SPARK-16900:
-
Summary: Complete-mode output for file sinks
Key: SPARK-16900
URL: https://issues.apache.org/jira/browse/SPARK-16900
Project: Spark
Issue Type:
Vladimir Feinberg created SPARK-16957:
-
Summary: Use weighted midpoints for split values.
Key: SPARK-16957
URL: https://issues.apache.org/jira/browse/SPARK-16957
Project: Spark
Issue
[
https://issues.apache.org/jira/browse/SPARK-12381?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15412459#comment-15412459
]
Vladimir Feinberg commented on SPARK-12381:
---
Yeah, that'd be a good idea.
> Copy public
[
https://issues.apache.org/jira/browse/SPARK-12381?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15412438#comment-15412438
]
Vladimir Feinberg commented on SPARK-12381:
---
[~sethah] Just so we don't clash, I think these
[
https://issues.apache.org/jira/browse/SPARK-16957?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Vladimir Feinberg updated SPARK-16957:
--
Issue Type: Improvement (was: Sub-task)
Parent: (was: SPARK-14045)
> Use
Vladimir Feinberg created SPARK-16969:
-
Summary: GBTClassifier needs a raw prediction column
Key: SPARK-16969
URL: https://issues.apache.org/jira/browse/SPARK-16969
Project: Spark
Issue
Vladimir Feinberg created SPARK-16860:
-
Summary: UDT Stringification Incorrect in PySpark
Key: SPARK-16860
URL: https://issues.apache.org/jira/browse/SPARK-16860
Project: Spark
Issue
[
https://issues.apache.org/jira/browse/SPARK-4240?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15357868#comment-15357868
]
Vladimir Feinberg commented on SPARK-4240:
--
[~sethah] Hi Seth, it seems like your comment is
[
https://issues.apache.org/jira/browse/SPARK-15575?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15450836#comment-15450836
]
Vladimir Feinberg commented on SPARK-15575:
---
Some of the biggest issues with Breeze perf I've
[
https://issues.apache.org/jira/browse/SPARK-16728?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Vladimir Feinberg updated SPARK-16728:
--
Description:
Currently, spark.ml trees rely on spark.mllib implementations. There are
64 matches