[
https://issues.apache.org/jira/browse/SPARK-10466?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-10466:
Assignee: (was: Apache Spark)
> UnsafeRow exception in Sort-Based Shuffle with data
[
https://issues.apache.org/jira/browse/SPARK-10466?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14733178#comment-14733178
]
Apache Spark commented on SPARK-10466:
--
User 'chenghao-intel' has created a pull request for this
[
https://issues.apache.org/jira/browse/SPARK-10466?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-10466:
Assignee: Apache Spark
> UnsafeRow exception in Sort-Based Shuffle with data spill
>
[
https://issues.apache.org/jira/browse/SPARK-10288?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14733223#comment-14733223
]
Saisai Shao commented on SPARK-10288:
-
Thanks a lot Marcelo for your comments, indeed these things
holdenk created SPARK-10469:
---
Summary: Document tungsten-sort
Key: SPARK-10469
URL: https://issues.apache.org/jira/browse/SPARK-10469
Project: Spark
Issue Type: Documentation
Reporter:
[
https://issues.apache.org/jira/browse/SPARK-10273?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14733143#comment-14733143
]
Apache Spark commented on SPARK-10273:
--
User 'noel-smith' has created a pull request for this issue:
Cheng Hao created SPARK-10466:
-
Summary: UnsafeRow exception in Sort-Based Shuffle with data spill
Key: SPARK-10466
URL: https://issues.apache.org/jira/browse/SPARK-10466
Project: Spark
Issue
[
https://issues.apache.org/jira/browse/SPARK-10467?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Maciej Szymkiewicz updated SPARK-10467:
---
Description:
If we take a row from a data frame and try to extract vector element by
[
https://issues.apache.org/jira/browse/SPARK-10467?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Maciej Szymkiewicz updated SPARK-10467:
---
Description:
If we take a row from a data frame and try to extract vector element by
[
https://issues.apache.org/jira/browse/SPARK-10449?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-10449:
Assignee: (was: Apache Spark)
> StructType.merge shouldn't merge DecimalTypes with
[
https://issues.apache.org/jira/browse/SPARK-10449?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14733163#comment-14733163
]
Apache Spark commented on SPARK-10449:
--
User 'holdenk' has created a pull request for this issue:
Vinod KC created SPARK-10468:
Summary: Verify schema before Dataframe select API call
Key: SPARK-10468
URL: https://issues.apache.org/jira/browse/SPARK-10468
Project: Spark
Issue Type:
[
https://issues.apache.org/jira/browse/SPARK-8793?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14733135#comment-14733135
]
Alexey Grishchenko commented on SPARK-8793:
---
This error usually means that you are using
[
https://issues.apache.org/jira/browse/SPARK-10462?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14733136#comment-14733136
]
Joseph E. Gonzalez commented on SPARK-10462:
Good question. I raised the issue here since it
[
https://issues.apache.org/jira/browse/SPARK-9642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14732533#comment-14732533
]
Apache Spark commented on SPARK-9642:
-
User 'rotationsymmetry' has created a pull request for this
[
https://issues.apache.org/jira/browse/SPARK-9642?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-9642:
---
Assignee: (was: Apache Spark)
> LinearRegression should supported weighted data
>
[
https://issues.apache.org/jira/browse/SPARK-9642?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-9642:
---
Assignee: Apache Spark
> LinearRegression should supported weighted data
>
Maciej Szymkiewicz created SPARK-10467:
--
Summary: Vector is converted to tuple when extracted from Row
using __getitem__
Key: SPARK-10467
URL: https://issues.apache.org/jira/browse/SPARK-10467
[
https://issues.apache.org/jira/browse/SPARK-10273?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-10273:
Assignee: (was: Apache Spark)
> Add @since annotation to pyspark.mllib.feature
>
[
https://issues.apache.org/jira/browse/SPARK-10273?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-10273:
Assignee: Apache Spark
> Add @since annotation to pyspark.mllib.feature
>
[
https://issues.apache.org/jira/browse/SPARK-3369?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14733175#comment-14733175
]
Ryan Schmitt commented on SPARK-3369:
-
I've found a reasonably elegant workaround for this issue
[
https://issues.apache.org/jira/browse/SPARK-10468?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-10468:
Assignee: Apache Spark
> Verify schema before Dataframe select API call
>
[
https://issues.apache.org/jira/browse/SPARK-10468?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-10468:
Assignee: (was: Apache Spark)
> Verify schema before Dataframe select API call
>
[
https://issues.apache.org/jira/browse/SPARK-10468?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14733242#comment-14733242
]
Apache Spark commented on SPARK-10468:
--
User 'vinodkc' has created a pull request for this issue:
[
https://issues.apache.org/jira/browse/SPARK-10449?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-10449:
Assignee: Apache Spark
> StructType.merge shouldn't merge DecimalTypes with different
[
https://issues.apache.org/jira/browse/SPARK-10310?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Yi Zhou updated SPARK-10310:
Summary: [Spark SQL] All result records will be popluated into ONE line
during the script transform due to
[
https://issues.apache.org/jira/browse/SPARK-10310?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14733195#comment-14733195
]
Yi Zhou commented on SPARK-10310:
-
Hi [~marmbrus]
Could you please help to review and evaluate this
[
https://issues.apache.org/jira/browse/SPARK-10467?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Maciej Szymkiewicz updated SPARK-10467:
---
Description:
If we take a row from a data frame and try to extract vector element by
[
https://issues.apache.org/jira/browse/SPARK-10362?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14732537#comment-14732537
]
Alexey Grishchenko commented on SPARK-10362:
_createDataFrame()_ in Python, when called for
[
https://issues.apache.org/jira/browse/SPARK-10399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14733169#comment-14733169
]
Paul Wais commented on SPARK-10399:
---
Image processing is a great use case. I've deployed a JNA-based
[
https://issues.apache.org/jira/browse/SPARK-10467?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Maciej Szymkiewicz updated SPARK-10467:
---
Description:
{code}
from pyspark.ml.feature import HashingTF
df =
[
https://issues.apache.org/jira/browse/SPARK-10249?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14732216#comment-14732216
]
Apache Spark commented on SPARK-10249:
--
User 'hhbyyh' has created a pull request for this issue:
[
https://issues.apache.org/jira/browse/SPARK-10462?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14732300#comment-14732300
]
Sean Owen commented on SPARK-10462:
---
Meta-question -- since EC2 support has moved out of Spark, is
Yanbo Liang created SPARK-10464:
---
Summary: Add WeibullGenerator for RandomDataGenerator
Key: SPARK-10464
URL: https://issues.apache.org/jira/browse/SPARK-10464
Project: Spark
Issue Type:
[
https://issues.apache.org/jira/browse/SPARK-10464?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-10464:
Assignee: (was: Apache Spark)
> Add WeibullGenerator for RandomDataGenerator
>
[
https://issues.apache.org/jira/browse/SPARK-10464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14732305#comment-14732305
]
Apache Spark commented on SPARK-10464:
--
User 'yanboliang' has created a pull request for this issue:
[
https://issues.apache.org/jira/browse/SPARK-10464?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Yanbo Liang updated SPARK-10464:
Component/s: (was: ML)
MLlib
> Add WeibullGenerator for RandomDataGenerator
>
[
https://issues.apache.org/jira/browse/SPARK-10464?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-10464:
Assignee: Apache Spark
> Add WeibullGenerator for RandomDataGenerator
>
[
https://issues.apache.org/jira/browse/SPARK-10249?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-10249:
Assignee: Apache Spark
> Add Python Code Example to StopWordsRemover User Guide
>
[
https://issues.apache.org/jira/browse/SPARK-10249?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-10249:
Assignee: (was: Apache Spark)
> Add Python Code Example to StopWordsRemover User
[
https://issues.apache.org/jira/browse/SPARK-9666?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14732235#comment-14732235
]
yuhao yang commented on SPARK-9666:
---
Sure. Thanks.
> ML 1.5 QA: model save/load audit
>
Adrian Wang created SPARK-10463:
---
Summary: remove PromotePrecision during optimization
Key: SPARK-10463
URL: https://issues.apache.org/jira/browse/SPARK-10463
Project: Spark
Issue Type:
[
https://issues.apache.org/jira/browse/SPARK-10463?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-10463:
Assignee: Apache Spark
> remove PromotePrecision during optimization
>
[
https://issues.apache.org/jira/browse/SPARK-10463?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-10463:
Assignee: (was: Apache Spark)
> remove PromotePrecision during optimization
>
[
https://issues.apache.org/jira/browse/SPARK-10463?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14732293#comment-14732293
]
Apache Spark commented on SPARK-10463:
--
User 'adrian-wang' has created a pull request for this
[
https://issues.apache.org/jira/browse/SPARK-10433?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14732302#comment-14732302
]
Sean Owen commented on SPARK-10433:
---
Quite possible; would that have resulted in excessively large
[
https://issues.apache.org/jira/browse/SPARK-10463?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sean Owen updated SPARK-10463:
--
Component/s: SQL
> remove PromotePrecision during optimization
>
[
https://issues.apache.org/jira/browse/SPARK-8696?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14732240#comment-14732240
]
yuhao yang commented on SPARK-8696:
---
Hi [~josephkb], I got a prototype on this. Is this a desirable
[
https://issues.apache.org/jira/browse/SPARK-10094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14732317#comment-14732317
]
Apache Spark commented on SPARK-10094:
--
User 'noel-smith' has created a pull request for this issue:
[
https://issues.apache.org/jira/browse/SPARK-10094?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-10094:
Assignee: (was: Apache Spark)
> Mark ML PySpark feature transformers as Experimental
[
https://issues.apache.org/jira/browse/SPARK-10094?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-10094:
Assignee: Apache Spark
> Mark ML PySpark feature transformers as Experimental to match
[
https://issues.apache.org/jira/browse/SPARK-10249?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14732220#comment-14732220
]
yuhao yang commented on SPARK-10249:
Thanks [~fliang] for creating the jira.
> Add Python Code
[
https://issues.apache.org/jira/browse/SPARK-9096?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14732230#comment-14732230
]
Lior Chaga commented on SPARK-9096:
---
I would suggest the following alternative to changing partitioner:
[
https://issues.apache.org/jira/browse/SPARK-10465?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Anita Tailor updated SPARK-10465:
-
Description:
Currently there is no API in graphX to calculate shortest path between two
[
https://issues.apache.org/jira/browse/SPARK-10465?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Anita Tailor updated SPARK-10465:
-
Summary: Shortest Path between two vertices, using distance and results
carries shortest path
[
https://issues.apache.org/jira/browse/SPARK-10433?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14732392#comment-14732392
]
DB Tsai commented on SPARK-10433:
-
++1 I saw exactly the same symptom and I was wondering the same
[
https://issues.apache.org/jira/browse/SPARK-8630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14732408#comment-14732408
]
Shixiong Zhu commented on SPARK-8630:
-
Sent a PR to output a warning:
[
https://issues.apache.org/jira/browse/SPARK-10071?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14732406#comment-14732406
]
Apache Spark commented on SPARK-10071:
--
User 'zsxwing' has created a pull request for this issue:
[
https://issues.apache.org/jira/browse/SPARK-10071?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-10071:
Assignee: (was: Apache Spark)
> QueueInputDStream Should Allow Checkpointing
>
[
https://issues.apache.org/jira/browse/SPARK-10071?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-10071:
Assignee: Apache Spark
> QueueInputDStream Should Allow Checkpointing
>
[
https://issues.apache.org/jira/browse/SPARK-10148?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Shixiong Zhu resolved SPARK-10148.
--
Resolution: Fixed
Fix Version/s: 1.5.0
> Display active and inactive receiver numbers
[
https://issues.apache.org/jira/browse/SPARK-10071?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Shixiong Zhu updated SPARK-10071:
-
Affects Version/s: 1.5.0
> QueueInputDStream Should Allow Checkpointing
>
[
https://issues.apache.org/jira/browse/SPARK-10465?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Anita Tailor updated SPARK-10465:
-
Description:
Currently there is no API in graphX to calculate shortest path between two
vertex
Anita Tailor created SPARK-10465:
Summary: Shortest Path between two vertex, using distance and
results carries shortest path and distance
Key: SPARK-10465
URL: https://issues.apache.org/jira/browse/SPARK-10465
[
https://issues.apache.org/jira/browse/SPARK-9096?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14732358#comment-14732358
]
Alexey Pechorin commented on SPARK-9096:
This problem has happened to me as well, while using
[
https://issues.apache.org/jira/browse/SPARK-9834?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14732440#comment-14732440
]
DB Tsai commented on SPARK-9834:
In fact, for linear regression, if the # of features is small, X^TX is
[
https://issues.apache.org/jira/browse/SPARK-10465?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-10465:
Assignee: Apache Spark
> Shortest Path between two vertices, using distance and results
[
https://issues.apache.org/jira/browse/SPARK-10465?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-10465:
Assignee: (was: Apache Spark)
> Shortest Path between two vertices, using distance
[
https://issues.apache.org/jira/browse/SPARK-10465?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14732416#comment-14732416
]
Apache Spark commented on SPARK-10465:
--
User 'anitaguavus' has created a pull request for this
[
https://issues.apache.org/jira/browse/SPARK-10269?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14732448#comment-14732448
]
Apache Spark commented on SPARK-10269:
--
User 'noel-smith' has created a pull request for this issue:
[
https://issues.apache.org/jira/browse/SPARK-10269?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-10269:
Assignee: Apache Spark
> Add @since annotation to pyspark.mllib.classification
>
[
https://issues.apache.org/jira/browse/SPARK-10269?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-10269:
Assignee: (was: Apache Spark)
> Add @since annotation to pyspark.mllib.classification
[
https://issues.apache.org/jira/browse/SPARK-10271?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-10271:
Assignee: (was: Apache Spark)
> Add @since annotation to pyspark.mllib.clustering
>
[
https://issues.apache.org/jira/browse/SPARK-10271?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14732506#comment-14732506
]
Apache Spark commented on SPARK-10271:
--
User 'noel-smith' has created a pull request for this issue:
[
https://issues.apache.org/jira/browse/SPARK-10271?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-10271:
Assignee: Apache Spark
> Add @since annotation to pyspark.mllib.clustering
>
[
https://issues.apache.org/jira/browse/SPARK-10272?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-10272:
Assignee: (was: Apache Spark)
> Add @since annotation to pyspark.mllib.evaluation
>
[
https://issues.apache.org/jira/browse/SPARK-10272?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14732513#comment-14732513
]
Apache Spark commented on SPARK-10272:
--
User 'noel-smith' has created a pull request for this issue:
77 matches
Mail list logo