[
https://issues.apache.org/jira/browse/SPARK-2620?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Maciej Szymkiewicz updated SPARK-2620:
--
Affects Version/s: 2.1.0
> case class cannot be used as key for reduce
> ---
[
https://issues.apache.org/jira/browse/SPARK-2620?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Maciej Szymkiewicz updated SPARK-2620:
--
Affects Version/s: 1.6.0
2.0.0
> case class cannot be used as key
Maciej Szymkiewicz created SPARK-17587:
--
Summary: SparseVector __getitem__ should follow __getitem__
contract
Key: SPARK-17587
URL: https://issues.apache.org/jira/browse/SPARK-17587
Project: Spar
[
https://issues.apache.org/jira/browse/SPARK-17027?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15418071#comment-15418071
]
Maciej Szymkiewicz commented on SPARK-17027:
Yes, this exactly the problem.
[
https://issues.apache.org/jira/browse/SPARK-17027?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15418071#comment-15418071
]
Maciej Szymkiewicz edited comment on SPARK-17027 at 8/11/16 10:38 PM:
-
Maciej Szymkiewicz created SPARK-17027:
--
Summary: PolynomialExpansion.choose is prone to integer overflow
Key: SPARK-17027
URL: https://issues.apache.org/jira/browse/SPARK-17027
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-12157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15401190#comment-15401190
]
Maciej Szymkiewicz commented on SPARK-12157:
Well, it is alpha component (see
[
https://issues.apache.org/jira/browse/SPARK-14155?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15401156#comment-15401156
]
Maciej Szymkiewicz commented on SPARK-14155:
[~rxin] Is there any progress on
[
https://issues.apache.org/jira/browse/SPARK-12157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15401153#comment-15401153
]
Maciej Szymkiewicz commented on SPARK-12157:
[~nchammas]You're using incorrec
[
https://issues.apache.org/jira/browse/SPARK-16589?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15391108#comment-15391108
]
Maciej Szymkiewicz commented on SPARK-16589:
[~holdenk] Makes sense. I was th
[
https://issues.apache.org/jira/browse/SPARK-16589?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Maciej Szymkiewicz updated SPARK-16589:
---
Description:
Chaining cartesian calls in PySpark results in the number of records low
[
https://issues.apache.org/jira/browse/SPARK-16589?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Maciej Szymkiewicz updated SPARK-16589:
---
Affects Version/s: 1.4.0
1.5.0
> Chained cartesian produces in
[
https://issues.apache.org/jira/browse/SPARK-16589?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15384235#comment-15384235
]
Maciej Szymkiewicz commented on SPARK-16589:
Thanks [~dongjoon].
[~joshrosen
Maciej Szymkiewicz created SPARK-16626:
--
Summary: Code duplication after SPARK-14906
Key: SPARK-16626
URL: https://issues.apache.org/jira/browse/SPARK-16626
Project: Spark
Issue Type: Im
[
https://issues.apache.org/jira/browse/SPARK-16589?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15382614#comment-15382614
]
Maciej Szymkiewicz commented on SPARK-16589:
[~dongjoon] I'll work on that bu
Maciej Szymkiewicz created SPARK-16589:
--
Summary: Chained cartesian produces incorrect number of records
Key: SPARK-16589
URL: https://issues.apache.org/jira/browse/SPARK-16589
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-15559?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Maciej Szymkiewicz updated SPARK-15559:
---
Description:
In Python 3.x any object that provides eq method requires hash method f
[
https://issues.apache.org/jira/browse/SPARK-15559?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Maciej Szymkiewicz updated SPARK-15559:
---
Description:
In Python 3.x any object that provides {{__eq__}} method requires {{__ha
[
https://issues.apache.org/jira/browse/SPARK-15559?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Maciej Szymkiewicz updated SPARK-15559:
---
Description:
In Python 3.x any object that provides `__eq__` method requires {__hash_
Maciej Szymkiewicz created SPARK-15559:
--
Summary: TopicAndPartition should provide __hash__ method
Key: SPARK-15559
URL: https://issues.apache.org/jira/browse/SPARK-15559
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-14739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15249061#comment-15249061
]
Maciej Szymkiewicz commented on SPARK-14739:
I extracted relevant test fixes
[
https://issues.apache.org/jira/browse/SPARK-14739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15249053#comment-15249053
]
Maciej Szymkiewicz edited comment on SPARK-14739 at 4/20/16 12:47 AM:
-
[
https://issues.apache.org/jira/browse/SPARK-14739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15249053#comment-15249053
]
Maciej Szymkiewicz commented on SPARK-14739:
Sure, but your latest PR still d
[
https://issues.apache.org/jira/browse/SPARK-14739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15248994#comment-15248994
]
Maciej Szymkiewicz commented on SPARK-14739:
This solves only small part of t
Maciej Szymkiewicz created SPARK-14739:
--
Summary: Vectors.parse doesn't handle dense vectors of size 0 and
sparse vectros with no indices
Key: SPARK-14739
URL: https://issues.apache.org/jira/browse/SPARK-1473
[
https://issues.apache.org/jira/browse/SPARK-14202?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Maciej Szymkiewicz updated SPARK-14202:
---
Affects Version/s: (was: 1.3.0)
> python_full_outer_join should use generator exp
Maciej Szymkiewicz created SPARK-14202:
--
Summary: python_full_outer_join should use generator expression
instead of list comp
Key: SPARK-14202
URL: https://issues.apache.org/jira/browse/SPARK-14202
[
https://issues.apache.org/jira/browse/SPARK-12916?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15206030#comment-15206030
]
Maciej Szymkiewicz commented on SPARK-12916:
Since PySpark `Row` is just a su
Maciej Szymkiewicz created SPARK-14058:
--
Summary: Incorrect docstring in Window.orderBy
Key: SPARK-14058
URL: https://issues.apache.org/jira/browse/SPARK-14058
Project: Spark
Issue Type
[
https://issues.apache.org/jira/browse/SPARK-12824?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15098738#comment-15098738
]
Maciej Szymkiewicz edited comment on SPARK-12824 at 1/14/16 7:56 PM:
--
[
https://issues.apache.org/jira/browse/SPARK-12824?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15098738#comment-15098738
]
Maciej Szymkiewicz edited comment on SPARK-12824 at 1/14/16 7:55 PM:
--
[
https://issues.apache.org/jira/browse/SPARK-12824?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15098738#comment-15098738
]
Maciej Szymkiewicz edited comment on SPARK-12824 at 1/14/16 7:51 PM:
--
[
https://issues.apache.org/jira/browse/SPARK-12824?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15098738#comment-15098738
]
Maciej Szymkiewicz commented on SPARK-12824:
??It seems that all the keys in
[
https://issues.apache.org/jira/browse/SPARK-7683?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Maciej Szymkiewicz updated SPARK-7683:
--
Comment: was deleted
(was: [~srowen] Do you have any example how it could break existing
[
https://issues.apache.org/jira/browse/SPARK-7683?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15076650#comment-15076650
]
Maciej Szymkiewicz commented on SPARK-7683:
---
[~srowen] Do you have any example h
Maciej Szymkiewicz created SPARK-12595:
--
Summary: fold should pass arguments to op in the correct order
Key: SPARK-12595
URL: https://issues.apache.org/jira/browse/SPARK-12595
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-6459?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15074342#comment-15074342
]
Maciej Szymkiewicz commented on SPARK-6459:
---
Thanks for clarification.
> Warn w
[
https://issues.apache.org/jira/browse/SPARK-6459?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15074322#comment-15074322
]
Maciej Szymkiewicz commented on SPARK-6459:
---
I've been trying to reproduce the p
[
https://issues.apache.org/jira/browse/SPARK-6459?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15074268#comment-15074268
]
Maciej Szymkiewicz commented on SPARK-6459:
---
[~marmbrus] Isn't this warning obso
[
https://issues.apache.org/jira/browse/SPARK-9137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15029170#comment-15029170
]
Maciej Szymkiewicz commented on SPARK-9137:
---
[~josephkb] Could you take a look a
[
https://issues.apache.org/jira/browse/SPARK-12006?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Maciej Szymkiewicz updated SPARK-12006:
---
Description:
Steps to reproduce :
{code}
from pyspark.mllib.clustering import Gaussi
Maciej Szymkiewicz created SPARK-12006:
--
Summary: GaussianMixture.train crashes if an itnital model is not
None
Key: SPARK-12006
URL: https://issues.apache.org/jira/browse/SPARK-12006
Project: Sp
[
https://issues.apache.org/jira/browse/SPARK-11281?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15007038#comment-15007038
]
Maciej Szymkiewicz commented on SPARK-11281:
[~shivaram] I've tested both cur
[
https://issues.apache.org/jira/browse/SPARK-11281?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Maciej Szymkiewicz updated SPARK-11281:
---
Comment: was deleted
(was: [~sunrui], [~shivaram] I don't think it is resolved by [SP
[
https://issues.apache.org/jira/browse/SPARK-11281?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15006970#comment-15006970
]
Maciej Szymkiewicz commented on SPARK-11281:
[~sunrui], [~shivaram] I don't t
[
https://issues.apache.org/jira/browse/SPARK-11281?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15006960#comment-15006960
]
Maciej Szymkiewicz commented on SPARK-11281:
[~shivaram] No, there isn't. I r
[
https://issues.apache.org/jira/browse/SPARK-11086?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15006292#comment-15006292
]
Maciej Szymkiewicz commented on SPARK-11086:
[~shivaram] Does it resolve [SPA
[
https://issues.apache.org/jira/browse/SPARK-11530?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14995874#comment-14995874
]
Maciej Szymkiewicz commented on SPARK-11530:
It should actually target MLlib,
[
https://issues.apache.org/jira/browse/SPARK-11569?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14995580#comment-14995580
]
Maciej Szymkiewicz commented on SPARK-11569:
It looks this problem affects Sc
Maciej Szymkiewicz created SPARK-11569:
--
Summary: StringIndexer transform fails when column contains nulls
Key: SPARK-11569
URL: https://issues.apache.org/jira/browse/SPARK-11569
Project: Spark
Maciej Szymkiewicz created SPARK-11283:
--
Summary: List column gets additional level of nesting when
converted to Spark DataFrame
Key: SPARK-11283
URL: https://issues.apache.org/jira/browse/SPARK-11283
[
https://issues.apache.org/jira/browse/SPARK-11167?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Maciej Szymkiewicz updated SPARK-11167:
---
Comment: was deleted
(was: Related problem: https://issues.apache.org/jira/browse/SPA
[
https://issues.apache.org/jira/browse/SPARK-11167?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14970922#comment-14970922
]
Maciej Szymkiewicz commented on SPARK-11167:
Related problem: https://issues.
Maciej Szymkiewicz created SPARK-11281:
--
Summary: Issue with creating and collecting DataFrame using
environments
Key: SPARK-11281
URL: https://issues.apache.org/jira/browse/SPARK-11281
Project:
[
https://issues.apache.org/jira/browse/SPARK-11167?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14970839#comment-14970839
]
Maciej Szymkiewicz commented on SPARK-11167:
spark-csv has a much simpler job
Maciej Szymkiewicz created SPARK-11167:
--
Summary: Incorrect type resolution on heterogeneous data structures
Key: SPARK-11167
URL: https://issues.apache.org/jira/browse/SPARK-11167
Project: Spark
Maciej Szymkiewicz created SPARK-11086:
--
Summary: createDataFrame should dropFactor column-wise not
cell-wise
Key: SPARK-11086
URL: https://issues.apache.org/jira/browse/SPARK-11086
Project: Spa
Maciej Szymkiewicz created SPARK-11084:
--
Summary: SparseVector.__getitem__ should check if value can be
non-zero before executing searchsorted
Key: SPARK-11084
URL: https://issues.apache.org/jira/browse/SPARK
[
https://issues.apache.org/jira/browse/SPARK-10973?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Maciej Szymkiewicz updated SPARK-10973:
---
External issue URL: https://github.com/apache/spark/pull/9009
> __gettitem__ method t
[
https://issues.apache.org/jira/browse/SPARK-10973?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Maciej Szymkiewicz updated SPARK-10973:
---
External issue URL: (was: https://github.com/apache/spark/pull/9009)
> __gettitem__
Maciej Szymkiewicz created SPARK-10973:
--
Summary: __gettitem__ method throws IndexError exception when we
try to access index after the last non-zero entry.
Key: SPARK-10973
URL: https://issues.apache.org/jir
[
https://issues.apache.org/jira/browse/SPARK-10467?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Maciej Szymkiewicz updated SPARK-10467:
---
Description:
If we take a row from a data frame and try to extract vector element by
[
https://issues.apache.org/jira/browse/SPARK-10467?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Maciej Szymkiewicz updated SPARK-10467:
---
Description:
If we take a row from a data frame and try to extract vector element by
[
https://issues.apache.org/jira/browse/SPARK-10467?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Maciej Szymkiewicz updated SPARK-10467:
---
Description:
If we take a row from a data frame and try to extract vector element by
[
https://issues.apache.org/jira/browse/SPARK-10467?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Maciej Szymkiewicz updated SPARK-10467:
---
Description:
{code}
from pyspark.ml.feature import HashingTF
df = sqlContext.createD
Maciej Szymkiewicz created SPARK-10467:
--
Summary: Vector is converted to tuple when extracted from Row
using __getitem__
Key: SPARK-10467
URL: https://issues.apache.org/jira/browse/SPARK-10467
Pr
[
https://issues.apache.org/jira/browse/SPARK-9978?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Maciej Szymkiewicz updated SPARK-9978:
--
Description:
I am trying to reproduce following SQL query:
{code}
df.registerTempTable(
Maciej Szymkiewicz created SPARK-9978:
-
Summary: Window functions require partitionBy to work as expected
Key: SPARK-9978
URL: https://issues.apache.org/jira/browse/SPARK-9978
Project: Spark
Maciej Szymkiewicz created SPARK-9098:
-
Summary: Inconsistent Dense Vectors hashing between PySpark and
Scala
Key: SPARK-9098
URL: https://issues.apache.org/jira/browse/SPARK-9098
Project: Spark
601 - 669 of 669 matches
Mail list logo