[jira] [Commented] (SPARK-24357) createDataFrame in Python infers large integers as long type and then fails silently when converting them

2018-06-06 Thread Joel Croteau (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24357?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16503835#comment-16503835 ] Joel Croteau commented on SPARK-24357: -- [~viirya], yes that's what I said. What I am saying is that

[jira] [Commented] (SPARK-24358) createDataFrame in Python 3 should be able to infer bytes type as Binary type

2018-05-24 Thread Joel Croteau (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24358?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16489507#comment-16489507 ] Joel Croteau commented on SPARK-24358: -- This does mean that the current implementation has some

[jira] [Commented] (SPARK-24358) createDataFrame in Python 3 should be able to infer bytes type as Binary type

2018-05-23 Thread Joel Croteau (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24358?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16488090#comment-16488090 ] Joel Croteau commented on SPARK-24358: -- This may be trickier than I first thought. In Python 2,

[jira] [Commented] (SPARK-24358) createDataFrame in Python 3 should be able to infer bytes type as Binary type

2018-05-22 Thread Joel Croteau (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24358?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16486594#comment-16486594 ] Joel Croteau commented on SPARK-24358: -- Done. > createDataFrame in Python 3 should be able to infer

[jira] [Updated] (SPARK-24358) createDataFrame in Python 3 should be able to infer bytes type as Binary type

2018-05-22 Thread Joel Croteau (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24358?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joel Croteau updated SPARK-24358: - Labels: Python3 (was: ) Description: createDataFrame can infer Python 3's bytearray

[jira] [Comment Edited] (SPARK-24358) createDataFrame in Python should be able to infer bytes type as Binary type

2018-05-22 Thread Joel Croteau (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24358?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16486581#comment-16486581 ] Joel Croteau edited comment on SPARK-24358 at 5/23/18 1:47 AM: --- No, I mean

[jira] [Commented] (SPARK-24358) createDataFrame in Python should be able to infer bytes type as Binary type

2018-05-22 Thread Joel Croteau (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24358?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16486581#comment-16486581 ] Joel Croteau commented on SPARK-24358: -- No, I mean the bytes type in Python 3. This code:

[jira] [Commented] (SPARK-24357) createDataFrame in Python infers large integers as long type and then fails silently when converting them

2018-05-22 Thread Joel Croteau (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24357?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16486571#comment-16486571 ] Joel Croteau commented on SPARK-24357: -- Fair enough, here is some code to reproduce it:

[jira] [Created] (SPARK-24358) createDataFrame in Python should be able to infer bytes type as Binary type

2018-05-22 Thread Joel Croteau (JIRA)
Joel Croteau created SPARK-24358: Summary: createDataFrame in Python should be able to infer bytes type as Binary type Key: SPARK-24358 URL: https://issues.apache.org/jira/browse/SPARK-24358 Project:

[jira] [Created] (SPARK-24357) createDataFrame in Python infers large integers as long type and then fails silently when converting them

2018-05-22 Thread Joel Croteau (JIRA)
Joel Croteau created SPARK-24357: Summary: createDataFrame in Python infers large integers as long type and then fails silently when converting them Key: SPARK-24357 URL:

[jira] [Created] (SPARK-22430) Unknown tag warnings when building R docs with Roxygen 6.0.1

2017-11-02 Thread Joel Croteau (JIRA)
Joel Croteau created SPARK-22430: Summary: Unknown tag warnings when building R docs with Roxygen 6.0.1 Key: SPARK-22430 URL: https://issues.apache.org/jira/browse/SPARK-22430 Project: Spark

[jira] [Created] (SPARK-22415) lint-r fails if lint-r.R installs any new packages

2017-11-01 Thread Joel Croteau (JIRA)
Joel Croteau created SPARK-22415: Summary: lint-r fails if lint-r.R installs any new packages Key: SPARK-22415 URL: https://issues.apache.org/jira/browse/SPARK-22415 Project: Spark Issue

[jira] [Created] (SPARK-22376) run-tests.py fails at exec-sbt if run with Python 3

2017-10-27 Thread Joel Croteau (JIRA)
Joel Croteau created SPARK-22376: Summary: run-tests.py fails at exec-sbt if run with Python 3 Key: SPARK-22376 URL: https://issues.apache.org/jira/browse/SPARK-22376 Project: Spark Issue

[jira] [Created] (SPARK-22375) Test script can fail if eggs are installed by setup.py during test process

2017-10-27 Thread Joel Croteau (JIRA)
Joel Croteau created SPARK-22375: Summary: Test script can fail if eggs are installed by setup.py during test process Key: SPARK-22375 URL: https://issues.apache.org/jira/browse/SPARK-22375 Project:

[jira] [Created] (SPARK-22353) ResultIterable should be indexable

2017-10-25 Thread Joel Croteau (JIRA)
Joel Croteau created SPARK-22353: Summary: ResultIterable should be indexable Key: SPARK-22353 URL: https://issues.apache.org/jira/browse/SPARK-22353 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-22338) namedtuple serialization is inefficient

2017-10-23 Thread Joel Croteau (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22338?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joel Croteau updated SPARK-22338: - Description: I greatly appreciate the level of hack that PySpark contains in order to make

[jira] [Created] (SPARK-22338) namedtuple serialization is inefficient

2017-10-23 Thread Joel Croteau (JIRA)
Joel Croteau created SPARK-22338: Summary: namedtuple serialization is inefficient Key: SPARK-22338 URL: https://issues.apache.org/jira/browse/SPARK-22338 Project: Spark Issue Type:

[jira] [Commented] (SPARK-22209) PySpark does not recognize imports from submodules

2017-10-23 Thread Joel Croteau (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22209?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16215922#comment-16215922 ] Joel Croteau commented on SPARK-22209: -- I don't really have time to look for the exact cause, so if

[jira] [Commented] (SPARK-22209) PySpark does not recognize imports from submodules

2017-10-19 Thread Joel Croteau (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22209?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16212027#comment-16212027 ] Joel Croteau commented on SPARK-22209: -- Yes [~bryanc], I know I can do that. It's just annoying that

[jira] [Created] (SPARK-22209) PySpark does not recognize imports from submodules

2017-10-05 Thread Joel Croteau (JIRA)
Joel Croteau created SPARK-22209: Summary: PySpark does not recognize imports from submodules Key: SPARK-22209 URL: https://issues.apache.org/jira/browse/SPARK-22209 Project: Spark Issue