[jira] [Comment Edited] (SPARK-34427) Session window support in SS

2021-02-13 Thread Jungtaek Lim (Jira)


[ 
https://issues.apache.org/jira/browse/SPARK-34427?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17284362#comment-17284362
 ] 

Jungtaek Lim edited comment on SPARK-34427 at 2/14/21, 7:28 AM:


I'd say SPARK-10816 has no active progress so far because of lack of interest. 
Once you found the necessity then SPARK-10816 can be unblocked instead of 
spending another non-trivial time to reinvent the wheel, no?

See SPARK-10816, there're SPIP docs, discussions, ideas around them, even perf 
tests on WIP PRs. These efforts were actually more than a month. What this 
issue provides? It's just a simple link Flink provides. No design doc, no 
implementation, no test. This JIRA issue basically does nothing yet. That said, 
it's far behind than the existing effort. You can't simply say existing PR is 
complicated than you have in mind unless you can prove it via similar sort of 
effort, SPIP doc with design doc.

I pointed out JIRA issue assignee issue because you're trying to take up 
multiple major efforts where there're folks in community want to take anything 
up. You're also assigning yourself on SPARK-34198 which isn't even having a PR 
up for reviewing, right? Don't try to occupy major efforts in parallel and 
block others to even try out.


was (Author: kabhwan):
I'd say SPARK-10816 has no active progress so far because of lack of interest. 
Once you found the necessity then SPARK-10816 can be unblocked instead of 
spending another non-trivial time to reinvent the wheel, no?

See SPARK-10816, there're SPIP docs, discussions, ideas around them, even perf 
tests on WIP PRs. These efforts were actually more than a month. What this 
issue provides? It's just a simple link Flink provides. No design doc, no 
implementation, no test. This JIRA issue basically does nothing yet. That said, 
it's far behind than the existing effort. You can't simply say existing PR is 
complicated than you have in mind unless you can prove it via similar sort of 
effort, SPIP doc with design doc.

I pointed out JIRA issue assignee issue because you're trying to take up 
multiple major efforts where there're folks in community want to take anything 
up. You're also assigning yourself on SPARK-34198 which isn't even having a PR 
up for reviewing, right? Don't try to occupy major efforts in parallel.

> Session window support in SS
> 
>
> Key: SPARK-34427
> URL: https://issues.apache.org/jira/browse/SPARK-34427
> Project: Spark
>  Issue Type: New Feature
>  Components: Structured Streaming
>Affects Versions: 3.2.0
>Reporter: L. C. Hsieh
>Priority: Major
>
> Currently structured streaming supports two kinds of windows: tumbling window 
> and sliding window. Another useful window function is session window. Which 
> is not supported by SS. We have user requirement to use session window. We'd 
> like to have this support in the upstream.
> About session window, there is some info: 
> https://ci.apache.org/projects/flink/flink-docs-stable/dev/stream/operators/windows.html#session-windows.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Commented] (SPARK-34427) Session window support in SS

2021-02-13 Thread Jungtaek Lim (Jira)


[ 
https://issues.apache.org/jira/browse/SPARK-34427?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17284363#comment-17284363
 ] 

Jungtaek Lim commented on SPARK-34427:
--

If you'd like to say assignee is to avoid stepping on others toes, what if I 
assign myself in SPARK-10816 and claim I own the issue and I have more progress 
then this? Once I claim then my claim is quite true, right?

> Session window support in SS
> 
>
> Key: SPARK-34427
> URL: https://issues.apache.org/jira/browse/SPARK-34427
> Project: Spark
>  Issue Type: New Feature
>  Components: Structured Streaming
>Affects Versions: 3.2.0
>Reporter: L. C. Hsieh
>Priority: Major
>
> Currently structured streaming supports two kinds of windows: tumbling window 
> and sliding window. Another useful window function is session window. Which 
> is not supported by SS. We have user requirement to use session window. We'd 
> like to have this support in the upstream.
> About session window, there is some info: 
> https://ci.apache.org/projects/flink/flink-docs-stable/dev/stream/operators/windows.html#session-windows.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Commented] (SPARK-34427) Session window support in SS

2021-02-13 Thread Jungtaek Lim (Jira)


[ 
https://issues.apache.org/jira/browse/SPARK-34427?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17284362#comment-17284362
 ] 

Jungtaek Lim commented on SPARK-34427:
--

I'd say SPARK-10816 has no active progress so far because of lack of interest. 
Once you found the necessity then SPARK-10816 can be unblocked instead of 
spending another non-trivial time to reinvent the wheel, no?

See SPARK-10816, there're SPIP docs, discussions, ideas around them, even perf 
tests on WIP PRs. These efforts were actually more than a month. What this 
issue provides? It's just a simple link Flink provides. No design doc, no 
implementation, no test. This JIRA issue basically does nothing yet. That said, 
it's far behind than the existing effort. You can't simply say existing PR is 
complicated than you have in mind unless you can prove it via similar sort of 
effort, SPIP doc with design doc.

I pointed out JIRA issue assignee issue because you're trying to take up 
multiple major efforts where there're folks in community want to take anything 
up. You're also assigning yourself on SPARK-34198 which isn't even having a PR 
up for reviewing, right? Don't try to occupy major efforts in parallel.

> Session window support in SS
> 
>
> Key: SPARK-34427
> URL: https://issues.apache.org/jira/browse/SPARK-34427
> Project: Spark
>  Issue Type: New Feature
>  Components: Structured Streaming
>Affects Versions: 3.2.0
>Reporter: L. C. Hsieh
>Priority: Major
>
> Currently structured streaming supports two kinds of windows: tumbling window 
> and sliding window. Another useful window function is session window. Which 
> is not supported by SS. We have user requirement to use session window. We'd 
> like to have this support in the upstream.
> About session window, there is some info: 
> https://ci.apache.org/projects/flink/flink-docs-stable/dev/stream/operators/windows.html#session-windows.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Commented] (SPARK-34427) Session window support in SS

2021-02-13 Thread L. C. Hsieh (Jira)


[ 
https://issues.apache.org/jira/browse/SPARK-34427?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17284348#comment-17284348
 ] 

L. C. Hsieh commented on SPARK-34427:
-

If I don't miss anything, SPARK-10816 has no active progress in more than two 
years. I don't know about your intention here, but I don't really see that it 
was actively pushed during such long time. Seems to me the effort was abandoned 
there and it looks totally okay to me that others can work on it, isn't?

About assigning JIRA issue, I'm not sure if you really do not know but 
basically I remember the committers can assign JIRA issues to themselves if 
they are working on it. We don't assign the JIRA issues created by contributors 
to ourselves, because it is really unfair. For JIRA issues created for our 
ongoing work, this is like a convention by other committers too. It is for not 
step on others toes.

If you are really against it in general, maybe you can raise a discussion to 
formally disallow it in Spark community. I'm happy to follow it if we finally 
have a consensus about it. Thanks.

> Session window support in SS
> 
>
> Key: SPARK-34427
> URL: https://issues.apache.org/jira/browse/SPARK-34427
> Project: Spark
>  Issue Type: New Feature
>  Components: Structured Streaming
>Affects Versions: 3.2.0
>Reporter: L. C. Hsieh
>Priority: Major
>
> Currently structured streaming supports two kinds of windows: tumbling window 
> and sliding window. Another useful window function is session window. Which 
> is not supported by SS. We have user requirement to use session window. We'd 
> like to have this support in the upstream.
> About session window, there is some info: 
> https://ci.apache.org/projects/flink/flink-docs-stable/dev/stream/operators/windows.html#session-windows.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Commented] (SPARK-34426) Add driver and executors POD logs to integration tests log when the test fails

2021-02-13 Thread Apache Spark (Jira)


[ 
https://issues.apache.org/jira/browse/SPARK-34426?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17284346#comment-17284346
 ] 

Apache Spark commented on SPARK-34426:
--

User 'attilapiros' has created a pull request for this issue:
https://github.com/apache/spark/pull/31561

> Add driver and executors POD logs to integration tests log when the test fails
> --
>
> Key: SPARK-34426
> URL: https://issues.apache.org/jira/browse/SPARK-34426
> Project: Spark
>  Issue Type: Improvement
>  Components: Kubernetes, Tests
>Affects Versions: 3.2.0
>Reporter: Attila Zsolt Piros
>Priority: Major
>
> Now both the driver and executors logs are lost.
> In https://spark.apache.org/developer-tools.html there is a hint:
> "Getting logs from the pods and containers directly is an exercise left to 
> the reader."
> But when the test is executed by Jenkins and there is failure we really need 
> the POD logs to analyze problem.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Assigned] (SPARK-34426) Add driver and executors POD logs to integration tests log when the test fails

2021-02-13 Thread Apache Spark (Jira)


 [ 
https://issues.apache.org/jira/browse/SPARK-34426?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Apache Spark reassigned SPARK-34426:


Assignee: (was: Apache Spark)

> Add driver and executors POD logs to integration tests log when the test fails
> --
>
> Key: SPARK-34426
> URL: https://issues.apache.org/jira/browse/SPARK-34426
> Project: Spark
>  Issue Type: Improvement
>  Components: Kubernetes, Tests
>Affects Versions: 3.2.0
>Reporter: Attila Zsolt Piros
>Priority: Major
>
> Now both the driver and executors logs are lost.
> In https://spark.apache.org/developer-tools.html there is a hint:
> "Getting logs from the pods and containers directly is an exercise left to 
> the reader."
> But when the test is executed by Jenkins and there is failure we really need 
> the POD logs to analyze problem.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Commented] (SPARK-34426) Add driver and executors POD logs to integration tests log when the test fails

2021-02-13 Thread Apache Spark (Jira)


[ 
https://issues.apache.org/jira/browse/SPARK-34426?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17284345#comment-17284345
 ] 

Apache Spark commented on SPARK-34426:
--

User 'attilapiros' has created a pull request for this issue:
https://github.com/apache/spark/pull/31561

> Add driver and executors POD logs to integration tests log when the test fails
> --
>
> Key: SPARK-34426
> URL: https://issues.apache.org/jira/browse/SPARK-34426
> Project: Spark
>  Issue Type: Improvement
>  Components: Kubernetes, Tests
>Affects Versions: 3.2.0
>Reporter: Attila Zsolt Piros
>Priority: Major
>
> Now both the driver and executors logs are lost.
> In https://spark.apache.org/developer-tools.html there is a hint:
> "Getting logs from the pods and containers directly is an exercise left to 
> the reader."
> But when the test is executed by Jenkins and there is failure we really need 
> the POD logs to analyze problem.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Assigned] (SPARK-34426) Add driver and executors POD logs to integration tests log when the test fails

2021-02-13 Thread Apache Spark (Jira)


 [ 
https://issues.apache.org/jira/browse/SPARK-34426?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Apache Spark reassigned SPARK-34426:


Assignee: Apache Spark

> Add driver and executors POD logs to integration tests log when the test fails
> --
>
> Key: SPARK-34426
> URL: https://issues.apache.org/jira/browse/SPARK-34426
> Project: Spark
>  Issue Type: Improvement
>  Components: Kubernetes, Tests
>Affects Versions: 3.2.0
>Reporter: Attila Zsolt Piros
>Assignee: Apache Spark
>Priority: Major
>
> Now both the driver and executors logs are lost.
> In https://spark.apache.org/developer-tools.html there is a hint:
> "Getting logs from the pods and containers directly is an exercise left to 
> the reader."
> But when the test is executed by Jenkins and there is failure we really need 
> the POD logs to analyze problem.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Commented] (SPARK-25075) Build and test Spark against Scala 2.13

2021-02-13 Thread Dongjoon Hyun (Jira)


[ 
https://issues.apache.org/jira/browse/SPARK-25075?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17284343#comment-17284343
 ] 

Dongjoon Hyun commented on SPARK-25075:
---

Thank you for sharing the progress, [~tisue]. We will try Scala 2.13.5 
definitely.

> Build and test Spark against Scala 2.13
> ---
>
> Key: SPARK-25075
> URL: https://issues.apache.org/jira/browse/SPARK-25075
> Project: Spark
>  Issue Type: Umbrella
>  Components: Build, MLlib, Project Infra, Spark Core, SQL
>Affects Versions: 3.0.0
>Reporter: Guillaume Massé
>Priority: Major
>
> This umbrella JIRA tracks the requirements for building and testing Spark 
> against the current Scala 2.13 milestone.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Commented] (SPARK-34427) Session window support in SS

2021-02-13 Thread Jungtaek Lim (Jira)


[ 
https://issues.apache.org/jira/browse/SPARK-34427?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17284342#comment-17284342
 ] 

Jungtaek Lim commented on SPARK-34427:
--

Just to make clear, SPARK-10816 has SPIP docs from two different groups and 
details/comparison docs as well.

> Session window support in SS
> 
>
> Key: SPARK-34427
> URL: https://issues.apache.org/jira/browse/SPARK-34427
> Project: Spark
>  Issue Type: New Feature
>  Components: Structured Streaming
>Affects Versions: 3.2.0
>Reporter: L. C. Hsieh
>Priority: Major
>
> Currently structured streaming supports two kinds of windows: tumbling window 
> and sliding window. Another useful window function is session window. Which 
> is not supported by SS. We have user requirement to use session window. We'd 
> like to have this support in the upstream.
> About session window, there is some info: 
> https://ci.apache.org/projects/flink/flink-docs-stable/dev/stream/operators/windows.html#session-windows.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Commented] (SPARK-34198) Add RocksDB StateStore as external module

2021-02-13 Thread Jungtaek Lim (Jira)


[ 
https://issues.apache.org/jira/browse/SPARK-34198?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17284336#comment-17284336
 ] 

Jungtaek Lim commented on SPARK-34198:
--

Now we have consensus on having this, which is great. Thanks for raising the 
discussion and gathering consensus.

Which implementation we should take as baseline remains the question. Two 
implementations what I'm aware of are:

1. https://github.com/chermenin/spark-states
2. https://github.com/apache/spark/pull/24922

Ideally it'd be nice to take the first one as it has been known to be the way 
to use RocksDB state store with SS for years. Probably we need to pursue the 
maintainer of the project, but in most cases they tend to be happy with 
donating the code (with simple PR, as it doesn't seem to be tied with specific 
employer) so worth trying out.

Second case is probably easier to continue, as we can leverage existing PR 
without explicit approval from original author. What we should keep in mind is 
"retaining" main authorship, otherwise it wouldn't be a problem to take it over.

Another crazy idea (but probably most stable among all implementations) is 
asking Databricks to donate the commercial implementation of RocksDB state 
store. I guess it should have been used by their customers for years, so proven 
to be stable relatively.

[~rxin] Given you'd also give +1 on adding RocksDB state store to Spark 
codebase, any chance Databricks donate the existing implementation to Spark? Is 
it just me and would it be crazy idea asking this?

> Add RocksDB StateStore as external module
> -
>
> Key: SPARK-34198
> URL: https://issues.apache.org/jira/browse/SPARK-34198
> Project: Spark
>  Issue Type: New Feature
>  Components: Structured Streaming
>Affects Versions: 3.2.0
>Reporter: L. C. Hsieh
>Assignee: L. C. Hsieh
>Priority: Major
>
> Currently Spark SS only has one built-in StateStore implementation 
> HDFSBackedStateStore. Actually it uses in-memory map to store state rows. As 
> there are more and more streaming applications, some of them requires to use 
> large state in stateful operations such as streaming aggregation and join.
> Several other major streaming frameworks already use RocksDB for state 
> management. So it is proven to be good choice for large state usage. But 
> Spark SS still lacks of a built-in state store for the requirement.
> We would like to explore the possibility to add RocksDB-based StateStore into 
> Spark SS. For the concern about adding RocksDB as a direct dependency, our 
> plan is to add this StateStore as an external module first.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Assigned] (SPARK-34427) Session window support in SS

2021-02-13 Thread Jungtaek Lim (Jira)


 [ 
https://issues.apache.org/jira/browse/SPARK-34427?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jungtaek Lim reassigned SPARK-34427:


Assignee: (was: L. C. Hsieh)

> Session window support in SS
> 
>
> Key: SPARK-34427
> URL: https://issues.apache.org/jira/browse/SPARK-34427
> Project: Spark
>  Issue Type: New Feature
>  Components: Structured Streaming
>Affects Versions: 3.2.0
>Reporter: L. C. Hsieh
>Priority: Major
>
> Currently structured streaming supports two kinds of windows: tumbling window 
> and sliding window. Another useful window function is session window. Which 
> is not supported by SS. We have user requirement to use session window. We'd 
> like to have this support in the upstream.
> About session window, there is some info: 
> https://ci.apache.org/projects/flink/flink-docs-stable/dev/stream/operators/windows.html#session-windows.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Resolved] (SPARK-34427) Session window support in SS

2021-02-13 Thread Jungtaek Lim (Jira)


 [ 
https://issues.apache.org/jira/browse/SPARK-34427?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jungtaek Lim resolved SPARK-34427.
--
Resolution: Duplicate

> Session window support in SS
> 
>
> Key: SPARK-34427
> URL: https://issues.apache.org/jira/browse/SPARK-34427
> Project: Spark
>  Issue Type: New Feature
>  Components: Structured Streaming
>Affects Versions: 3.2.0
>Reporter: L. C. Hsieh
>Assignee: L. C. Hsieh
>Priority: Major
>
> Currently structured streaming supports two kinds of windows: tumbling window 
> and sliding window. Another useful window function is session window. Which 
> is not supported by SS. We have user requirement to use session window. We'd 
> like to have this support in the upstream.
> About session window, there is some info: 
> https://ci.apache.org/projects/flink/flink-docs-stable/dev/stream/operators/windows.html#session-windows.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Commented] (SPARK-34427) Session window support in SS

2021-02-13 Thread Jungtaek Lim (Jira)


[ 
https://issues.apache.org/jira/browse/SPARK-34427?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17284334#comment-17284334
 ] 

Jungtaek Lim commented on SPARK-34427:
--

We can close this one and continue from SPARK-10816. This JIRA issue loses all 
inputs and efforts in SPARK-10816 which were worth a month.

For complexity, I can simply push back linked-list version and it'll reduce 
1000+ lines. That was to address one of new requirements in SPARK-10816 and I 
don't think it should be addressed. I'm also OK to revisit [~XuanYuan] and 
decide to take one of twos. Both I and [~XuanYuan] are active in the community, 
so any minor issues could be handled without taking over or new implementation.

One thing I would like to say is, let's not assign the JIRA issue - that is 
against what we do with most JIRA issues, and simply "unfair" to contributors. 
I'd like to see major efforts be well distributed across community.

> Session window support in SS
> 
>
> Key: SPARK-34427
> URL: https://issues.apache.org/jira/browse/SPARK-34427
> Project: Spark
>  Issue Type: New Feature
>  Components: Structured Streaming
>Affects Versions: 3.2.0
>Reporter: L. C. Hsieh
>Assignee: L. C. Hsieh
>Priority: Major
>
> Currently structured streaming supports two kinds of windows: tumbling window 
> and sliding window. Another useful window function is session window. Which 
> is not supported by SS. We have user requirement to use session window. We'd 
> like to have this support in the upstream.
> About session window, there is some info: 
> https://ci.apache.org/projects/flink/flink-docs-stable/dev/stream/operators/windows.html#session-windows.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Commented] (SPARK-34432) add a java implementation for the simple writable data source

2021-02-13 Thread Apache Spark (Jira)


[ 
https://issues.apache.org/jira/browse/SPARK-34432?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17284315#comment-17284315
 ] 

Apache Spark commented on SPARK-34432:
--

User 'kevincmchen' has created a pull request for this issue:
https://github.com/apache/spark/pull/31560

> add a java implementation for the simple writable data source
> -
>
> Key: SPARK-34432
> URL: https://issues.apache.org/jira/browse/SPARK-34432
> Project: Spark
>  Issue Type: Test
>  Components: SQL, Tests
>Affects Versions: 3.1.1
>Reporter: Kevin Pis
>Priority: Minor
>
> This is a followup of https://github.com/apache/spark/pull/19269
> In #19269 , there is only a scala implementation of simple writable data 
> source in `DataSourceV2Suite`.
> This PR adds a java implementation of it.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Assigned] (SPARK-34433) Lock jekyll version by Gemfile and Bundler

2021-02-13 Thread Apache Spark (Jira)


 [ 
https://issues.apache.org/jira/browse/SPARK-34433?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Apache Spark reassigned SPARK-34433:


Assignee: (was: Apache Spark)

> Lock jekyll version by Gemfile and Bundler
> --
>
> Key: SPARK-34433
> URL: https://issues.apache.org/jira/browse/SPARK-34433
> Project: Spark
>  Issue Type: Improvement
>  Components: Build, Deploy, Documentation
>Affects Versions: 3.2.0
>Reporter: Attila Zsolt Piros
>Priority: Major
>
> The Jekyll version can be pinned to specific 4.2.0 with Gemfile.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Commented] (SPARK-34433) Lock jekyll version by Gemfile and Bundler

2021-02-13 Thread Apache Spark (Jira)


[ 
https://issues.apache.org/jira/browse/SPARK-34433?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17284299#comment-17284299
 ] 

Apache Spark commented on SPARK-34433:
--

User 'attilapiros' has created a pull request for this issue:
https://github.com/apache/spark/pull/31559

> Lock jekyll version by Gemfile and Bundler
> --
>
> Key: SPARK-34433
> URL: https://issues.apache.org/jira/browse/SPARK-34433
> Project: Spark
>  Issue Type: Improvement
>  Components: Build, Deploy, Documentation
>Affects Versions: 3.2.0
>Reporter: Attila Zsolt Piros
>Priority: Major
>
> The Jekyll version can be pinned to specific 4.2.0 with Gemfile.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Assigned] (SPARK-34433) Lock jekyll version by Gemfile and Bundler

2021-02-13 Thread Apache Spark (Jira)


 [ 
https://issues.apache.org/jira/browse/SPARK-34433?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Apache Spark reassigned SPARK-34433:


Assignee: Apache Spark

> Lock jekyll version by Gemfile and Bundler
> --
>
> Key: SPARK-34433
> URL: https://issues.apache.org/jira/browse/SPARK-34433
> Project: Spark
>  Issue Type: Improvement
>  Components: Build, Deploy, Documentation
>Affects Versions: 3.2.0
>Reporter: Attila Zsolt Piros
>Assignee: Apache Spark
>Priority: Major
>
> The Jekyll version can be pinned to specific 4.2.0 with Gemfile.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Comment Edited] (SPARK-25075) Build and test Spark against Scala 2.13

2021-02-13 Thread Seth Tisue (Jira)


[ 
https://issues.apache.org/jira/browse/SPARK-25075?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17284288#comment-17284288
 ] 

Seth Tisue edited comment on SPARK-25075 at 2/13/21, 8:56 PM:
--

We at Lightbend will be releasing Scala 2.13.5 soon (thread 
[https://contributors.scala-lang.org/t/scala-2-13-5-planning/4825/10),] 
hopefully within a week or two, with a fix for 
[https://github.com/scala/bug/issues/12038] 


was (Author: tisue):
We at Lightbend will be releasing Scala 2.13.5 soon (thread 
[https://contributors.scala-lang.org/t/scala-2-13-5-planning/4825/10),] 
hopefully within a week or two, with a fix 
[https://github.com/scala/bug/issues/12038] 

> Build and test Spark against Scala 2.13
> ---
>
> Key: SPARK-25075
> URL: https://issues.apache.org/jira/browse/SPARK-25075
> Project: Spark
>  Issue Type: Umbrella
>  Components: Build, MLlib, Project Infra, Spark Core, SQL
>Affects Versions: 3.0.0
>Reporter: Guillaume Massé
>Priority: Major
>
> This umbrella JIRA tracks the requirements for building and testing Spark 
> against the current Scala 2.13 milestone.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Comment Edited] (SPARK-25075) Build and test Spark against Scala 2.13

2021-02-13 Thread Seth Tisue (Jira)


[ 
https://issues.apache.org/jira/browse/SPARK-25075?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17284288#comment-17284288
 ] 

Seth Tisue edited comment on SPARK-25075 at 2/13/21, 8:54 PM:
--

We at Lightbend will be releasing Scala 2.13.5 soon (thread 
[https://contributors.scala-lang.org/t/scala-2-13-5-planning/4825/10),] 
hopefully within a week or two, with a fix 
[https://github.com/scala/bug/issues/12038] 


was (Author: tisue):
We'll be releasing Scala 2.13.5 soon (thread 
[https://contributors.scala-lang.org/t/scala-2-13-5-planning/4825/10),] 
hopefully within a week or two, with a fix 
[https://github.com/scala/bug/issues/12038] 

> Build and test Spark against Scala 2.13
> ---
>
> Key: SPARK-25075
> URL: https://issues.apache.org/jira/browse/SPARK-25075
> Project: Spark
>  Issue Type: Umbrella
>  Components: Build, MLlib, Project Infra, Spark Core, SQL
>Affects Versions: 3.0.0
>Reporter: Guillaume Massé
>Priority: Major
>
> This umbrella JIRA tracks the requirements for building and testing Spark 
> against the current Scala 2.13 milestone.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Commented] (SPARK-25075) Build and test Spark against Scala 2.13

2021-02-13 Thread Seth Tisue (Jira)


[ 
https://issues.apache.org/jira/browse/SPARK-25075?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17284288#comment-17284288
 ] 

Seth Tisue commented on SPARK-25075:


We'll be releasing Scala 2.13.5 soon (thread 
[https://contributors.scala-lang.org/t/scala-2-13-5-planning/4825/10),] 
hopefully within a week or two, with a fix 
[https://github.com/scala/bug/issues/12038] 

> Build and test Spark against Scala 2.13
> ---
>
> Key: SPARK-25075
> URL: https://issues.apache.org/jira/browse/SPARK-25075
> Project: Spark
>  Issue Type: Umbrella
>  Components: Build, MLlib, Project Infra, Spark Core, SQL
>Affects Versions: 3.0.0
>Reporter: Guillaume Massé
>Priority: Major
>
> This umbrella JIRA tracks the requirements for building and testing Spark 
> against the current Scala 2.13 milestone.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Created] (SPARK-34433) Lock jekyll version by Gemfile and Bundler

2021-02-13 Thread Attila Zsolt Piros (Jira)
Attila Zsolt Piros created SPARK-34433:
--

 Summary: Lock jekyll version by Gemfile and Bundler
 Key: SPARK-34433
 URL: https://issues.apache.org/jira/browse/SPARK-34433
 Project: Spark
  Issue Type: Improvement
  Components: Build, Deploy, Documentation
Affects Versions: 3.2.0
Reporter: Attila Zsolt Piros


The Jekyll version can be pinned to specific 4.2.0 with Gemfile.




--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Commented] (SPARK-34433) Lock jekyll version by Gemfile and Bundler

2021-02-13 Thread Attila Zsolt Piros (Jira)


[ 
https://issues.apache.org/jira/browse/SPARK-34433?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17284287#comment-17284287
 ] 

Attila Zsolt Piros commented on SPARK-34433:


I am working on this.

> Lock jekyll version by Gemfile and Bundler
> --
>
> Key: SPARK-34433
> URL: https://issues.apache.org/jira/browse/SPARK-34433
> Project: Spark
>  Issue Type: Improvement
>  Components: Build, Deploy, Documentation
>Affects Versions: 3.2.0
>Reporter: Attila Zsolt Piros
>Priority: Major
>
> The Jekyll version can be pinned to specific 4.2.0 with Gemfile.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Assigned] (SPARK-34432) add a java implementation for the simple writable data source

2021-02-13 Thread Apache Spark (Jira)


 [ 
https://issues.apache.org/jira/browse/SPARK-34432?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Apache Spark reassigned SPARK-34432:


Assignee: Apache Spark

> add a java implementation for the simple writable data source
> -
>
> Key: SPARK-34432
> URL: https://issues.apache.org/jira/browse/SPARK-34432
> Project: Spark
>  Issue Type: Test
>  Components: SQL, Tests
>Affects Versions: 3.1.1
>Reporter: Kevin Pis
>Assignee: Apache Spark
>Priority: Minor
>
> This is a followup of https://github.com/apache/spark/pull/19269
> In #19269 , there is only a scala implementation of simple writable data 
> source in `DataSourceV2Suite`.
> This PR adds a java implementation of it.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Commented] (SPARK-34432) add a java implementation for the simple writable data source

2021-02-13 Thread Apache Spark (Jira)


[ 
https://issues.apache.org/jira/browse/SPARK-34432?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17284264#comment-17284264
 ] 

Apache Spark commented on SPARK-34432:
--

User 'kevincmchen' has created a pull request for this issue:
https://github.com/apache/spark/pull/31558

> add a java implementation for the simple writable data source
> -
>
> Key: SPARK-34432
> URL: https://issues.apache.org/jira/browse/SPARK-34432
> Project: Spark
>  Issue Type: Test
>  Components: SQL, Tests
>Affects Versions: 3.1.1
>Reporter: Kevin Pis
>Priority: Minor
>
> This is a followup of https://github.com/apache/spark/pull/19269
> In #19269 , there is only a scala implementation of simple writable data 
> source in `DataSourceV2Suite`.
> This PR adds a java implementation of it.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Assigned] (SPARK-34432) add a java implementation for the simple writable data source

2021-02-13 Thread Apache Spark (Jira)


 [ 
https://issues.apache.org/jira/browse/SPARK-34432?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Apache Spark reassigned SPARK-34432:


Assignee: (was: Apache Spark)

> add a java implementation for the simple writable data source
> -
>
> Key: SPARK-34432
> URL: https://issues.apache.org/jira/browse/SPARK-34432
> Project: Spark
>  Issue Type: Test
>  Components: SQL, Tests
>Affects Versions: 3.1.1
>Reporter: Kevin Pis
>Priority: Minor
>
> This is a followup of https://github.com/apache/spark/pull/19269
> In #19269 , there is only a scala implementation of simple writable data 
> source in `DataSourceV2Suite`.
> This PR adds a java implementation of it.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Commented] (SPARK-33210) Set the rebasing mode for parquet INT96 type to `EXCEPTION` by default

2021-02-13 Thread Apache Spark (Jira)


[ 
https://issues.apache.org/jira/browse/SPARK-33210?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17284245#comment-17284245
 ] 

Apache Spark commented on SPARK-33210:
--

User 'MaxGekk' has created a pull request for this issue:
https://github.com/apache/spark/pull/31557

> Set the rebasing mode for parquet INT96 type to `EXCEPTION` by default
> --
>
> Key: SPARK-33210
> URL: https://issues.apache.org/jira/browse/SPARK-33210
> Project: Spark
>  Issue Type: Improvement
>  Components: SQL
>Affects Versions: 3.1.0
>Reporter: Maxim Gekk
>Assignee: Maxim Gekk
>Priority: Major
> Fix For: 3.1.0
>
>
> The ticket aims to set the following SQL configs:
> - spark.sql.legacy.parquet.int96RebaseModeInWrite
> - spark.sql.legacy.parquet.int96RebaseModeInRead
> to EXCEPTION by default.
> The reason is let users to decide should Spark modify loaded/saved timestamps 
> instead of silently shifting timestamps while rebasing.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Created] (SPARK-34432) add a java implementation for the simple writable data source

2021-02-13 Thread Kevin Pis (Jira)
Kevin Pis created SPARK-34432:
-

 Summary: add a java implementation for the simple writable data 
source
 Key: SPARK-34432
 URL: https://issues.apache.org/jira/browse/SPARK-34432
 Project: Spark
  Issue Type: Test
  Components: SQL, Tests
Affects Versions: 3.1.1
Reporter: Kevin Pis


This is a followup of https://github.com/apache/spark/pull/19269

In #19269 , there is only a scala implementation of simple writable data source 
in `DataSourceV2Suite`.

This PR adds a java implementation of it.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Commented] (SPARK-33210) Set the rebasing mode for parquet INT96 type to `EXCEPTION` by default

2021-02-13 Thread Apache Spark (Jira)


[ 
https://issues.apache.org/jira/browse/SPARK-33210?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17284244#comment-17284244
 ] 

Apache Spark commented on SPARK-33210:
--

User 'MaxGekk' has created a pull request for this issue:
https://github.com/apache/spark/pull/31557

> Set the rebasing mode for parquet INT96 type to `EXCEPTION` by default
> --
>
> Key: SPARK-33210
> URL: https://issues.apache.org/jira/browse/SPARK-33210
> Project: Spark
>  Issue Type: Improvement
>  Components: SQL
>Affects Versions: 3.1.0
>Reporter: Maxim Gekk
>Assignee: Maxim Gekk
>Priority: Major
> Fix For: 3.1.0
>
>
> The ticket aims to set the following SQL configs:
> - spark.sql.legacy.parquet.int96RebaseModeInWrite
> - spark.sql.legacy.parquet.int96RebaseModeInRead
> to EXCEPTION by default.
> The reason is let users to decide should Spark modify loaded/saved timestamps 
> instead of silently shifting timestamps while rebasing.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Resolved] (SPARK-33434) Document spark.conf.isModifiable()

2021-02-13 Thread Sean R. Owen (Jira)


 [ 
https://issues.apache.org/jira/browse/SPARK-33434?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sean R. Owen resolved SPARK-33434.
--
Fix Version/s: 3.2.0
   Resolution: Fixed

Issue resolved by pull request 31483
[https://github.com/apache/spark/pull/31483]

> Document spark.conf.isModifiable()
> --
>
> Key: SPARK-33434
> URL: https://issues.apache.org/jira/browse/SPARK-33434
> Project: Spark
>  Issue Type: Improvement
>  Components: Documentation, PySpark
>Affects Versions: 3.1.0
>Reporter: Nicholas Chammas
>Priority: Minor
> Fix For: 3.2.0
>
>
> PySpark's docs make no mention of {{conf.isModifiable()}}, though it appears 
> to be a public method introduced in SPARK-24761.
> http://spark.apache.org/docs/3.0.1/api/python/pyspark.sql.html#pyspark.sql.SparkSession.conf



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Assigned] (SPARK-33434) Document spark.conf.isModifiable()

2021-02-13 Thread Sean R. Owen (Jira)


 [ 
https://issues.apache.org/jira/browse/SPARK-33434?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sean R. Owen reassigned SPARK-33434:


Assignee: Eric Lemmon

> Document spark.conf.isModifiable()
> --
>
> Key: SPARK-33434
> URL: https://issues.apache.org/jira/browse/SPARK-33434
> Project: Spark
>  Issue Type: Improvement
>  Components: Documentation, PySpark
>Affects Versions: 3.1.0
>Reporter: Nicholas Chammas
>Assignee: Eric Lemmon
>Priority: Minor
> Fix For: 3.2.0
>
>
> PySpark's docs make no mention of {{conf.isModifiable()}}, though it appears 
> to be a public method introduced in SPARK-24761.
> http://spark.apache.org/docs/3.0.1/api/python/pyspark.sql.html#pyspark.sql.SparkSession.conf



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Updated] (SPARK-33434) Document spark.conf.isModifiable()

2021-02-13 Thread Sean R. Owen (Jira)


 [ 
https://issues.apache.org/jira/browse/SPARK-33434?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sean R. Owen updated SPARK-33434:
-
Priority: Trivial  (was: Minor)

> Document spark.conf.isModifiable()
> --
>
> Key: SPARK-33434
> URL: https://issues.apache.org/jira/browse/SPARK-33434
> Project: Spark
>  Issue Type: Improvement
>  Components: Documentation, PySpark
>Affects Versions: 3.1.0
>Reporter: Nicholas Chammas
>Assignee: Eric Lemmon
>Priority: Trivial
> Fix For: 3.2.0
>
>
> PySpark's docs make no mention of {{conf.isModifiable()}}, though it appears 
> to be a public method introduced in SPARK-24761.
> http://spark.apache.org/docs/3.0.1/api/python/pyspark.sql.html#pyspark.sql.SparkSession.conf



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Commented] (SPARK-34431) Only load hive-site.xml once

2021-02-13 Thread Apache Spark (Jira)


[ 
https://issues.apache.org/jira/browse/SPARK-34431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17284183#comment-17284183
 ] 

Apache Spark commented on SPARK-34431:
--

User 'MaxGekk' has created a pull request for this issue:
https://github.com/apache/spark/pull/31556

> Only load hive-site.xml once
> 
>
> Key: SPARK-34431
> URL: https://issues.apache.org/jira/browse/SPARK-34431
> Project: Spark
>  Issue Type: Improvement
>  Components: SQL
>Affects Versions: 3.2.0
>Reporter: Maxim Gekk
>Priority: Major
>
> Hive configs from hive-site.xml are parsed over and over again. We can 
> optimize this, and parse it only once.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Assigned] (SPARK-34431) Only load hive-site.xml once

2021-02-13 Thread Apache Spark (Jira)


 [ 
https://issues.apache.org/jira/browse/SPARK-34431?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Apache Spark reassigned SPARK-34431:


Assignee: (was: Apache Spark)

> Only load hive-site.xml once
> 
>
> Key: SPARK-34431
> URL: https://issues.apache.org/jira/browse/SPARK-34431
> Project: Spark
>  Issue Type: Improvement
>  Components: SQL
>Affects Versions: 3.2.0
>Reporter: Maxim Gekk
>Priority: Major
>
> Hive configs from hive-site.xml are parsed over and over again. We can 
> optimize this, and parse it only once.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Assigned] (SPARK-34431) Only load hive-site.xml once

2021-02-13 Thread Apache Spark (Jira)


 [ 
https://issues.apache.org/jira/browse/SPARK-34431?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Apache Spark reassigned SPARK-34431:


Assignee: Apache Spark

> Only load hive-site.xml once
> 
>
> Key: SPARK-34431
> URL: https://issues.apache.org/jira/browse/SPARK-34431
> Project: Spark
>  Issue Type: Improvement
>  Components: SQL
>Affects Versions: 3.2.0
>Reporter: Maxim Gekk
>Assignee: Apache Spark
>Priority: Major
>
> Hive configs from hive-site.xml are parsed over and over again. We can 
> optimize this, and parse it only once.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Created] (SPARK-34431) Only load hive-site.xml once

2021-02-13 Thread Maxim Gekk (Jira)
Maxim Gekk created SPARK-34431:
--

 Summary: Only load hive-site.xml once
 Key: SPARK-34431
 URL: https://issues.apache.org/jira/browse/SPARK-34431
 Project: Spark
  Issue Type: Improvement
  Components: SQL
Affects Versions: 3.2.0
Reporter: Maxim Gekk


Hive configs from hive-site.xml are parsed over and over again. We can optimize 
this, and parse it only once.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Assigned] (SPARK-34430) Update index.md with a pyspark hint to avoid java.nio.DirectByteBuffer.(long, int) not available

2021-02-13 Thread Apache Spark (Jira)


 [ 
https://issues.apache.org/jira/browse/SPARK-34430?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Apache Spark reassigned SPARK-34430:


Assignee: Apache Spark

> Update index.md with a pyspark hint to avoid java.nio.DirectByteBuffer.(long, 
> int) not available
> 
>
> Key: SPARK-34430
> URL: https://issues.apache.org/jira/browse/SPARK-34430
> Project: Spark
>  Issue Type: Documentation
>  Components: Documentation
>Affects Versions: 3.0.0
>Reporter: Marco van der Linden
>Assignee: Apache Spark
>Priority: Trivial
>  Labels: pull-request-available
> Fix For: 3.0.0
>
>   Original Estimate: 1h
>  Remaining Estimate: 1h
>
> Took us a while to figure out how to fix this with pyspark this might save a 
> few people a few hours...
>  
> The documentation describes vaguely how to fix the issue, by setting a 
> parameter but without an actual working example. 
> With the given PySpark example it should hold enough information to set this 
> in other scenarios as well.
>  
>  
> Kept the change to the docs as small as possible. 
> h3. What changes were proposed in this pull request?
> doc update, see title
> h3. Why are the changes needed?
> save people time figuring out how to resolve it
> h3. Does this PR introduce _any_ user-facing change?
> no
> h3. How was this patch tested?
> no code changes



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Assigned] (SPARK-34430) Update index.md with a pyspark hint to avoid java.nio.DirectByteBuffer.(long, int) not available

2021-02-13 Thread Apache Spark (Jira)


 [ 
https://issues.apache.org/jira/browse/SPARK-34430?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Apache Spark reassigned SPARK-34430:


Assignee: (was: Apache Spark)

> Update index.md with a pyspark hint to avoid java.nio.DirectByteBuffer.(long, 
> int) not available
> 
>
> Key: SPARK-34430
> URL: https://issues.apache.org/jira/browse/SPARK-34430
> Project: Spark
>  Issue Type: Documentation
>  Components: Documentation
>Affects Versions: 3.0.0
>Reporter: Marco van der Linden
>Priority: Trivial
>  Labels: pull-request-available
> Fix For: 3.0.0
>
>   Original Estimate: 1h
>  Remaining Estimate: 1h
>
> Took us a while to figure out how to fix this with pyspark this might save a 
> few people a few hours...
>  
> The documentation describes vaguely how to fix the issue, by setting a 
> parameter but without an actual working example. 
> With the given PySpark example it should hold enough information to set this 
> in other scenarios as well.
>  
>  
> Kept the change to the docs as small as possible. 
> h3. What changes were proposed in this pull request?
> doc update, see title
> h3. Why are the changes needed?
> save people time figuring out how to resolve it
> h3. Does this PR introduce _any_ user-facing change?
> no
> h3. How was this patch tested?
> no code changes



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Commented] (SPARK-34430) Update index.md with a pyspark hint to avoid java.nio.DirectByteBuffer.(long, int) not available

2021-02-13 Thread Apache Spark (Jira)


[ 
https://issues.apache.org/jira/browse/SPARK-34430?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17284161#comment-17284161
 ] 

Apache Spark commented on SPARK-34430:
--

User 'Nozziel' has created a pull request for this issue:
https://github.com/apache/spark/pull/31551

> Update index.md with a pyspark hint to avoid java.nio.DirectByteBuffer.(long, 
> int) not available
> 
>
> Key: SPARK-34430
> URL: https://issues.apache.org/jira/browse/SPARK-34430
> Project: Spark
>  Issue Type: Documentation
>  Components: Documentation
>Affects Versions: 3.0.0
>Reporter: Marco van der Linden
>Priority: Trivial
>  Labels: pull-request-available
> Fix For: 3.0.0
>
>   Original Estimate: 1h
>  Remaining Estimate: 1h
>
> Took us a while to figure out how to fix this with pyspark this might save a 
> few people a few hours...
>  
> The documentation describes vaguely how to fix the issue, by setting a 
> parameter but without an actual working example. 
> With the given PySpark example it should hold enough information to set this 
> in other scenarios as well.
>  
>  
> Kept the change to the docs as small as possible. 
> h3. What changes were proposed in this pull request?
> doc update, see title
> h3. Why are the changes needed?
> save people time figuring out how to resolve it
> h3. Does this PR introduce _any_ user-facing change?
> no
> h3. How was this patch tested?
> no code changes



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Commented] (SPARK-34430) Update index.md with a pyspark hint to avoid java.nio.DirectByteBuffer.(long, int) not available

2021-02-13 Thread Apache Spark (Jira)


[ 
https://issues.apache.org/jira/browse/SPARK-34430?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17284160#comment-17284160
 ] 

Apache Spark commented on SPARK-34430:
--

User 'Nozziel' has created a pull request for this issue:
https://github.com/apache/spark/pull/31551

> Update index.md with a pyspark hint to avoid java.nio.DirectByteBuffer.(long, 
> int) not available
> 
>
> Key: SPARK-34430
> URL: https://issues.apache.org/jira/browse/SPARK-34430
> Project: Spark
>  Issue Type: Documentation
>  Components: Documentation
>Affects Versions: 3.0.0
>Reporter: Marco van der Linden
>Priority: Trivial
>  Labels: pull-request-available
> Fix For: 3.0.0
>
>   Original Estimate: 1h
>  Remaining Estimate: 1h
>
> Took us a while to figure out how to fix this with pyspark this might save a 
> few people a few hours...
>  
> The documentation describes vaguely how to fix the issue, by setting a 
> parameter but without an actual working example. 
> With the given PySpark example it should hold enough information to set this 
> in other scenarios as well.
>  
>  
> Kept the change to the docs as small as possible. 
> h3. What changes were proposed in this pull request?
> doc update, see title
> h3. Why are the changes needed?
> save people time figuring out how to resolve it
> h3. Does this PR introduce _any_ user-facing change?
> no
> h3. How was this patch tested?
> no code changes



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Created] (SPARK-34430) Update index.md with a pyspark hint to avoid java.nio.DirectByteBuffer.(long, int) not available

2021-02-13 Thread Marco van der Linden (Jira)
Marco van der Linden created SPARK-34430:


 Summary: Update index.md with a pyspark hint to avoid 
java.nio.DirectByteBuffer.(long, int) not available
 Key: SPARK-34430
 URL: https://issues.apache.org/jira/browse/SPARK-34430
 Project: Spark
  Issue Type: Documentation
  Components: Documentation
Affects Versions: 3.0.0
Reporter: Marco van der Linden
 Fix For: 3.0.0


Took us a while to figure out how to fix this with pyspark this might save a 
few people a few hours...

 

The documentation describes vaguely how to fix the issue, by setting a 
parameter but without an actual working example. 

With the given PySpark example it should hold enough information to set this in 
other scenarios as well.

 

 

Kept the change to the docs as small as possible. 
h3. What changes were proposed in this pull request?

doc update, see title
h3. Why are the changes needed?

save people time figuring out how to resolve it
h3. Does this PR introduce _any_ user-facing change?

no
h3. How was this patch tested?

no code changes



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org