Felix Cheung created SPARK-21149:
Summary: Add job description API for R
Key: SPARK-21149
URL: https://issues.apache.org/jira/browse/SPARK-21149
Project: Spark
Issue Type: Improvement
[
https://issues.apache.org/jira/browse/SPARK-21148?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-21148:
Assignee: Apache Spark
> Set SparkUncaughtExceptionHandler to the Master
>
[
https://issues.apache.org/jira/browse/SPARK-21148?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-21148:
Assignee: (was: Apache Spark)
> Set SparkUncaughtExceptionHandler to the Master
>
[
https://issues.apache.org/jira/browse/SPARK-21148?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16055070#comment-16055070
]
Apache Spark commented on SPARK-21148:
--
User 'devaraj-kavali' has created a pull request for this
[
https://issues.apache.org/jira/browse/SPARK-20889?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Felix Cheung resolved SPARK-20889.
--
Resolution: Fixed
Assignee: Wayne Zhang
Fix Version/s: 2.3.0
Devaraj K created SPARK-21148:
-
Summary: Set SparkUncaughtExceptionHandler to the Master
Key: SPARK-21148
URL: https://issues.apache.org/jira/browse/SPARK-21148
Project: Spark
Issue Type:
[
https://issues.apache.org/jira/browse/SPARK-21144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16055049#comment-16055049
]
Takeshi Yamamuro commented on SPARK-21144:
--
okay, I'm currently looking into this.
> Unexpected
Fei Shao created SPARK-21147:
Summary: the schema of socket source can not be set.
Key: SPARK-21147
URL: https://issues.apache.org/jira/browse/SPARK-21147
Project: Spark
Issue Type: Bug
[
https://issues.apache.org/jira/browse/SPARK-21133?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Wenchen Fan reassigned SPARK-21133:
---
Assignee: Yuming Wang
> HighlyCompressedMapStatus#writeExternal throws NPE
>
[
https://issues.apache.org/jira/browse/SPARK-21133?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Wenchen Fan resolved SPARK-21133.
-
Resolution: Fixed
Fix Version/s: 2.2.0
Issue resolved by pull request 18343
[
https://issues.apache.org/jira/browse/SPARK-21146?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16055024#comment-16055024
]
Apache Spark commented on SPARK-21146:
--
User 'devaraj-kavali' has created a pull request for this
[
https://issues.apache.org/jira/browse/SPARK-21146?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-21146:
Assignee: Apache Spark
> Worker should handle and shutdown when any thread gets
[
https://issues.apache.org/jira/browse/SPARK-21146?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-21146:
Assignee: (was: Apache Spark)
> Worker should handle and shutdown when any thread
[
https://issues.apache.org/jira/browse/SPARK-21144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-21144:
Assignee: Apache Spark
> Unexpected results when the data schema and partition schema
[
https://issues.apache.org/jira/browse/SPARK-21144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16055015#comment-16055015
]
Apache Spark commented on SPARK-21144:
--
User 'maropu' has created a pull request for this issue:
[
https://issues.apache.org/jira/browse/SPARK-21144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-21144:
Assignee: (was: Apache Spark)
> Unexpected results when the data schema and partition
Devaraj K created SPARK-21146:
-
Summary: Worker should handle and shutdown when any thread gets
UncaughtException
Key: SPARK-21146
URL: https://issues.apache.org/jira/browse/SPARK-21146
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-18191?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16054978#comment-16054978
]
Shridhar Ramachandran commented on SPARK-18191:
---
I see this got committed only in 2.2.0 and
[
https://issues.apache.org/jira/browse/SPARK-18191?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16054978#comment-16054978
]
Shridhar Ramachandran edited comment on SPARK-18191 at 6/20/17 12:11 AM:
[
https://issues.apache.org/jira/browse/SPARK-8642?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Marcelo Vanzin resolved SPARK-8642.
---
Resolution: Won't Fix
Even though a better error here would be nice, I'll close this because
[
https://issues.apache.org/jira/browse/SPARK-21145?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-21145:
Assignee: Apache Spark (was: Tathagata Das)
> Restarted queries reuse same
[
https://issues.apache.org/jira/browse/SPARK-21145?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16054945#comment-16054945
]
Apache Spark commented on SPARK-21145:
--
User 'tdas' has created a pull request for this issue:
[
https://issues.apache.org/jira/browse/SPARK-21145?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-21145:
Assignee: Tathagata Das (was: Apache Spark)
> Restarted queries reuse same
Tathagata Das created SPARK-21145:
-
Summary: Restarted queries reuse same StateStoreProvider, causing
multiple concurrent tasks to update same StateStore
Key: SPARK-21145
URL:
[
https://issues.apache.org/jira/browse/SPARK-21138?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Marcelo Vanzin resolved SPARK-21138.
Resolution: Fixed
Assignee: sharkd tu
Fix Version/s: 2.3.0
[
https://issues.apache.org/jira/browse/SPARK-21124?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Marcelo Vanzin resolved SPARK-21124.
Resolution: Fixed
Assignee: Marcelo Vanzin
Fix Version/s: 2.3.0
> Wrong
[
https://issues.apache.org/jira/browse/SPARK-21144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiao Li updated SPARK-21144:
Target Version/s: 2.2.0
> Unexpected results when the data schema and partition schema have the
>
[
https://issues.apache.org/jira/browse/SPARK-21144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16054788#comment-16054788
]
Xiao Li commented on SPARK-21144:
-
cc [~maropu]
> Unexpected results when the data schema and partition
[
https://issues.apache.org/jira/browse/SPARK-18016?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16054785#comment-16054785
]
Apache Spark commented on SPARK-18016:
--
User 'bdrillard' has created a pull request for this issue:
Xiao Li created SPARK-21144:
---
Summary: Unexpected results when the data schema and partition
schema have the duplicate columns
Key: SPARK-21144
URL: https://issues.apache.org/jira/browse/SPARK-21144
[
https://issues.apache.org/jira/browse/SPARK-18016?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16054784#comment-16054784
]
Aleksander Eskilson commented on SPARK-18016:
-
[~cloud_fan], [~divshukla], I've created a PR
[
https://issues.apache.org/jira/browse/SPARK-21143?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16054707#comment-16054707
]
Ryan Williams commented on SPARK-21143:
---
[~zsxwing]
bq. it's too risky to upgrade from 4.0.X to
[
https://issues.apache.org/jira/browse/SPARK-11170?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16054701#comment-16054701
]
remoteServer commented on SPARK-11170:
--
Do we have steps to reproduce the issue? I have enabled
[
https://issues.apache.org/jira/browse/SPARK-20928?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16054690#comment-16054690
]
Cody Koeninger commented on SPARK-20928:
Cool, can you label it SPIP so it shows up linked from
[
https://issues.apache.org/jira/browse/SPARK-21143?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16054631#comment-16054631
]
Sean Owen commented on SPARK-21143:
---
If this reduces to a 4.0 vs 4.1 conflict, then this is SPARK-19552
[
https://issues.apache.org/jira/browse/SPARK-21142?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-21142:
Assignee: Apache Spark
> spark-streaming-kafka-0-10 has too fat dependency on kafka
>
[
https://issues.apache.org/jira/browse/SPARK-21142?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16054612#comment-16054612
]
Apache Spark commented on SPARK-21142:
--
User 'timvw' has created a pull request for this issue:
[
https://issues.apache.org/jira/browse/SPARK-21142?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-21142:
Assignee: (was: Apache Spark)
> spark-streaming-kafka-0-10 has too fat dependency on
[
https://issues.apache.org/jira/browse/SPARK-21133?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Michael Armbrust updated SPARK-21133:
-
Target Version/s: 2.2.0
Priority: Blocker (was: Major)
Description:
[
https://issues.apache.org/jira/browse/SPARK-21102?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16054600#comment-16054600
]
Reynold Xin commented on SPARK-21102:
-
Can you submit a pull request so we can discuss the details of
[
https://issues.apache.org/jira/browse/SPARK-20928?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16054596#comment-16054596
]
Michael Armbrust commented on SPARK-20928:
--
Hi Cody, I do plan to flesh this out with the other
[
https://issues.apache.org/jira/browse/SPARK-21143?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16054593#comment-16054593
]
Shixiong Zhu commented on SPARK-21143:
--
The reason you cannot use 4.0.42.Final is because you are
[
https://issues.apache.org/jira/browse/SPARK-19975?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Xiao Li resolved SPARK-19975.
-
Resolution: Fixed
Assignee: Yong Tang
Fix Version/s: 2.3.0
> Add map_keys and map_values
[
https://issues.apache.org/jira/browse/SPARK-21143?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16054592#comment-16054592
]
Shixiong Zhu commented on SPARK-21143:
--
As Netty is so core to Spark, it's too risky to upgrade from
[
https://issues.apache.org/jira/browse/SPARK-21102?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16054591#comment-16054591
]
Anton Okolnychyi commented on SPARK-21102:
--
Hi [~rxin],
I took a look at this issue and have a
[
https://issues.apache.org/jira/browse/SPARK-12414?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16054529#comment-16054529
]
Ritesh Tijoriwala commented on SPARK-12414:
---
I have a similar situation. I have several classes
[
https://issues.apache.org/jira/browse/SPARK-21142?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Shixiong Zhu updated SPARK-21142:
-
Component/s: (was: Structured Streaming)
DStreams
>
[
https://issues.apache.org/jira/browse/SPARK-21123?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Shixiong Zhu resolved SPARK-21123.
--
Resolution: Fixed
Fix Version/s: 2.3.0
2.2.0
> Options for file
[
https://issues.apache.org/jira/browse/SPARK-16430?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Shixiong Zhu updated SPARK-16430:
-
Fix Version/s: (was: 2.1.0)
2.0.0
> Add an option in file stream source
[
https://issues.apache.org/jira/browse/SPARK-16430?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Shixiong Zhu updated SPARK-16430:
-
Fix Version/s: 2.1.0
> Add an option in file stream source to read 1 file at a time
>
[
https://issues.apache.org/jira/browse/SPARK-19688?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Marcelo Vanzin resolved SPARK-19688.
Resolution: Fixed
Fix Version/s: 2.3.0
2.2.1
Ryan Williams created SPARK-21143:
-
Summary: Fail to fetch blocks >1MB in size in presence of
conflicting Netty version
Key: SPARK-21143
URL: https://issues.apache.org/jira/browse/SPARK-21143
[
https://issues.apache.org/jira/browse/SPARK-21137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16054176#comment-16054176
]
sam edited comment on SPARK-21137 at 6/19/17 3:20 PM:
--
[~srowen] Ah OK, sorry, not
[
https://issues.apache.org/jira/browse/SPARK-21137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16054176#comment-16054176
]
sam commented on SPARK-21137:
-
[~srowen] Ah OK, sorry, not used to that process. On other projects I've seen
[
https://issues.apache.org/jira/browse/SPARK-650?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16054162#comment-16054162
]
Michael Schmeißer commented on SPARK-650:
-
[~riteshtijoriwala] - Sorry, but I am not familiar with
[
https://issues.apache.org/jira/browse/SPARK-21080?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16054155#comment-16054155
]
Lukasz Raszka commented on SPARK-21080:
---
[~jerryshao] Yes, it's in HA mode. Updating to newer HDFS
[
https://issues.apache.org/jira/browse/SPARK-17176?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sean Owen resolved SPARK-17176.
---
Resolution: Won't Fix
> Task are sorted by "Index" in Stage Page.
>
Tim Van Wassenhove created SPARK-21142:
--
Summary: spark-streaming-kafka-0-10 has too fat dependency on kafka
Key: SPARK-21142
URL: https://issues.apache.org/jira/browse/SPARK-21142
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-21142?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16054130#comment-16054130
]
Tim Van Wassenhove commented on SPARK-21142:
Opened a PR on github:
[
https://issues.apache.org/jira/browse/SPARK-18016?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16054118#comment-16054118
]
Aleksander Eskilson commented on SPARK-18016:
-
[~cloud_fan], [~divshukla], yeah, I'd be happy
[
https://issues.apache.org/jira/browse/SPARK-21137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16054115#comment-16054115
]
Sean Owen commented on SPARK-21137:
---
Try a thread dump on the driver. Until there's some more detail
[
https://issues.apache.org/jira/browse/SPARK-21137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16054111#comment-16054111
]
sam edited comment on SPARK-21137 at 6/19/17 2:36 PM:
--
[~srowen]
> what stages are
[
https://issues.apache.org/jira/browse/SPARK-21137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16054111#comment-16054111
]
sam edited comment on SPARK-21137 at 6/19/17 2:35 PM:
--
[~srowen]
> what stages are
[
https://issues.apache.org/jira/browse/SPARK-21137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16054111#comment-16054111
]
sam commented on SPARK-21137:
-
[~srowen]
> what stages are executing if any?
*None, no tasks are started*.
[
https://issues.apache.org/jira/browse/SPARK-19809?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16054108#comment-16054108
]
Dongjoon Hyun commented on SPARK-19809:
---
Yep. I'm trying to fix this with new ORC data source. It
[
https://issues.apache.org/jira/browse/SPARK-19809?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16054076#comment-16054076
]
Hyukjin Kwon commented on SPARK-19809:
--
What you see is what you get. This is "Reopened" per the
[
https://issues.apache.org/jira/browse/SPARK-21141?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sean Owen resolved SPARK-21141.
---
Resolution: Not A Problem
[~mprocop] please don't reopen JIRAs. We can reopen if needed. As I say, I
[
https://issues.apache.org/jira/browse/SPARK-21140?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16054055#comment-16054055
]
Sean Owen commented on SPARK-21140:
---
Yes, it's possible the executor makes a copy of some data during
[
https://issues.apache.org/jira/browse/SPARK-21141?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
michael procopio reopened SPARK-21141:
--
My apologies, I mean spark-submit --version.
> spark-update --version is hard to parse
>
[
https://issues.apache.org/jira/browse/SPARK-21140?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
michael procopio reopened SPARK-21140:
--
I am not sure what detail you are looking for. I provided the test code I was
using.
[
https://issues.apache.org/jira/browse/SPARK-21140?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16054043#comment-16054043
]
Sean Owen edited comment on SPARK-21140 at 6/19/17 2:02 PM:
I disagree
[
https://issues.apache.org/jira/browse/SPARK-21140?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16054043#comment-16054043
]
michael procopio commented on SPARK-21140:
--
I disagree executor memory does depend on the size
[
https://issues.apache.org/jira/browse/SPARK-21137?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
sam updated SPARK-21137:
Description:
A very common use case in big data is to read a large number of small files.
For example the Enron
[
https://issues.apache.org/jira/browse/SPARK-21137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16054042#comment-16054042
]
Sean Owen commented on SPARK-21137:
---
Are you sure it's not just appearing to be stuck reading the file
[
https://issues.apache.org/jira/browse/SPARK-21137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16054026#comment-16054026
]
sam edited comment on SPARK-21137 at 6/19/17 1:53 PM:
--
[~srowen] As I said in the
[
https://issues.apache.org/jira/browse/SPARK-19809?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16054031#comment-16054031
]
Renu Yadav commented on SPARK-19809:
What is the resolution of this issue.
[
https://issues.apache.org/jira/browse/SPARK-21137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16054026#comment-16054026
]
sam commented on SPARK-21137:
-
[~srowen] As I said in the description, which you may have missed, the logs
[
https://issues.apache.org/jira/browse/SPARK-21137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16054004#comment-16054004
]
Sean Owen commented on SPARK-21137:
---
As i say, you're not setting anything about the partitioning here.
[
https://issues.apache.org/jira/browse/SPARK-21141?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sean Owen resolved SPARK-21141.
---
Resolution: Not A Problem
There is no spark-update. It is not intended as an API to determine the
[
https://issues.apache.org/jira/browse/SPARK-21140?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sean Owen resolved SPARK-21140.
---
Resolution: Invalid
There's no real detail here. Executor memory doesn't directly matter to how
michael procopio created SPARK-21141:
Summary: spark-update --version is hard to parse
Key: SPARK-21141
URL: https://issues.apache.org/jira/browse/SPARK-21141
Project: Spark
Issue Type:
michael procopio created SPARK-21140:
Summary: Reduce collect high memory requrements
Key: SPARK-21140
URL: https://issues.apache.org/jira/browse/SPARK-21140
Project: Spark
Issue Type:
[
https://issues.apache.org/jira/browse/SPARK-21137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16053977#comment-16053977
]
sam commented on SPARK-21137:
-
[~srowen]
So I've provided full reproduce steps here (including code and
[
https://issues.apache.org/jira/browse/SPARK-21137?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
sam updated SPARK-21137:
Description:
A very common use case in big data is to read a large number of small files.
For example the Enron
[
https://issues.apache.org/jira/browse/SPARK-20931?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Yuming Wang resolved SPARK-20931.
-
Resolution: Fixed
Fix Version/s: 2.3.0
> Built-in SQL Function ABS support string type
>
[
https://issues.apache.org/jira/browse/SPARK-21139?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16053911#comment-16053911
]
Sean Owen commented on SPARK-21139:
---
That looks like an issue from the HBase client, not Spark.
>
shining created SPARK-21139:
---
Summary: java.util.concurrent.RejectedExecutionException: rejected
from java.util.concurrent.ThreadPoolExecutor@46477dd0[Terminated, pool size =
0, active threads = 0, queued tasks = 0, completed tasks = 14109]
[
https://issues.apache.org/jira/browse/SPARK-20568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16053871#comment-16053871
]
Fei Shao commented on SPARK-20568:
--
I also do not support this feature too.
If we delete files
[
https://issues.apache.org/jira/browse/SPARK-21137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16053820#comment-16053820
]
Sean Owen commented on SPARK-21137:
---
Here's a hint, or example of what could be going wrong: you may
[
https://issues.apache.org/jira/browse/SPARK-21138?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-21138:
Assignee: (was: Apache Spark)
> Cannot delete staging dir when the clusters of
[
https://issues.apache.org/jira/browse/SPARK-21138?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16053817#comment-16053817
]
Apache Spark commented on SPARK-21138:
--
User 'sharkdtu' has created a pull request for this issue:
[
https://issues.apache.org/jira/browse/SPARK-21138?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-21138:
Assignee: Apache Spark
> Cannot delete staging dir when the clusters of
[
https://issues.apache.org/jira/browse/SPARK-21137?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sean Owen closed SPARK-21137.
-
> Spark cannot read many small files (wholeTextFiles)
> ---
[
https://issues.apache.org/jira/browse/SPARK-21137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16053808#comment-16053808
]
sam edited comment on SPARK-21137 at 6/19/17 11:14 AM:
---
[~srowen] Sorry about the
[
https://issues.apache.org/jira/browse/SPARK-21138?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
sharkd tu updated SPARK-21138:
--
Description:
When I set different clusters for "spark.hadoop.fs.defaultFS" and
sharkd tu created SPARK-21138:
-
Summary: Cannot delete staging dir when the clusters of
"spark.yarn.stagingDir" and "spark.hadoop.fs.defaultFS" are different
Key: SPARK-21138
URL:
[
https://issues.apache.org/jira/browse/SPARK-21137?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
sam reopened SPARK-21137:
-
Reopened after adding detail.
> Spark cannot read many small files (wholeTextFiles)
>
[
https://issues.apache.org/jira/browse/SPARK-21137?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
sam updated SPARK-21137:
Description:
A very common use case in big data is to read a large number of small files.
For example the Enron
[
https://issues.apache.org/jira/browse/SPARK-21137?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sean Owen resolved SPARK-21137.
---
Resolution: Invalid
Don't reopen this please. Someone will do that if it's appropriate.
This still
[
https://issues.apache.org/jira/browse/SPARK-21137?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
sam updated SPARK-21137:
Description:
A very common use case in big data is to read a large number of small files.
For example the Enron
1 - 100 of 125 matches
Mail list logo