Hi,
updateStateByKey: if you can briefly describe the issue you are facing
with this, that would be great.
Regarding not keeping the whole dataset in memory, you can tweak the
remember parameter so that checkpointing happens at the appropriate time.
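For concreteness, here is a minimal sketch of the kind of setup I mean,
assuming a simple word-count stream (the checkpoint path, host/port and
intervals are placeholders, not recommendations):

    import org.apache.spark.SparkConf
    import org.apache.spark.streaming.{Seconds, StreamingContext}
    import org.apache.spark.streaming.StreamingContext._

    val conf = new SparkConf().setAppName("StatefulCounts")
    val ssc = new StreamingContext(conf, Seconds(10))
    ssc.checkpoint("hdfs:///tmp/checkpoints") // required for stateful operators
    ssc.remember(Seconds(60))                 // retain only the last minute of RDDs

    // running count per word, carried across batches
    val updateFunc = (values: Seq[Int], state: Option[Int]) =>
      Some(values.sum + state.getOrElse(0))

    val counts = ssc.socketTextStream("localhost", 9999)
      .flatMap(_.split(" "))
      .map((_, 1))
      .updateStateByKey[Int](updateFunc)
    counts.print()

    ssc.start()
    ssc.awaitTermination()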
Thanks
Twinkle
On Thursday, June 18, 2015, Nipun Arora
Thanks Saisai.
On Wed, May 20, 2015 at 11:23 AM, Saisai Shao
wrote:
> I think here is the PR https://github.com/apache/spark/pull/2994 you
> could refer to.
>
> 2015-05-20 13:41 GMT+08:00 twinkle sachdeva :
Hi,
As Spark Streaming is nicely integrated with consuming messages from
Kafka, I thought of asking the forum: is there any implementation
available for pushing data to Kafka from Spark Streaming too?
Any link(s) will be helpful.
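In the absence of a built-in sink, the usual hand-rolled approach I know
of is to open a producer per partition inside foreachRDD. A minimal
sketch, assuming a DStream[(String, Int)] named counts and placeholder
broker/topic names:

    import java.util.Properties
    import org.apache.kafka.clients.producer.{KafkaProducer, ProducerRecord}

    counts.foreachRDD { rdd =>
      rdd.foreachPartition { records =>
        // producers are not serializable, so create one per partition
        // on the executor rather than on the driver
        val props = new Properties()
        props.put("bootstrap.servers", "broker1:9092")
        props.put("key.serializer",
          "org.apache.kafka.common.serialization.StringSerializer")
        props.put("value.serializer",
          "org.apache.kafka.common.serialization.StringSerializer")
        val producer = new KafkaProducer[String, String](props)
        records.foreach { case (word, count) =>
          producer.send(new ProducerRecord("output-topic", word, count.toString))
        }
        producer.close()
      }
    }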
Thanks and Regards,
Twinkle
Hi,
Can you please share the compression and related settings you are using?
Thanks,
Twinkle
On Wed, May 6, 2015 at 4:15 PM, Jianshi Huang
wrote:
> I'm facing this error in Spark 1.3.1
>
> https://issues.apache.org/jira/browse/SPARK-4105
>
> Anyone knows what's t
Somebody please comment on whether it is a bug or some intended behaviour
w.r.t. performance or some other bottleneck.
--Twinkle
On Mon, Apr 20, 2015 at 2:47 PM, Archit Thakur
wrote:
> Hi Twinkle,
>
> We have a use case where we want to debug how and why an
> executor got killed
Hi Archit,
What is your use case and what kind of metrics are you planning to add?
Thanks,
Twinkle
On Fri, Apr 17, 2015 at 4:07 PM, Archit Thakur
wrote:
> Hi,
>
> We are planning to add new metrics in Spark for the executors that got
> killed during execution. Was just curio
Hi,
In one of the applications we have built, which had no clone-related
logic, we set the value of spark.storage.memoryFraction very low, and
yes, that gave us performance benefits.
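A minimal sketch of that tuning (0.1 is purely illustrative, not a
recommendation; the default in Spark 1.x is 0.6):

    import org.apache.spark.{SparkConf, SparkContext}

    val conf = new SparkConf()
      .setAppName("LowStorageFraction")
      // shrink the cache region, leaving more heap for execution
      .set("spark.storage.memoryFraction", "0.1")
    val sc = new SparkContext(conf)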
Regarding that issue, you should also look at the data you are trying to
broadcast, as sometimes creating that data st
Hi,
If you have the same SparkContext, then you can cache the query result by
caching the table ( sqlContext.cacheTable("tableName") ).
Maybe you can have a look at Ooyala's Spark Job Server also.
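A minimal sketch of that pattern, with placeholder table and column
names, assuming the same SQLContext is reused across the queries:

    import org.apache.spark.sql.SQLContext

    val sqlContext = new SQLContext(sc) // sc: an existing SparkContext
    val ok = sqlContext.sql("SELECT * FROM events WHERE status = 'ok'")
    ok.registerTempTable("okEvents")
    sqlContext.cacheTable("okEvents") // materialized in memory on first use
    sqlContext.sql("SELECT COUNT(*) FROM okEvents").collect().foreach(println)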
On Tue, Apr 14, 2015 at 11:36 AM, Akhil Das
wrote:
> You can use a tachyon based storage for that and eve
of this setting? ( which again made me think over this setting ).
Comments please.
Thanks,
Twinkle
to a window
duration.
I will upload the PR shortly.
Thanks,
Twinkle
On Tue, Apr 7, 2015 at 2:02 AM, Sandy Ryza wrote:
> What's the advantage of killing an application for lack of resources?
>
> I think the rationale behind killing an app based on executor failures is
> that, if we
f time, then we should fail the application.
Adding a time factor here will allow some window for Spark to get more
executors allocated if some of them fail.
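To make it concrete, an illustrative sketch of the windowed counting
logic I have in mind (names and threshold handling are just for
illustration; this is not existing Spark code):

    import scala.collection.mutable

    class WindowedFailureTracker(maxFailures: Int, windowMs: Long) {
      private val failureTimes = mutable.Queue[Long]()

      /** Record one executor failure; true means the app should fail. */
      def onExecutorFailure(nowMs: Long): Boolean = {
        failureTimes.enqueue(nowMs)
        // forget failures that have fallen out of the window
        while (failureTimes.nonEmpty && nowMs - failureTimes.head > windowMs)
          failureTimes.dequeue()
        failureTimes.size > maxFailures
      }
    }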
Thoughts please.
Thanks,
Twinkle
On Wed, Apr 1, 2015 at 10:19 PM, Sandy Ryza wrote:
> That's a good question, Twinkle.
>
> O
t at some point even a single executor
failure ( which the application could have survived ) can make the
application quit.
Sending it to the community to hear what kind of behaviour / strategy
they think will be suitable for long-running Spark jobs or Spark
Streaming jobs.
Thanks and Regards,
Twinkle
ed to the
> Spark cluster based on the priority. Jobs with lower priority or below
> some threshold will be discarded.
>
> Thanks,
> Abhi
>
>
> On Mon, Mar 16, 2015 at 10:36 PM, twinkle sachdeva <
> twinkle.sachd...@gmail.com> wrote:
>
>> Hi Abhi,
>> Thanks,
>> Abhi
>>
>>
>> On Mon, Mar 16, 2015 at 9:48 PM, twinkle sachdeva <
>> twinkle.sachd...@gmail.com> wrote:
Hi,
Maybe this is what you are looking for :
http://spark.apache.org/docs/1.2.0/job-scheduling.html#fair-scheduler-pools
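A minimal sketch of using pools from that page (the pool name is a
placeholder and assumes a matching pool in fairscheduler.xml; otherwise
a pool with default settings is created on first use):

    import org.apache.spark.{SparkConf, SparkContext}

    val conf = new SparkConf()
      .setAppName("FairPools")
      .set("spark.scheduler.mode", "FAIR")
    val sc = new SparkContext(conf)

    // jobs submitted from this thread go to the named pool
    sc.setLocalProperty("spark.scheduler.pool", "highPriority")
    sc.parallelize(1 to 1000).count()
    sc.setLocalProperty("spark.scheduler.pool", null) // back to the default pool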
Thanks,
On Mon, Mar 16, 2015 at 8:15 PM, abhi wrote:
> Hi
> Currently all the jobs in Spark get submitted using a queue. I have a
> requirement where a submitted job will genera
disassociated*
How can I make this happen faster?
Thanks,
Twinkle
wrote:
> Mostly, that particular executor is stuck on a GC pause. What operation are
> you performing? You can try increasing the parallelism if you see only 1
> executor doing the task.
>
> Thanks
> Best Regards
>
> On Fri, Feb 27, 2015 at 11:39 AM, twinkle sachdeva <
nager BlockManagerId(7, TMO-DN73, 34106) with no recent heart beats:
80515ms exceeds 45000ms
I am using Spark 1.2.1.
Any pointer(s) ?
Thanks,
Twinkle
Hi,
What file format is used to write files during shuffle write?
Does it depend on the Spark shuffle manager or on the output format?
Is it possible to change the file format used for shuffle, irrespective
of the output format of the file?
Thanks,
Twinkle
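For context, the shuffle files themselves are an internal binary format,
independent of the job's output format; a sketch of the knobs that do
exist around them in Spark 1.x (the values shown are the usual defaults,
for illustration only):

    import org.apache.spark.SparkConf

    val conf = new SparkConf()
      .set("spark.shuffle.manager", "sort")        // or "hash"
      .set("spark.shuffle.compress", "true")       // compress map outputs
      .set("spark.io.compression.codec", "snappy") // codec for shuffle files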
; the Hadoop Configuration to suggest a max/min split size, and
> therefore bound the number of partitions you get back.
>
> On Thu, Feb 19, 2015 at 11:07 AM, twinkle sachdeva
> wrote:
> > Hi,
> >
> > In our job, we need to process the data in small chunks, so as to avoid
from the older API?
I am a little bit aware of the split-size stuff, but not much aware of
whether there is any promise that the minimum-number-of-partitions
criterion gets satisfied or not.
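For reference, a minimal sketch of both handles (paths and sizes are
placeholders; as far as I know, minPartitions is passed to the
InputFormat as a hint, not a hard guarantee):

    // ask for at least ~200 partitions when reading
    val rdd = sc.textFile("hdfs:///data/input", 200)

    // with the newer mapreduce API, split sizes can be bounded explicitly
    sc.hadoopConfiguration.set(
      "mapreduce.input.fileinputformat.split.maxsize",
      (64 * 1024 * 1024).toString) // cap each split at 64 MB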
Any pointers will be of help.
Thanks,
Twinkle
Hi,
Try running the following in the Spark folder:
bin/run-example SparkPi 10
If this runs fine, just look at the set of arguments being passed by this
script, and try in a similar way.
Thanks,
On Thu, Oct 16, 2014 at 2:59 PM, Christophe Préaud <
christophe.pre...@kelkoo.com> wrote:
> Hi,
>
> I ha
Hi,
I have been using Spark SQL with YARN.
It works fine in yarn-client mode, but in yarn-cluster mode we are
facing 2 issues. Is yarn-cluster mode not recommended for Spark SQL
using HiveContext?
*Problem #1*
We are not able to use any query with a very simple filtering operation
"like",
Hi,
Can somebody please share the plans regarding Java version support for
Apache Spark 1.2.0 or near-future releases?
Will Java 8 become the fully supported version in Apache Spark 1.2, or
will Java 1.7 suffice?
Thanks,
, 2014 at 5:13 PM, Cheng Lian wrote:
> Hi Twinkle,
>
> The failure is caused by case sensitivity. The temp table actually stores
> the original un-analyzed logical plan, thus field names remain capital (F1,
> F2, etc.). I believe this issue has already been fixed by PR #2382
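A minimal sketch of the pattern being discussed, with placeholder table
and field names, to show where the capitalization bites:

    val q1 = sqlContext.sql("SELECT f1, f2 FROM source")
    q1.registerTempTable("q1Result")
    // before the fix referenced above, this second query could fail to
    // resolve the fields, because the temp table kept the un-analyzed
    // plan with the original capital names (F1, F2)
    val q2 = sqlContext.sql("SELECT f1 FROM q1Result WHERE f2 > 0")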
Hi,
I am using HiveContext to fire SQL queries inside Spark. I have created
a SchemaRDD ( let's call it cachedSchema ) inside my code.
If I fire a SQL query ( Query 1 ) on top of it, then it works.
But if I refer to Query 1's result inside another SQL query, that fails.
Note that I have already regi
Hi,
Has anyone else also experienced
https://issues.apache.org/jira/browse/SPARK-2604?
It is an edge-case scenario of misconfiguration, where the executor
memory requested is the same as the maximum memory allowed by YARN. In
such a situation, the application stays in a hung state, and the reason
is not logged
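An illustrative sketch of the missing check (not Spark's actual code;
the numbers are placeholders): the executor memory plus the YARN memory
overhead must fit within YARN's maximum allocation, otherwise the
container request can never be satisfied and the application waits
forever:

    val executorMemoryMb = 8192 // spark.executor.memory
    val overheadMb = 384        // spark.yarn.executor.memoryOverhead
    val yarnMaxAllocMb = 8192   // yarn.scheduler.maximum-allocation-mb
    require(executorMemoryMb + overheadMb <= yarnMaxAllocMb,
      s"executor memory ${executorMemoryMb}m + overhead ${overheadMb}m " +
      s"exceeds YARN max allocation ${yarnMaxAllocMb}m")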