Yes, please if you can open a bug JIRA that'd be great!

On Wed, Jun 19, 2019 at 7:51 PM David Crespi <david.cre...@storedgesystems.com> wrote:
> Patrick, would you like me to open up a bug, or do you have this?
>
> Regards,
>
> David
>
> ________________________________
> From: Patrick Stuedi <pstu...@gmail.com>
> Sent: Wednesday, June 19, 2019 6:40:10 AM
> To: dev@crail.apache.org
> Cc: Jonas Pfefferle; d...@crail.incubator.apache.org
> Subject: Re: Crail used as type 2 storage for TeraSort does not catch the "finished" signal
>
> Ok. The closing issue you reported is a bug I think. And the changes of
> Adrian are targeting the shuffle plugin only, so they're not fixing the
> closing issue. We need to look into this more carefully, right now there is
> no easy fix I guess, sorry.
>
> -Patrick
>
> On Wed, Jun 19, 2019 at 3:32 PM David Crespi <david.cre...@storedgesystems.com> wrote:
>
> > Hi Patrick, I am attempting to use the crail shuffle plugin… different
> > thread.
> >
> > This was just about using crail for input/output. I have HDFS set up
> > alongside, to also benchmark the different configs.
> >
> > Regards,
> >
> > David
> >
> > ________________________________
> > From: Patrick Stuedi <pstu...@gmail.com>
> > Sent: Wednesday, June 19, 2019 6:29:11 AM
> > To: dev@crail.apache.org
> > Cc: Jonas Pfefferle; d...@crail.incubator.apache.org
> > Subject: Re: Crail used as type 2 storage for TeraSort does not catch the "finished" signal
> >
> > The closing issue is related to using Crail for input/output. The changes
> > Adrian made just earlier today are changes on the shuffle plugin. Are you
> > using the Crail shuffle plugin at all? If not then the changes of Adrian
> > are not relevant to you.
> >
> > -Patrick
> >
> > On Wed, Jun 19, 2019 at 3:22 PM David Crespi <david.cre...@storedgesystems.com> wrote:
> >
> > > Thanks for the description. It seemed odd that it behaved this way as
> > > HDFS does close as expected, so I wasn’t sure. Wouldn’t this change the
> > > Terasort benchmark numbers?
> > >
> > > Regards,
> > >
> > > David
> > >
> > > C: 714-476-2692
> > >
> > > ________________________________
> > > From: Jonas Pfefferle <peppe...@japf.ch>
> > > Sent: Wednesday, June 19, 2019 12:17:30 AM
> > > To: dev@crail.apache.org; David Crespi; d...@crail.incubator.apache.org
> > > Subject: Re: Crail used as type 2 storage for TeraSort does not catch the "finished" signal
> > >
> > > Hi David,
> > >
> > > Unfortunately, if you use Crail for input/output with Spark this is
> > > expected. The problem is Spark never closes the filesystem correctly. I
> > > haven't looked into this lately but if I remember correctly there was no
> > > easy way otherwise to determine Spark is about to close.
> > >
> > > Regards,
> > > Jonas
> > >
> > > On Tue, 18 Jun 2019 22:17:16 +0000
> > > David Crespi <david.cre...@storedgesystems.com> wrote:
> > > > Hi,
> > > > I’m running Crail as the temporary backend storage for Terasort.
> > > > After each section (TeraGen, TeraSort, TeraVerify) the program waits
> > > > until a Ctrl-C is given, then moves on to the next section. Is this
> > > > the expected behavior, or is this a bug?
> > > >
> > > > Here’s a small snippet of the output. Terasort waits where the
> > > > bolded “Number of records” is listed, until the ^C is given. Each of
> > > > the three programs does the same, but the program does finish
> > > > without errors.
> > > >
> > > > 19/06/18 15:13:19 DEBUG TaskSchedulerImpl: parentName: , name: TaskSet_1.0, runningTasks: 1
> > > > 19/06/18 15:13:19 INFO TaskSetManager: Finished task 1.0 in stage 1.0 (TID 3) in 142 ms on 192.168.3.10 (executor 4) (1/2)
> > > > 19/06/18 15:13:19 INFO BlockManagerInfo: Added broadcast_1_piece0 in memory on 192.168.3.12:34011 (size: 1825.0 B, free: 366.3 MB)
> > > > 19/06/18 15:13:19 DEBUG TaskSchedulerImpl: parentName: , name: TaskSet_1.0, runningTasks: 0
> > > > 19/06/18 15:13:19 INFO TaskSetManager: Finished task 0.0 in stage 1.0 (TID 2) in 977 ms on 192.168.3.12 (executor 3) (2/2)
> > > > 19/06/18 15:13:19 INFO TaskSchedulerImpl: Removed TaskSet 1.0, whose tasks have all completed, from pool
> > > > 19/06/18 15:13:19 INFO DAGScheduler: ResultStage 1 (count at TeraGen.scala:94) finished in 0.995 s
> > > > 19/06/18 15:13:19 DEBUG DAGScheduler: After removal of stage 1, remaining stages = 0
> > > > 19/06/18 15:13:19 INFO DAGScheduler: Job 1 finished: count at TeraGen.scala:94, took 1.003537 s
> > > > Number of records written: 10000
> > > > ^C19/06/18 15:13:36 INFO SparkContext: Invoking stop() from shutdown hook
> > > >
> > > > Regards,
> > > >
> > > > David