Re: [Spark SQL] Task failed while writing rows

2016-12-19 Thread Michael Stratton
> , but rare, cases have been reported before, e.g. https://issues.apache.org/jira/browse/HDFS-770 which is *very* old.
>
> If you’re pretty sure Spark couldn’t be responsible for issues at this level I’ll stick to the Hadoop mailing list.
>
> Thanks

Re: [Spark SQL] Task failed while writing rows

2016-12-19 Thread Michael Stratton
It seems like an issue w/ Hadoop. What do you get when you run hdfs dfsadmin -report? Anecdotally (and w/o specifics, as it has been a while), I've generally used Parquet instead of ORC, as I've run into a bunch of random problems reading and writing ORC w/ Spark... but given ORC performs a lot better

Re: unit testing in spark

2016-12-09 Thread Michael Stratton
That sounds great, please include me so I can get involved.
On Fri, Dec 9, 2016 at 7:39 AM, Marco Mistroni wrote:
> Me too as I spent most of my time writing unit/integ tests pls advise on where I can start
> Kr
>
> On 9 Dec 2016 12:15 am, "Miguel Morales" wrote:
>
>> I would be interes