, but rare, cases have been reported before,
> e.g. https://issues.apache.org/jira/browse/HDFS-770 which is *very* old.
>
>
>
> If you’re pretty sure Spark couldn’t be responsible for issues at this
> level I’ll stick to the Hadoop mailing list.
>
>
>
> Thanks
>
>
It seems like an issue w/ Hadoop. What do you get when you run hdfs
dfsadmin -report?
Anecdotally(And w/o specifics as it has been a while), I've generally used
Parquet instead of ORC as I've gotten a bunch of random problems reading
and writing ORC w/ Spark... but given ORC performs a lot better
That sounds great, please include me so I can get involved.
On Fri, Dec 9, 2016 at 7:39 AM, Marco Mistroni wrote:
> Me too as I spent most of my time writing unit/integ tests pls advise
> on where I can start
> Kr
>
> On 9 Dec 2016 12:15 am, "Miguel Morales" wrote:
>
>> I would be interes