See below: > On May 27, 2015, at 12:17 PM, Matt <[email protected]> wrote: > > Attempting to create a Parquet backed table with a CTAS from an 44GB tab > delimited file in HDFS. The process seemed to be running, as CPU and IO was > seen on all 4 nodes in this cluster, and .parquet files being created in the > expected path. > > In however in the last two hours or so, all nodes show near zero CPU or IO, > and the Last Modified date on the .parquet have not changed. Same time delay > shown in the Last Progress column in the active fragment profile.
Did you happen to notice the Last Update column in the profile? If so, was there a time delay in that too? > > What approach can I take to determine what is happening (or not)? >
