Attempting to create a Parquet backed table with a CTAS from an 44GB tab
delimited file in HDFS. The process seemed to be running, as CPU and IO
was seen on all 4 nodes in this cluster, and .parquet files being
created in the expected path.
In however in the last two hours or so, all nodes show near zero CPU or
IO, and the Last Modified date on the .parquet have not changed. Same
time delay shown in the Last Progress column in the active fragment
profile.
What approach can I take to determine what is happening (or not)?