It sounds like the TabletServer on the other end trying to service your bulkimport request failed. A first start would be to go to that server which "IO was stuck" on and see if it's complaining about anything in the logs.

I believe the failures.txt file is a way for your client to know which files bulk import failed on (so you can retry them yourself).

Versions of Hadoop and Accumulo would also be very helpful in trying to help you debug what's happening.

On 11/6/13, 9:04 PM, Korb, Michael [USA] wrote:
Sometimes when I try to run importdirectory on Rfiles, the thread hangs
and eventually fails. The shell says, "WARN : Thread 'shell' stuck on IO
to …" and the Recent Logs in the UI say "Thread 'bulk import XX' stuck
on IO" and "rpc failed server …
org.apache.thrift.transport.TTransportException …"

Sometimes it puts the Rfiles in failures, and sometimes it writes a text
file failures.txt in failures, where failures.txt contains the location
of an Rfile in HDFS under the Accumulo data directory.

Is there any way to fix this Thrift error so I can complete bulk ingest?
Also, what does failures.txt mean? It looks like the Rfile is in the
right place. I would greatly appreciate any help with these issues.

Thanks,
Mike

Reply via email to