> > I am thinking to write the results to a file first, then read and persist to HBase from the file, to avoid this. The failover would work as Hadoop will throw out parts of a file that are not marked as completed. Though this does put a lot of extra IO on the cluster.
Bryan,

Have you considered writing your MR output with HFileOutputFormat and then asking the regions to adopt the resulting files? That would let you avoid committing any changes to HBase until you know that the MR job ran successfully.

Leif
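P.S. In case the shape of the idea is not obvious, here is a rough sketch of "stage the output, adopt it only on success". It deliberately uses plain local files and java.nio rather than any HBase or Hadoop API, so treat it as an analogy only. The names StagedCommit, runJob, tempDir and the part-00000 file name are made up for illustration, and the failMidway flag just simulates a task dying halfway through.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardCopyOption;
import java.util.Arrays;
import java.util.List;

class StagedCommit {
    // Hypothetical file name, loosely echoing Hadoop's part-file naming.
    static final String PART = "part-00000";

    // Small helper so callers do not have to handle checked exceptions.
    static Path tempDir(String prefix) {
        try {
            return Files.createTempDirectory(prefix);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    // Writes all rows to a file in stagingDir. Only if that succeeds is the
    // file moved into liveDir. Returns true when the output was adopted.
    static boolean runJob(List<String> rows, Path stagingDir, Path liveDir,
                          boolean failMidway) {
        Path staged = stagingDir.resolve(PART);
        try {
            StringBuilder out = new StringBuilder();
            for (int i = 0; i < rows.size(); i++) {
                if (failMidway && i == rows.size() / 2) {
                    throw new IOException("simulated task failure");
                }
                out.append(rows.get(i)).append('\n');
            }
            Files.write(staged, out.toString().getBytes(StandardCharsets.UTF_8));
        } catch (IOException e) {
            // The job failed: drop anything staged and leave liveDir untouched.
            try {
                Files.deleteIfExists(staged);
            } catch (IOException ignored) {
                // Nothing more to do for a sketch.
            }
            return false;
        }
        try {
            // The job succeeded: adopt the finished file in a single step.
            Files.move(staged, liveDir.resolve(PART), StandardCopyOption.ATOMIC_MOVE);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return true;
    }

    public static void main(String[] args) {
        Path staging = tempDir("staging");
        Path live = tempDir("live");
        List<String> rows = Arrays.asList("row1=a", "row2=b");

        System.out.println("failed run adopted: " + runJob(rows, staging, live, true));
        System.out.println("live file exists:   " + Files.exists(live.resolve(PART)));
        System.out.println("clean run adopted:  " + runJob(rows, staging, live, false));
        System.out.println("live file exists:   " + Files.exists(live.resolve(PART)));
    }
}
```

The point is only that the live directory never sees partial output: a failed run leaves it exactly as it was, and a successful run shows up all at once. Unlike your write-then-reload plan, the finished file is adopted as-is rather than read back and rewritten, which is where I would expect the IO saving to come from.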
