Hi all,

I have a scenario where an Avro binary is being stored as a BLOB in an
RDBMS. What's the recommended approach for querying this in bulk,
extracting this specific field, and batching it to HDFS?

   1. GenerateTableFetch OR QueryDatabaseTableRecord
   2. Extract Avro column and assemble output <-- How?
   3. MergeRecord
   4. PutHDFS

Additional clarification is that ultimately I want to make the Avro exactly
as it is (content wise), store in HDFS, with an external Hive table on top.

Thanks,
Jason

Reply via email to