Let us say my log data that I want to place a log file into hive.  And the log 
file itself looks something like this:

Event_time, event_type, event_data_blob

And the blob data looks like
"Key1=value1;key2=value2;key3=value3 ... keyn=valuen"

This looks like maybe I start like this:

Create table my_log(
Event_time STRING,
Event_type INT
Event_blob MAP<string,string>

How am I doing so far?

Then, I know I need to have some reformatting some on the blob to get it into 
the 'right' format, as I doubt the equals sign '=' works as a key to key value 
map in Hive, and I suspect the semi-colon ';' isn't what hive/hadoop wants for 
a key, value pair separator either.

What I don't know is what format to place the blob data into so I can then load 
it into a hive table.

Anyone got some pointers?

---
Mark E. Sunderlin
Solutions Architect |AOL Data Warehouse
P: 703-256-6935 | C: 540-327-6222
AIM: MESunderlin
22000 AOL Way | Dulles, VA | 20166
[aol-comp-logo]

<<inline: image001.gif>>

Reply via email to