Is anybody using Hawq in production? Today I was thinking about speed and what
would be faster. Placing stuff on HDFS or inserts into a distributed database.
Coming from a structured data background, I haven't entirely wrapped my head
around storing data in plain text format. I know you can use stuff like Avro
and Parquet to enforce schema but it's still just binary data on disk without
all the grantees that you've come to expect from relational databases over the
past 20 years. In a perfect world, I'd like to have all the awesomeness of HDFS
but ease of use of relational databases. My question is, are we there yet? Can
we leave behind HDFS (or just be abstracted away from it) and design high speed
BI systems without all the extra IQ points required to deal with writing and
reading to HDFS?
B.