Hi. I am confused on the difference between the Serializer/Deserializer
and InputFormat/OutputFormat in the CREATE TABLE command.

For example, let=E2=80=99s say I am using Sequence files, but my records
are of some unknown type not known to Hive (i.e. not Thrift, Protocol
Buffer, etc.). Could I still use the SequenceFileInputFormat and
SequenceFileOutputFormat, and then specific my own custom SerDe? Or is
the SerDe tied to a specific file format?

FYI, this is related to me trying to use Parquet in Hive, but our
records are our own custom format. I am not sure if I can just do STORED
AS PARQUET, or which of InputFormat/OutputFormat/SerDe I need to use to
get this to work.

Any help or pointers would be much appreciated. Thank you

Reply via email to