Best Practice: Acquire Data from external sources

Markus Resch Fri, 16 Mar 2012 07:25:49 -0700

Hey everyone,

thanks for answering my last question so quickly, simply and completely.
You guys are awesome!
But to keep you excited I'll go on with my next question:


I need to acquire additional information from external sources to
process my data properly.
My idea was to do this by writing a dedicated data store which will
run e.g. some SQL statements against external databases which might
contain results from earlier native Pig runs. The results of this
external query could be stored on Hadoop using the given default data
stores and returned to the caller of LOAD as a common relation.
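
To make the idea a bit more concrete, here is a rough sketch of the
"query the external source and materialize the result" half, written
in Python rather than as a Pig load/store function. Everything in it
is an assumption for illustration only: sqlite3 stands in for the
external database, and the function name export_query_to_tsv and the
table former_results are made up.

```python
import csv
import sqlite3


def export_query_to_tsv(conn, query, out_path):
    """Run `query` on `conn` and write every result row to `out_path`
    as one tab-separated line. Returns the number of rows written.

    `conn` is any DB-API style connection that offers `execute`;
    sqlite3 is only a stand-in for the real external database.
    """
    cursor = conn.execute(query)
    row_count = 0
    with open(out_path, "w", newline="") as handle:
        # Tab-delimited text, one record per line.
        writer = csv.writer(handle, delimiter="\t", lineterminator="\n")
        for row in cursor:
            writer.writerow(row)
            row_count += 1
    return row_count


if __name__ == "__main__":
    # Hypothetical external table holding results of an earlier run.
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE former_results (user_id INTEGER, score REAL)")
    conn.executemany(
        "INSERT INTO former_results VALUES (?, ?)",
        [(1, 0.5), (2, 0.75)],
    )
    written = export_query_to_tsv(
        conn,
        "SELECT user_id, score FROM former_results ORDER BY user_id",
        "former_results.tsv",
    )
    print(written, "rows written")
```

The resulting file could then be copied to HDFS and read with a plain
LOAD, since PigStorage splits fields on tabs by default. Whether doing
this as a separate step or inside a custom loader is better is exactly
what I am unsure about.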

My question about this is: Does this make sense? Especially from an
optimization point of view?

I'm curious about your opinions.

Thanks
Markus
