Also -- I hope your cluster is not too big. It's really easy to DDOS your database using hadoop.
On Thu, Jul 1, 2010 at 9:47 AM, Dmitriy Ryaboy <[email protected]> wrote: > The simplest thing you can do is to have database handle at the object > level, set it to null, and just initialize it in eval() if you see that it's > null. > You can also init the connection in the constructor. > A static dbh will let you share it across tasks, if you persist the jvm. > Naturally you will want to throw in some code to handle dropped connections > and all that. > > > > On Thu, Jul 1, 2010 at 9:01 AM, Dave Viner <[email protected]> wrote: > >> In a custom UDF, what's the most appropriate way to initialize and connect >> to a old-fashioned rdbms? >> >> I wrote a simple UDF which opens/closes a connection on each exec(), but >> this feels a bit like overkill. Is there an "init()" method that is >> invoked >> in a UDF to help with one-time initialization (like a database connection >> or >> sql query preparation)? >> >> Thanks >> Dave Viner >> > >
