Re: hadoop in the ETL process

2008-07-08 Thread Andreas Kostyrka
On Wednesday 02 July 2008 19:51:57 David J. O'Dell wrote:
> Is anyone using hadoop for any part of the ETL process?
>
> Given its ability to process large amounts of log files this seems like
> a good fit.
Well, we are doing the following data flow:
1.) webservers upload to S3
2.) hadoop jobs get

RE: hadoop in the ETL process

2008-07-02 Thread Ryan Lynch
imagine establishing connections inside map/reduce jobs would not be ideal.
Regards,
Ryan
-----Original Message-----
From: Chris K Wensel [mailto:[EMAIL PROTECTED]
Sent: Wednesday, July 02, 2008 11:31 AM
To: core-user@hadoop.apache.org
Subject: Re: hadoop in the ETL process
If you're referring to

Re: hadoop in the ETL process

2008-07-02 Thread Chris K Wensel
If you're referring to loading an RDBMS with data on Hadoop, this is doable, but you will need to write your own JDBC adapters for your tables. But you might review what you are using the RDBMS for and see if those jobs would be better off running on Hadoop entirely, if not for most of the p
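For illustration only, here is a rough sketch of the "load an RDBMS from Hadoop output" step. It is not the adapter described above, which would be written against JDBC in Java. It assumes the job has already written tab-separated `key<TAB>count` lines, and it runs as a separate step after the job finishes rather than opening database connections inside map or reduce tasks. The table name `page_hits`, the column names, the line format, and the use of SQLite as a stand-in for the real RDBMS are all hypothetical.

```python
import sqlite3
from typing import Iterable, Iterator, List, Tuple


def parse_rows(lines: Iterable[str]) -> Iterator[Tuple[str, int]]:
    """Yield (key, count) pairs from tab-separated lines, skipping malformed ones."""
    for line in lines:
        parts = line.rstrip("\n").split("\t")
        if len(parts) != 2:
            continue  # not exactly "key<TAB>count"
        key, value = parts
        try:
            yield key, int(value)
        except ValueError:
            continue  # count column is not an integer


def load_rows(conn: sqlite3.Connection,
              rows: Iterable[Tuple[str, int]],
              batch_size: int = 1000) -> int:
    """Insert rows in batches and return the number of rows written."""
    # Hypothetical target table; a real schema would come from the RDBMS side.
    conn.execute(
        "CREATE TABLE IF NOT EXISTS page_hits (url TEXT PRIMARY KEY, hits INTEGER)"
    )
    insert_sql = "INSERT OR REPLACE INTO page_hits (url, hits) VALUES (?, ?)"
    batch: List[Tuple[str, int]] = []
    loaded = 0
    for row in rows:
        batch.append(row)
        if len(batch) >= batch_size:
            conn.executemany(insert_sql, batch)
            conn.commit()  # one commit per batch keeps transactions small
            loaded += len(batch)
            batch = []
    if batch:  # flush the final partial batch
        conn.executemany(insert_sql, batch)
        conn.commit()
        loaded += len(batch)
    return loaded


if __name__ == "__main__":
    sample = ["/index.html\t42\n", "/about.html\t7\n", "garbage\n"]
    connection = sqlite3.connect(":memory:")
    print(load_rows(connection, parse_rows(sample)))
```

Batching the inserts and committing per batch keeps the number of round trips and the transaction size bounded, which matters more as the job output grows.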

hadoop in the ETL process

2008-07-02 Thread David J. O'Dell
Is anyone using hadoop for any part of the ETL process?
Given its ability to process large amounts of log files this seems like a good fit.
--
David O'Dell
Director, Operations
e: [EMAIL PROTECTED]
t: (415) 738-5152
180 Townsend St., Third Floor
San Francisco, CA 94107