Re: HADOOP-2536 supports Oracle too?

2009-02-16 Thread Fredrik Hedberg
Hi, Although it's not MySQL, this might be of use: http://svn.apache.org/repos/asf/hadoop/core/trunk/src/examples/org/apache/hadoop/examples/DBCountPageView.java Fredrik On Feb 16, 2009, at 8:33 AM, sandhiya wrote: @Amandeep Hi, I'm new to Hadoop and am trying to run a simple database

Pluggable JDBC schemas [Was: How to use DBInputFormat?]

2009-02-13 Thread Fredrik Hedberg
Hi, Please let us know how this works out. Also, it would be nice if people with experience of RDBMSs other than MySQL and Oracle could comment on the syntax and performance of their respective RDBMS with regard to Hadoop. Even if the syntax of the current SQL queries is valid for

Re: How to use DBInputFormat?

2009-02-06 Thread Fredrik Hedberg
and will consume the complete file. IMHO the DBInputFormat should follow this behavior and just create 1 split whatsoever. Maybe a future version of Hadoop will allow creating splits/map tasks dynamically, on the fly? Stefan On Thu, Feb 5, 2009 at 3:28 PM, Fredrik Hedberg fred...@avafan.com wrote

Re: How to use DBInputFormat?

2009-02-06 Thread Fredrik Hedberg
problem, whereas HBase and HDFS are really the answer to the kind of problem you're hinting at. Fredrik On Feb 6, 2009, at 4:06 PM, Stefan Podkowinski wrote: On Fri, Feb 6, 2009 at 2:40 PM, Fredrik Hedberg fred...@avafan.com wrote: Well, that obviously depends on the RDBMS' implementation

Re: How to use DBInputFormat?

2009-02-05 Thread Fredrik Hedberg
Indeed sir. The implementation was designed like you describe for two reasons. First and foremost, to make it as simple as possible for the user to use a JDBC database as input and output for Hadoop. Secondly, because of the specific requirements the MapReduce framework brings to the table

Re: JDBC input/output format

2008-12-12 Thread Fredrik Hedberg
I highly doubt using Hadoop for that would be the most efficient solution, unless you have a sharded database infrastructure and extend the Hadoop database input/output format accordingly. - Fredrik On Dec 12, 2008, at 5:26 AM, Edward J. Yoon wrote: Does anyone think about database to