Hi,
Although it's not MySQL; this might be of use:
http://svn.apache.org/repos/asf/hadoop/core/trunk/src/examples/org/apache/hadoop/examples/DBCountPageView.java
Fredrik
On Feb 16, 2009, at 8:33 AM, sandhiya wrote:
@Amandeep
Hi,
I'm new to Hadoop and am trying to run a simple database
Hi,
Please let us know how this works out. Also, it would be nice if
people with experience with other RDMBS than MySQL and Oracle could
comment on the syntax and performance of their respective RDBMS with
regard to Hadoop. Even if the syntax of the current SQL queries are
valid for
and will consume the complete file. IMHO the DBInputFormat
should follow this behavior and just create 1 split whatsoever.
Maybe a future version of hadoop will allow to create splits/map tasks
on the fly dynamically?
Stefan
On Thu, Feb 5, 2009 at 3:28 PM, Fredrik Hedberg fred...@avafan.com
wrote
problem, whereas HBase and HDFS is really the answer to the kind of
problem your hinting at.
Fredrik
On Feb 6, 2009, at 4:06 PM, Stefan Podkowinski wrote:
On Fri, Feb 6, 2009 at 2:40 PM, Fredrik Hedberg fred...@avafan.com
wrote:
Well, that obviously depend on the RDBMS' implementation
Indeed sir.
The implementation was designed like you describe for two reasons.
First and foremost to make is as simple as possible for the user to
use a JDBC database as input and output for Hadoop. Secondly because
of the specific requirements the MapReduce framework brings to the
table
I highly doubt using Hadoop for that would be the most efficient
solution, unless you have a sharded database infrastructure and
extend the Hadoop database input/output format accordingly.
- Fredrik
On Dec 12, 2008, at 5:26 AM, Edward J. Yoon wrote:
Does anyone think about database to