Re: fyi: A Comparison of Approaches to Large-Scale Data Analysis: MapReduce vs. DBMS Benchmarks

2009-04-15 Thread Jonathan Gray
I agree with you, Andy. This seems to be a great look into what Hadoop MapReduce is not good at. Over in the HBase world, we constantly deal with comparisons like this to RDBMSs, trying to determine if one is better than the other. It's a false choice and completely depends on the use case.

RE: Control over max map/reduce tasks per job

2009-02-04 Thread Jonathan Gray
This sounds good enough for a JIRA ticket to me. -Bryan On Feb 3, 2009, at 11:44 AM, Jonathan Gray wrote: Chris, For my specific use cases, it would be best to be able to set N mappers/reducers per job per node (so I can explicitly say, run at most 2 at a time of this CPU bound
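For readers following this thread, the request above (a cap of N concurrent tasks per job per node) can be sketched as a small toy model. This is only an illustration of the policy under discussion, not Hadoop code and not an existing Hadoop configuration option: the class `NodeSlots`, its methods, and the job names are all hypothetical, and a real TaskTracker tracks map and reduce slots separately.

```python
from collections import defaultdict


class NodeSlots:
    """Toy model of one worker node (hypothetical, not a Hadoop API)."""

    def __init__(self, total_slots):
        self.total_slots = total_slots      # node-wide limit on concurrent tasks
        self.running = defaultdict(int)     # job_id -> tasks currently running here

    def can_run(self, job_id, per_job_cap):
        # A task may start only if the node has a free slot AND this job
        # is still below its own per-node cap.
        used = sum(self.running.values())
        return used < self.total_slots and self.running[job_id] < per_job_cap

    def start(self, job_id, per_job_cap):
        # Returns True if the task was admitted, False if it must wait.
        if not self.can_run(job_id, per_job_cap):
            return False
        self.running[job_id] += 1
        return True

    def finish(self, job_id):
        # Release one slot held by this job, if it holds any.
        if self.running[job_id] > 0:
            self.running[job_id] -= 1


if __name__ == "__main__":
    node = NodeSlots(total_slots=4)
    # Assumed caps: the CPU-bound job may run at most 2 tasks on this node.
    caps = {"cpu_bound": 2, "io_bound": 4}
    for job in ["cpu_bound", "cpu_bound", "cpu_bound", "io_bound"]:
        print(job, node.start(job, caps[job]))
```

In this sketch the third `cpu_bound` task is refused even though the node still has free slots, which is the behaviour asked for in the thread: the cap is per job, so other jobs can still use the remaining slots.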

RE: Control over max map/reduce tasks per job

2009-02-03 Thread Jonathan Gray
that no node can run more than N tasks from that job? Or both? just reconciling the conversation we had last week with this thread. ckw On Feb 3, 2009, at 11:16 AM, Jonathan Gray wrote: All, I have a few relatively small clusters (5-20 nodes) and am having trouble keeping them

Control over max map/reduce tasks per job

2009-02-03 Thread Jonathan Gray
. I wanted to ping the list before filing an issue because it seems like someone may have thought about this in the past. Thanks. Jonathan Gray

RE: Using HDFS for common purpose

2009-01-27 Thread Jonathan Gray
Perhaps what you are looking for is HBase? http://hbase.org HBase is a column-oriented, distributed store that sits on top of HDFS and provides random access. JG -Original Message- From: Rasit OZDAS [mailto:rasitoz...@gmail.com] Sent: Tuesday, January 27, 2009 1:20 AM To:

NameNode fatal crash - 0.18.1

2008-12-15 Thread Jonathan Gray
would I specify to the secondary where primary is located? We're also upgrading to Hadoop 0.19.0 at this time. Thank you for any help. Jonathan Gray

RE: NameNode fatal crash - 0.18.1

2008-12-15 Thread Jonathan Gray
I have fixed the issue with the SecondaryNameNode not contacting primary with the 'dfs.http.address' config option. Other issues still unsolved. -Original Message- From: Jonathan Gray [mailto:jl...@streamy.com] Sent: Monday, December 15, 2008 10:55 AM To: core-user@hadoop.apache.org

Large number of deletes takes out DataNodes

2008-11-16 Thread Jonathan Gray
that a number of the blocks had replication 1. Obviously I can handle this by deleting less at any one time, but it seems like there might be something wrong. With no CPU utilization, why does the datanode not respond to the namenode? Thanks. Jonathan Gray

RE: Why is scaling HBase much simpler than scaling a relational db?

2008-08-27 Thread Jonathan Gray
that's my rant for the week. Hope that provides more clarity than confusion. Jonathan Gray Jonathan Gray wrote: A few very big differences... - HBase/BigTable don't have transactions in the same way that a relational database does. While it is possible (and was just recently

RE: Why is scaling HBase much simpler than scaling a relational db?

2008-08-21 Thread Jonathan Gray
a relational database is never simple. I hope this starts to shed some light on what the differences are. Jonathan Gray Streamy Inc. -Original Message- From: Mork0075 [mailto:[EMAIL PROTECTED] Sent: Thursday, August 21, 2008 8:48 AM To: core-user@hadoop.apache.org; [EMAIL PROTECTED