Re: Hadoop pain points?

2012-03-03 Thread Jie Li
I found a related question here:

http://www.quora.com/Hadoop-Operability/Is-Hadoop-now-easy-to-use-If-not-what-assistance-do-most-users-need

Jie

On Fri, Mar 2, 2012 at 10:55 PM, Russell Jurney russell.jur...@gmail.comwrote:

 +6.28318531

 On Fri, Mar 2, 2012 at 7:35 PM, Leonardo Urbina lurb...@mit.edu wrote:

  +3.14159265358979
 
  Sent from my phone
 
  On Mar 2, 2012, at 6:42 PM, Russell Jurney russell.jur...@gmail.com
  wrote:
 
   +2
  
   Russell Jurney http://datasyndrome.com
  
   On Mar 2, 2012, at 4:38 PM, Mohit Anchlia mohitanch...@gmail.com
  wrote:
  
   +1
  
   On Fri, Mar 2, 2012 at 4:09 PM, Harsh J ha...@cloudera.com wrote:
  
   Since you ask about anything in general, when I forayed into using
   Hadoop, my biggest pain was lack of documentation clarity and
   completeness over the MR and DFS user APIs (and other little points).
  
   It would be nice to have some work done to have one example or
   semi-example for every single Input/OutputFormat, Mapper/Reducer
   implementations, etc. added to the javadocs.
  
   I believe examples and snippets help out a ton (tons more than
   explaining just behavior) to new devs.
  
   On Fri, Mar 2, 2012 at 9:45 PM, Kunaal kunalbha...@alumni.cmu.edu
  wrote:
   I am doing a general poll on what are the most prevalent pain points
  that
   people run into with Hadoop? These could be performance related
  (memory
   usage, IO latencies), usage related or anything really.
  
   The goal is to look for what areas this platform could benefit the
  most
   in
   the near future.
  
   Any feedback is much appreciated.
  
   Thanks,
   Kunal.
  
  
  
   --
   Harsh J
  
 



 --
 Russell Jurney twitter.com/rjurney russell.jur...@gmail.com
 datasyndrome.com



Re: AWS MapReduce

2012-03-03 Thread Mohit Anchlia
I think found answer to this question. However, it's still not clear if
HDFS is on local disk or EBS volumes. Does anyone know?

On Sat, Mar 3, 2012 at 3:54 PM, Mohit Anchlia mohitanch...@gmail.comwrote:

 Just want to check  how many are using AWS mapreduce and understand the
 pros and cons of Amazon's MapReduce machines? Is it true that these map
 reduce machines are really reading and writing from S3 instead of local
 disks? Has anyone found issues with Amazon MapReduce and how does it
 compare with using MapReduce on local attached disks compared to using S3.