hey

2013-04-25 Thread John Stein
http://octpod.net/wp-content/plugins/tv1/yahoo.php?lvook752uzgfxdv --- You, me, five bucks, and a little conversation. -- Reality Bites

Re: POLL: Using YARN or pre-YARN?

2013-04-25 Thread Krishna Kishore Bonagiri
I have been using YARN, i.e. hadopp-2.0.0-alpha to hadoop-2.0.4-alpha, I don't know what you meant by pre-YARN. Thanks, Kishore On Wed, Apr 24, 2013 at 10:41 PM, Otis Gospodnetic otis.gospodne...@gmail.com wrote: Hi, Quick poll, would be great to know how many people are using YARN vs.

Re: POLL: Using YARN or pre-YARN?

2013-04-25 Thread Harsh J
By Pre-YARN Otis probably means MRv1 (i.e. JobTracker, etc.). On Thu, Apr 25, 2013 at 12:18 PM, Krishna Kishore Bonagiri write2kish...@gmail.com wrote: I have been using YARN, i.e. hadopp-2.0.0-alpha to hadoop-2.0.4-alpha, I don't know what you meant by pre-YARN. Thanks, Kishore On Wed,

Re: Automatically mapping a job submitted by a particular user to a specific hadoop map-reduce queue

2013-04-25 Thread Nitin Pawar
the current capacity scheduler guarantees that which users can submit jobs to which queue and other related features. More of which you can read at http://hadoop.apache.org/docs/stable/capacity_scheduler.html but on the hive side, unless you set mapred.job.queue.name on the hive cli, they will be

Comparison between JobClient/JobConf and Job/Configuration

2013-04-25 Thread Kevin Burton
I notice that in some beginning texts on starting a Hadoop MapReduce job sometimes JobClient/JobConf is used and sometimes Job/Configuration is used. I have yet to see anyone comment on the features/benefits of either set of methods. Could someone comment on their preferred method for starting a

Re: Need help when running 2TB data

2013-04-25 Thread Nitin Pawar
Failed task can be because the task failed to report to JT or anything. Did your job finished successfully? If not do you have further logs available? On Apr 25, 2013 5:45 PM, 超级塞亚人 shel...@gmail.com wrote: Hi everybody, I am facing a problem after I ran a 2TB data. It says the job is

Re: Job launch from eclipse

2013-04-25 Thread Han JU
Thanks Shashwat and Mohammad. I'm exporting jars and run that with hadoop jar, but I think we should have better ways. I've tried a lot but launch in Eclipse just doesn't work. I don't really want to hard code jobtracker or hdfs information in my code. Maybe it's a bug in hadoop eclipse plugin?

TaskID

2013-04-25 Thread Bharati
Hi Folks, I need to find the task ID of the map task. Please send me any ideas or suggestions. Thanks, Bharati Sent from my iPad Fortigate Filtered

Re: TaskID

2013-04-25 Thread Chris Nauroth
Hi Bharati, Your mapper class implementation methods have access to a MapContext: http://hadoop.apache.org/docs/current/api/org/apache/hadoop/mapreduce/MapContext.html This is a subclass of TaskAttemptContext:

Re: Query on Cost estimates on Hadoop and Java

2013-04-25 Thread Mohammad Tariq
Hello Sandeep, If you're willing to abide by the terms of the GPL, then yes it's legal. You might wanna have a look at thishttp://www.oracle.com/technetwork/java/javase/downloads/java-se-archive-license-1382604.html . Warm Regards, Tariq https://mtariq.jux.com/

Re: Query on Cost estimates on Hadoop and Java

2013-04-25 Thread Robert Dyer
It isn't GPL. OpenJDK[1] is GPLv2 with a Classpath Exception[2] (which is important). Read more here: http://programmers.stackexchange.com/questions/52534/can-we-use-java-for-commercial-use Also note that Hadoop[3] is licensed under Apache v2[4]. [1] http://openjdk.java.net/legal/ [2]

Re: How to connect to hadoop through ssh tunnel and kerberos authentication

2013-04-25 Thread Daryn Sharp
The important part of the error is Cannot get kdc for realm CORP.EBAY.COMhttp://CORP.EBAY.COM. Check if the gateway's /etc/krb5.conf has an entry for CORP.EBAY.COMhttp://CORP.EBAY.COM in the [realms] section. Or if you actually have appropriate dns service records for kerberos, you can use

Re: Automatically mapping a job submitted by a particular user to a specific hadoop map-reduce queue

2013-04-25 Thread Vinod Kumar Vavilapalli
The 'standard' way to do this is using queu-acls to enforce a particular user to be able to submit jobs to a sub-set of queues and then let the user decide which of that subset of queues he wishes to submit a job to. Thanks, +Vinod Kumar Vavilapalli Hortonworks Inc. http://hortonworks.com/ On

Re: Best way to collect Hadoop logs across cluster

2013-04-25 Thread Mark Kerzner
Thank you for all the advice, it was indeed very useful. Mark On Thu, Apr 18, 2013 at 11:44 PM, Roman Shaposhnik r...@apache.org wrote: On Thu, Apr 18, 2013 at 9:23 PM, Mark Kerzner mark.kerz...@shmsoft.com wrote: Hi, my clusters are on EC2, and they disappear after the cluster's

Re: How to connect to hadoop through ssh tunnel and kerberos authentication

2013-04-25 Thread Jeff Zhang
Yes, I have the entry for CORP.EBAY.COM here's krb5.conf [libdefaults] noaddresses = true default_realm = CORP.EBAY.COM ticket_lifetime = 36000 renew_lifetime = 604800 default_tgs_enctypes = aes256-cts aes128-cts arcfour-hmac-md5 des-cbc-md5 des-cbc-crc default_tkt_enctypes = aes256-cts

Re: Comparison between JobClient/JobConf and Job/Configuration

2013-04-25 Thread Arun C Murthy
Actually, if you care about being binary compatible b/w hadoop-1 and hadoop-2, stay with old mapred API. On Apr 25, 2013, at 10:04 AM, Sandy Ryza wrote: Hi Kevin, JobClient/JobConf is the old mapred API and Job/Configuration is the improved new API, so in general the latter is preferred.