Significance of file.out.index during Shuffle Phase ?

2012-08-18 Thread Pavan Kulkarni
? Is the *file.out.index* play any role? Any help is appreciated .Thanks -- --With Regards Pavan Kulkarni

Re: Checksum Error during Reduce Phase hadoop-1.0.2

2012-08-16 Thread Pavan Kulkarni
, 2012 at 11:43 PM, Pavan Kulkarni pavan.babu...@gmail.com wrote: Hi, I am running a Terasort with a cluster of 8 nodes.The map phase completes but when the reduce phase is around 68-70% I get this following error. * 12/08/10 11:02:36 INFO mapred.JobClient: Task Id

Re: Checksum Error during Reduce Phase hadoop-1.0.2

2012-08-16 Thread Pavan Kulkarni
be caused by a corrupt disk - which is why checking if it's happening on a specific node(s) can help. Arun On Aug 16, 2012, at 10:04 AM, Pavan Kulkarni wrote: Harsh, I see this on couple of nodes.But what may be the cause of this error ?Any idea about it? Thanks On Sun, Aug 12

Checksum Error during Reduce Phase hadoop-1.0.2

2012-08-10 Thread Pavan Kulkarni
.Thanks -- --With Regards Pavan Kulkarni

Setting number of parallel Reducers and Mappers for optimal performance

2012-08-10 Thread Pavan Kulkarni
is wrong.Also suggest a way to get the optimal performance by setting these parameters. Thanks. -- --With Regards Pavan Kulkarni

Re: Setting number of parallel Reducers and Mappers for optimal performance

2012-08-10 Thread Pavan Kulkarni
depending on how you want to slice/dice your slots. Arun On Aug 10, 2012, at 1:24 PM, Pavan Kulkarni wrote: Hi, I was trying to optimize Hadoop-1.0.2 performance by setting *mapred.tasktracker.map.tasks.maximum ,**mapred.tasktracker.reduce.tasks.maximum* such that the entire memory

Shifting to Java 7 . Is it good choice?

2012-07-17 Thread Pavan Kulkarni
Hi, I have to tweak a few classes and for this I needed few packages which are only present in Java 7 like java.nio.file , So I was wondering If I can shift my development environment of Hadoop to Java 7? Would this break anything ? Thanks -- --With Regards Pavan Kulkarni

Re: Shifting to Java 7 . Is it good choice?

2012-07-17 Thread Pavan Kulkarni
present in Java 7 like java.nio.file , So I was wondering If I can shift my development environment of Hadoop to Java 7? Would this break anything ? openjdk 7 works, but nio async file access is slower then traditional. -- --With Regards Pavan Kulkarni

debugging a class hadoop-1.0.2 ,log4j.properties

2012-07-12 Thread Pavan Kulkarni
in the TaskTracker log file. Any idea about how to debug a class? Thanks -- --With Regards Pavan Kulkarni

Re: Problem setting up 1st generation Hadoop-0.20 (ANT build) in Eclipse

2012-07-11 Thread Pavan Kulkarni
then, the error pops up. So to avoid this execute *xhost +local:all* in the terminal of the user who owns the X Display socket and then try issuing ant from root. It works fine. This does have some security concerns but this is the easiest workaround On Tue, Jul 10, 2012 at 5:50 PM, Pavan Kulkarni pavan.babu

Problem setting up 1st generation Hadoop-0.20 (ANT build) in Eclipse

2012-07-10 Thread Pavan Kulkarni
was how to build a binary tar file for hadoop-0.20 which still uses ANT. The wiki pages only have information for maven. Any help is highly appreciated.Thanks -- --With Regards Pavan Kulkarni

Re: Problem setting up 1st generation Hadoop-0.20 (ANT build) in Eclipse

2012-07-10 Thread Pavan Kulkarni
://github.com/apache/hadoop-common.git hadoop-1 $ cd hadoop-1; git checkout branch-1 $ ant eclipse (Now export this directory into Eclipse as a Java project) HTH. On Wed, Jul 11, 2012 at 12:00 AM, Pavan Kulkarni pavan.babu...@gmail.com wrote: Hi all, I am trying to setup hadoop 1st

Re: Problem setting up 1st generation Hadoop-0.20 (ANT build) in Eclipse

2012-07-10 Thread Pavan Kulkarni
the build.xml at the root of your checkout. On Wed, Jul 11, 2012 at 1:15 AM, Pavan Kulkarni pavan.babu...@gmail.com wrote: Thanks a lot Harsh.I could set it up without any errors. It would be great if you could provide me any pointers on how to build a binary distribution tar file

JAVA_HOME setup error in Hadoop-0.23.3 single node

2012-07-09 Thread Pavan Kulkarni
: Error: JAVA_HOME is not set and could not be found.* * * I have checked the settings and .bashrc file and everything seems to correct , also researched for the solution online but couldn't find much. So would really appreciate if anyone knows how to resolve this ?Thanks -- --With Regards Pavan

Re: JAVA_HOME setup error in Hadoop-0.23.3 single node

2012-07-09 Thread Pavan Kulkarni
I resolved this issue. Sorry for the inconvenience. On Mon, Jul 9, 2012 at 11:58 AM, Pavan Kulkarni pavan.babu...@gmail.comwrote: Hi all, I was trying to setup Hadoop-0.23.3 which is the 2nd generation release on single node. I configured all the files as specified and also set

[jira] [Created] (HADOOP-8575) No mapred-site.xml present in the configuration directory. This is very trivial but thought would be less confusing for a new user if it came packaged.

2012-07-06 Thread Pavan Kulkarni (JIRA)
Pavan Kulkarni created HADOOP-8575: -- Summary: No mapred-site.xml present in the configuration directory. This is very trivial but thought would be less confusing for a new user if it came packaged. Key: HADOOP-8575

Re: No mapred-site.xml in the hadoop-0.23.3 distribution

2012-07-06 Thread Pavan Kulkarni
don't know how quickly we will be able to get around to doing it. --Bobby Evans On 7/5/12 7:23 PM, Pavan Kulkarni pavan.babu...@gmail.com wrote: Hi, I downloaded the Hadoop-0.23.3 source and tweaked a few classes and when I built the binary distribution and untar'd it .I don't see

Re: No mapred-site.xml in the hadoop-0.23.3 distribution

2012-07-06 Thread Pavan Kulkarni
for that right now. Perhaps others on the list might know better then I do. On 7/6/12 12:05 PM, Pavan Kulkarni pavan.babu...@gmail.com wrote: Bobby, Thanks a lot for your clarification. Yes as you said it is just a template, but it may be quite confusing to new users while configuring. I have

No mapred-site.xml in the hadoop-0.23.3 distribution

2012-07-05 Thread Pavan Kulkarni
right? So I was just wondering if we are supposed to create the mapred-site.xml , or it doesn't exist at all? Thanks -- --With Regards Pavan Kulkarni

Re: Doubt regarding use of HTTP during Shuffle phase

2012-06-27 Thread Pavan Kulkarni
Yes you are correct, but we can use Lustre FS and it does scale right? I am new to this so please excuse if I am wrong in some assumptions.Thanks On Wed, Jun 27, 2012 at 9:40 AM, Owen O'Malley omal...@apache.org wrote: On Wed, Jun 27, 2012 at 9:33 AM, Pavan Kulkarni pavan.babu...@gmail.com

Re: Doubt regarding use of HTTP during Shuffle phase

2012-06-27 Thread Pavan Kulkarni
, Roman Shaposhnik r...@apache.org wrote: On Wed, Jun 27, 2012 at 9:44 AM, Pavan Kulkarni pavan.babu...@gmail.com wrote: Yes you are correct, but we can use Lustre FS and it does scale right? I am new to this so please excuse if I am wrong in some assumptions. You can use hybrid approaches

Re: Doubt regarding use of HTTP during Shuffle phase

2012-06-27 Thread Pavan Kulkarni
Oh.Thanks a lot Owen. I'll have a look into it. On Wed, Jun 27, 2012 at 10:21 AM, Owen O'Malley omal...@apache.org wrote: Pavan, This is a very big project. Look at the users of IFile.java. IFile is the format for storing the shuffle outputs. -- Owen -- --With Regards Pavan Kulkarni

Doubt regarding the classes to look for tweaking Reducer during Shuffle phase

2012-06-07 Thread Pavan Kulkarni
stated problem?* I went through couple of books but couldn't find much detailed information. Looking into the source code I felt it must be the OutputCollector Class. Please correct me if am wrong. Any help or pointers are highly appreciated.Thanks. -- --With Regards Pavan Kulkarni

Doubt regarding finding Databaode -- Block map.

2011-11-18 Thread Pavan Kulkarni
appreciated. Thanking You, -- With Regards Pavan Kulkarni